Categories: All - efficiency - frameworks - ai

by Eric Nic, 8 hours ago

LLMs Learning Path

This learning path covers large language models (LLMs) and their applications, including AI-driven chatbots such as medical QA bots and recommendation systems.

Business Sectors

Technology and Software Development

Manufacturing

Knowledge Management

Supply Chain Management

Human Resources and Talent Management

Research and Development

Healthcare

Education and Training

Regulatory Compliance

Customer Relationship Management (CRM)

Sales and Marketing

Finance and Banking

e-Business

Basic LLM Tasks

Content Generation and Correction

Information Extraction

Text-to-Text Transformation

Semantic Search

Sentiment Analysis

Content Personalization

Ethical and Bias Evaluation

Paraphrasing

Language Translation

Text Summarization

Conversational AI

Question Answering

LLMs Learning Path

Efficient LLMs

Efficient Inference
System-Level Inference Efficiency Optimization

vLLM, DeepSpeed-Inference

Algorithm-Level Inference Efficiency Optimization
Efficient Architecture
Long Context LLMs
Mixture of Experts (MoE)
Efficient Attention

Hardware-Assisted Attention

FlashAttention, vAttention

Learnable Pattern Strategies

HyperAttention

Fixed Pattern Strategies

Sparse Transformer, Longformer, Lightning Attention-2

Model Compression
Knowledge Distillation
Low-Rank Approximation
Parameter Pruning
Quantization

Quantization-Aware Training

Post-Training Quantization

Weight-Activation Co-Quantization

RPTQ, QLLM

Weight-Only Quantization

GPTQ, AWQ, SpQR

Efficient Fine-Tuning
Frameworks

Unsloth

MEFT (QLoRA, QA-LoRA, ...)
PEFT

Prompt Tuning

Prefix Tuning

Adapter-based Tuning

Low-Rank Adaptation (LoRA, DoRA)

AI Apps

Frameworks
Local

Gradio

Jan

Deployment

Anyscale

Hugging Face Inference Endpoints

Serving

vLLM

BentoML

Integration

LlamaIndex

Scaled Multi-Agents
Local AI Agents
Local Chatbots
LLM-Based Recommendation Chatbot
Medical QA ChatBot
Research Assistant (Talking Papers)

Generative AI Agents

Frameworks and Practices
RASA
LangChain / LangGraph
CrewAI
SmolAgents
Courses
HF Agents Course
Books
https://arxiv.org/pdf/2401.03428
https://www.arxiv.org/pdf/2504.01990

LLMs Adaptation

Prompt Engineering
RAG
20 Types of RAG Architectures
LangChain

Indexes

Weaviate

Faiss

Pinecone

Qdrant

Chroma

Prompt

Agent

Chains

Memory

Fine-Tuning (Transfer Learning, Strategies)
- Instruction Tuning
- Alignment Tuning
- Transfer Learning

Fundamentals

HF Transformers, PyTorch
HF Course: https://huggingface.co/docs/transformers/en/quicktour
Language Modeling and LLMs
https://arxiv.org/pdf/2302.08575
ChatGPT discussion: https://chatgpt.com/share/6819003f-e674-8005-af8b-11a7ef37bd70
https://arxiv.org/pdf/2303.18223
https://arxiv.org/pdf/2402.06853
https://arxiv.org/pdf/2303.05759
https://wandb.ai/madhana/Language-Models/reports/Language-Modeling-A-Beginner-s-Guide---VmlldzozMzk3NjI3
- Full Language Modeling
- Prefix Language Modeling
- Masked Language Modeling
- Unified Language Modeling
From Seq-to-Seq and RNN to Attention and Transformers
Papers
Lewis- Wolf
Vasilev
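
The language modeling objectives listed under Fundamentals (full, prefix, masked, unified) differ mainly in which positions each token is allowed to attend to. The sketch below is an illustration only, assuming the usual reading of those terms: full language modeling is causal (left to right), prefix language modeling is bidirectional over a prefix and causal afterwards, and masked language modeling is fully bidirectional. The function name `attention_mask` and its parameters are hypothetical, not taken from any library. Unified language modeling, as in UniLM, trains a single model while switching among masks like these.

```python
def attention_mask(seq_len, mode, prefix_len=0):
    """Return a seq_len x seq_len 0/1 matrix (hypothetical helper).

    mask[i][j] == 1 means position i may attend to position j.
    """
    if mode == "full":
        # Causal: each token sees itself and earlier tokens only.
        return [[int(j <= i) for j in range(seq_len)] for i in range(seq_len)]
    if mode == "prefix":
        # Bidirectional over the first prefix_len tokens, causal afterwards.
        return [[int(j < prefix_len or j <= i) for j in range(seq_len)]
                for i in range(seq_len)]
    if mode == "masked":
        # Fully bidirectional; the model predicts tokens hidden in the input.
        return [[1] * seq_len for _ in range(seq_len)]
    raise ValueError(f"unknown mode: {mode}")


if __name__ == "__main__":
    # Print the three mask patterns for a 4-token sequence with a 2-token prefix.
    for mode in ("full", "prefix", "masked"):
        print(mode)
        for row in attention_mask(4, mode, prefix_len=2):
            print(" ", row)
```

Only the mask pattern is shown here; the training loss, tokenization, and the choice of which tokens to hide for masked language modeling are left out.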