Welcome to CHAT GI-IADS’s documentation!
- Introduction to the documentation
- Overview of Large Language Models
- Applications of Large Language Models
- The Predecessors of Large Language Models
- Rule-Based Systems
- Statistical Models
- Machine Learning in NLP
- Word Embeddings in NLP
- Neural Networks in NLP
- The Emergence of Large Language Models
- Rise of Transformer-Based Models
- Breakthroughs with BERT and GPT
- Recent Advances
- Understanding the Architecture of Transformers
- Applications of LLMs
- Synthetic Data Generation & Data preprocessing
- Synthetic data generation for industrial applications
- Techniques for generating synthetic data
- Drawing Numbers from a Distribution
- Agent-based Modeling (ABM):
- Prompts:
- Data preprocessing for LLMs
- Data: What’s It All About?
- Data Preprocessing and LLMs
- Data cleaning
- Data cleaning methods :
- 1. Handling missing values
- 2. Noise reduction
- 3. Consistency checks
- 4. Deduplication
- Text cleaning and normalization
- Removing noise and normalization
- Tokenization and word embedding
- 1.Tokenization
- 2.Word Embedding
- 1. One-Hot Encoding
- 2. Word2Vec
- 3. Bag of Words (BOW)
- 4. GloVe (Global Vectors for Word Representation)
- 5. FastText
- 6. Papers related to Data cleaning
- Computational Challenges in Training Large Language Models
- Fine-tuning techniques
- How to Fine-tune a Large Language Model (LLM)
- Instruction Fine-Tuning :
- Catasrophic Forgetting :
- How to prevent Catastrophic Forgetting :
- Instruction Multi-Fine-Tuning :
- PEFT
- Why PEFT?
- High performance on consumer hardware
- Low-Rank Adaptation (LoRA)
- Why LoRA?
- Implementation in Transformers
- Qlora
- Implementing QLoRA
- Putting everything together
- Benchmarking and Evaluation
- Prompt engineering
- Retrieval-Augmented Generation (RAG)
- langchain
- Reinforcement Learning from Human Feedback
- Mistral 7B:Use and fine-tune