Shaw TalebiFine-tuning LLMs on Human Feedback (RLHF + DPO)Preference tuning with Python codeFeb 26A response icon2Feb 26A response icon2
Shaw TalebiHow to Train LLMs to “Think” (o1 & DeepSeek-R1)Advanced reasoning models explainedFeb 12A response icon4Feb 12A response icon4
Shaw TalebiI Trained FLUX.1 on My Face (and how you can too)Easy fine-tuning guide with Python and ReplicateFeb 8A response icon2Feb 8A response icon2
InTDS ArchivebyShaw TalebiFine-tuning Multimodal Embedding ModelsAdapting CLIP to YouTube Data (with Python Code)Jan 31A response icon1Jan 31A response icon1
Shaw TalebiFine-Tuning Text Embeddings For Domain-Specific SearchAn overview with Python codeJan 24A response icon6Jan 24A response icon6
InTDS ArchivebyShaw TalebiFine-Tuning BERT for Text ClassificationA hackable example with Python codeOct 17, 2024A response icon2Oct 17, 2024A response icon2
InTDS ArchivebyShaw TalebiLLM Fine-tuning — FAQsAnswering the most common questions I received as an AI consultantSep 26, 2024A response icon2Sep 26, 2024A response icon2
InTDS ArchivebyShaw TalebiCompressing Large Language Models (LLMs)Make LLMs 10X smaller without sacrificing performanceAug 30, 2024A response icon5Aug 30, 2024A response icon5
InTDS ArchivebyShaw TalebiLocal LLM Fine-Tuning on Mac (M1 16GB)Beginner-friendly Python code walkthrough (ft. MLX)Aug 1, 2024A response icon2Aug 1, 2024A response icon2
InTDS ArchivebyShaw TalebiQLoRA — How to Fine-Tune an LLM on a Single GPUAn introduction with Python example code (ft. Mistral-7b)Feb 22, 2024A response icon10Feb 22, 2024A response icon10
InTDS ArchivebyShaw TalebiHow to Build an AI Assistant with OpenAI + PythonStep-by-step guide on using the Assistants API & Fine-tuningFeb 8, 2024A response icon5Feb 8, 2024A response icon5
InTDS ArchivebyShaw TalebiFine-Tuning Large Language Models (LLMs)A conceptual overview with example Python codeSep 11, 2023A response icon5Sep 11, 2023A response icon5