📖 Step 9: AI/LLM#272 / 350

Fine-tuning

📖One-line summary

Further training a pre-trained model on domain data to specialize it.

Further training an already-trained model on your own data so it fits your taste. Like retraining a generalist chef on your restaurant's menu.

Pre-trained model

Our data

↓

Domain-specialized model

Define a JSONL fine-tuning dataset format for internalizing customer-support reply tone, with guidance for 100 samples.

Draw a decision tree for choosing OpenAI fine-tuning vs. Anthropic fine-tuning vs. RAG.

Lay out a regression-test procedure for evaluating a fine-tuned model against the base model.

Try these prompts in your AI coding assistant!