📖 Step 9: AI/LLM#260 / 350

Retrieval-Augmented Generation

Retrieval-Augmented Generation (RAG)

📖One-line summary

A pattern where the model retrieves external documents and answers using them as context.

An approach where the AI searches relevant docs first, drops them into context, and answers with them. An "open-book exam" pattern.

❓Question

🔎Retrieve docs

📄Inject context

💬Answer with citations

Design a basic RAG pipeline that indexes the internal wiki into a vector DB and answers queries — chunking, embedding, retrieval, answering.

Write a system prompt + post-processing logic that requires citations on every RAG answer.

When RAG quality is bad, what do you check, in order? Write a step-by-step diagnosis checklist (chunk size, embedding model, reranker, etc.).

Try these prompts in your AI coding assistant!