Step 9: AI/LLM
Guardrails
📌 One-line summary
Safety checks placed around model inputs and outputs to keep behavior within policy.
💡 Easy explanation
Safety checks on input and output that keep the AI from going off-policy. Profanity filters and banned-topic blocks are both guardrails.
✨ Example
Safety checks on input and output:
🛡️ Mask PII in input
↓
🤖 LLM call
↓
🛡️ Filter forbidden words in output
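The three-step flow above can be sketched as a small pipeline. This is a minimal illustration, not a production filter: the regexes, the banned-word set, and the `call_llm` parameter are all hypothetical stand-ins (a real system would use a proper PII detector and load its keyword list from configuration).

```python
import re

# Hypothetical patterns for illustration only; real PII detection needs more.
EMAIL = re.compile(r"[\w.+-]+@[\w-]+\.[\w.]+")
PHONE = re.compile(r"\b\d{2,3}-\d{3,4}-\d{4}\b")
BANNED = {"internal-project-x", "secret"}  # in practice, load from env/config

def mask_pii(text: str) -> str:
    """Input guardrail: replace emails and phone numbers with placeholders."""
    text = EMAIL.sub("[EMAIL]", text)
    return PHONE.sub("[PHONE]", text)

def filter_output(text: str) -> str:
    """Output guardrail: refuse responses containing banned keywords."""
    lowered = text.lower()
    if any(word in lowered for word in BANNED):
        return "I can't share that."
    return text

def guarded_call(user_input: str, call_llm) -> str:
    """Wrap any LLM call (passed in as a function) with both guardrails."""
    return filter_output(call_llm(mask_pii(user_input)))
```

`call_llm` is any function taking a prompt and returning text, so the same wrapper works regardless of which model API sits in the middle.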
⚡ Vibe coding prompt examples
>_
Write an input guardrail that masks PII (emails, phone numbers, national IDs).
>_
Build a post-processing guardrail that blocks output containing internal keywords; load the keyword list from env.
>_
Write code that calls a small classifier model as a guardrail to detect prompt injection or jailbreak attempts.
Try these prompts in your AI coding assistant!
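For the third prompt, a classifier-based guardrail screens the prompt before it ever reaches the model. The sketch below is a stand-in: `score_injection` uses a keyword heuristic so the example stays self-contained, but in a real system it would call a small fine-tuned classifier model, and the phrase list and threshold are assumptions.

```python
# Phrases typical of injection/jailbreak attempts (illustrative, not exhaustive).
SUSPICIOUS = (
    "ignore previous instructions",
    "ignore all previous",
    "disregard your system prompt",
    "jailbreak",
)

def score_injection(prompt: str) -> float:
    """Return a pseudo-probability that the prompt is an injection attempt.
    A real guardrail would replace this with a classifier model's score."""
    lowered = prompt.lower()
    hits = sum(phrase in lowered for phrase in SUSPICIOUS)
    return min(1.0, hits / 2)

def injection_guardrail(prompt: str, threshold: float = 0.5) -> bool:
    """True if the prompt should be blocked before reaching the LLM."""
    return score_injection(prompt) >= threshold
```

Keeping the scorer behind a single function makes it easy to swap the heuristic for a model call later without touching the blocking logic.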