Model
| Model | Parameters | Output |
|---|---|---|
| meta-llama/Llama-Guard-4-12B | 12B | safe or unsafe + category labels |
Classify a single message
The model responds with `safe`, or with `unsafe` followed by the violated category on the next line (e.g., `unsafe\nS1` for violence-related content).
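The response format described above can be turned into a structured value before your app acts on it. The following is a minimal sketch, not the platform's own client code: `ModerationResult` and `parse_guard_output` are hypothetical names, and the parser assumes that multiple category codes, if present, are comma-separated on the second line.

```python
from __future__ import annotations

from dataclasses import dataclass, field


@dataclass
class ModerationResult:
    """Parsed verdict from a Llama Guard style reply (hypothetical helper type)."""

    safe: bool
    categories: list[str] = field(default_factory=list)


def parse_guard_output(raw: str) -> ModerationResult:
    """Turn raw model text such as 'unsafe\\nS1' into a structured result."""
    # Drop blank lines and surrounding whitespace so stray newlines do not matter.
    lines = [line.strip() for line in raw.strip().splitlines() if line.strip()]
    if not lines:
        raise ValueError("empty moderation response")

    verdict = lines[0].lower()
    if verdict == "safe":
        return ModerationResult(safe=True)
    if verdict == "unsafe":
        # Assumption: category codes sit on the second line, comma-separated.
        codes = lines[1].split(",") if len(lines) > 1 else []
        return ModerationResult(
            safe=False,
            categories=[code.strip() for code in codes if code.strip()],
        )
    # Anything else is unexpected; surface it instead of guessing.
    raise ValueError(f"unexpected verdict: {lines[0]!r}")
```

Raising on an unrecognized verdict, rather than defaulting to safe, keeps a malformed reply from silently passing moderation.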
Moderation middleware
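One possible shape for such a function is sketched here. It takes the model call as a parameter, so the sketch makes no assumption about which client library you use: `Classifier`, `ContentBlocked`, `make_moderator`, and `fake_classify` are all hypothetical names, and the stub stands in for a real request to meta-llama/Llama-Guard-4-12B.

```python
from __future__ import annotations

from typing import Callable

# Hypothetical signature: takes the user's text and returns the raw model
# reply, for example "safe" or "unsafe\nS1".
Classifier = Callable[[str], str]


class ContentBlocked(Exception):
    """Raised when the classifier does not mark a message as safe."""

    def __init__(self, categories: list[str]):
        label = ", ".join(categories) or "uncategorized"
        super().__init__(f"blocked by moderation: {label}")
        self.categories = categories


def make_moderator(classify: Classifier) -> Callable[[str], str]:
    """Return a function that passes safe text through and raises otherwise."""

    def moderate(text: str) -> str:
        reply = classify(text).strip()
        verdict, _, rest = reply.partition("\n")
        if verdict.strip().lower() == "safe":
            return text
        # Fail closed: anything other than an explicit "safe" is blocked.
        # Assumption: category codes are comma-separated on the second line.
        categories = [code.strip() for code in rest.split(",") if code.strip()]
        raise ContentBlocked(categories)

    return moderate


def fake_classify(text: str) -> str:
    """Stub classifier used only for this example; replace with a model call."""
    return "unsafe\nS1" if "attack" in text else "safe"


moderate = make_moderator(fake_classify)
print(moderate("hello there"))  # prints: hello there
```

Injecting the classifier keeps the moderation logic testable without network access, and raising an exception lets a web framework's error handler translate a block into whatever response your app prefers.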
Build a reusable moderation function for your app so that every entry point runs the same check.
Next steps
- Route between AI models — run moderation with Llama Guard, then route safe content to a generation model.
- Build an AI SaaS backend — full production backend with moderation, chat, and structured output.
- Model catalog — browse all available models.