Available Qwen models
| Model | Category | Context | Strengths |
|---|---|---|---|
| Qwen3-Max | Chat | 128K | Flagship reasoning and instruction following |
| Qwen3.5-397B-A17B | Chat | 128K | MoE architecture, high throughput |
| Qwen3-Coder-480B-A35B-Instruct-Turbo | Code | 256K | Code generation, debugging, refactoring |
| Qwen3-VL-235B-A22B-Instruct | Vision | 128K | Image understanding, OCR, diagram analysis |
| Qwen3-TTS | TTS | — | Natural-sounding speech synthesis |
Chat — Qwen3-Max
Code — Qwen3-Coder
Purpose-built for code generation with 256K context:Vision — Qwen3-VL
Analyze images, extract text, understand diagrams:TTS — Qwen3-TTS
Choosing the right Qwen model
| Task | Model | Why |
|---|---|---|
| General chat, reasoning | Qwen3-Max | Best overall quality |
| High-throughput chat | Qwen3.5-397B-A17B | MoE — fast and cheap per token |
| Code generation, review | Qwen3-Coder-480B | 256K context, code-specialized |
| Image analysis, OCR | Qwen3-VL-235B | Vision-language understanding |
| Speech synthesis | Qwen3-TTS | Natural TTS output |
Tips
- Qwen3-Max is the safe default for most chat tasks.
- Qwen3.5 MoE activates only 17B params per token — use it when you need speed at scale.
- Qwen3-Coder handles 256K context for cross-file refactoring.
- Qwen3-VL supports multiple images in a single request.