Available vision models
| Model | Context | Strengths |
|---|---|---|
Qwen/Qwen3-VL-235B-A22B-Instruct | 128K | Strongest visual reasoning, chart/table extraction, multilingual |
meta-llama/Llama-3.2-90B-Vision-Instruct | 128K | Strong general vision, good at spatial reasoning |
Analyze an image
Read charts and graphs
Pass a chart image and ask the model to extract data points. Works with bar charts, line graphs, tables, and diagrams. Usebase64 encoding for local files:
Next steps
- AI OCR and document extraction — purpose-built models for text extraction from documents and receipts.
- Extract structured data — combine vision output with schemas for production data pipelines.
- Model catalog — browse all available vision models and pricing.