Available OCR models
| Model | Parameters | Strengths |
|---|---|---|
deepseek-ai/DeepSeek-OCR | — | High accuracy on complex layouts, tables, handwriting |
allenai/olmOCR-2-7B-1025 | 2.7B | Fast, lightweight, good for bulk processing |
PaddlePaddle/PaddleOCR-VL-0.9B | 0.9B | Ultra-lightweight, edge-deployable |
Extract text from an image
Receipt parsing with structured output
Extract specific fields from a receipt photo:Next steps
- AI Vision API — analyze images beyond OCR: scene understanding, chart reading, visual Q&A.
- Extract structured data — combine OCR output with schema-based extraction for production pipelines.
- Model catalog — browse all available vision and OCR models.