Recommended models
| Model | Context | Best for |
|---|---|---|
| DeepSeek-V3 | 128K tokens | General-purpose summaries, cost-effective |
| Gemini 2.5 Flash | 1M tokens | Very long documents, research papers |
Single-document summary
Structured summary with key points
Long documents with Gemini Flash
For documents exceeding 128K tokens — full research papers, legal contracts — use Gemini 2.5 Flash with its 1M-token context window:Batch summarization
Tips
- Lower temperature (0.2–0.3) produces more faithful summaries.
- DeepSeek-V3 is the best cost-to-quality ratio for most summarization tasks.
- Gemini 2.5 Flash is the choice when the source document exceeds 128K tokens.
Next steps
- Structured output with Vercel AI SDK
- Chat completions reference
- AI Sentiment Analysis — pair summarization with sentiment scoring