Try Gemini 2.5 Flash in the Workbench
Run this model interactively, tune parameters, and compare outputs.
gemini-2-5-flash
Gemini 2.5 Flash is a multimodal LLM designed for fast, cost-effective reasoning across text, images, audio, and video.
It excels in low-latency, high-volume tasks that require rapid processing with strong reasoning abilities, making it suitable for general-purpose applications where speed and versatility are essential. Its main strengths include an exceptionally long context window (up to 1 million tokens), native support for multiple modalities, and robust multilingual capabilities.
Some other noteworthy features of Gemini 2.5 Flash include deep domain knowledge in science, mathematics, and code, as well as support for agentic use cases and the ability to handle large-scale processing with efficient performance.
| Metric | Value |
|---|---|
| Parameter Count | Unknown |
| Mixture of Experts | Unknown |
| Context Length | 1,048,576 tokens |
| Multilingual | Yes |
| Quantized* | Unknown |
Example request
- Minimal
- Basic parameters
- All parameters
Fetch model details
The models endpoint returns the full model object, including itsjson_request_schema.