Try Gemini 3.1 Flash-Lite in the Workbench
Run this model interactively, tune parameters, and compare outputs.
gemini-3-1-flash-lite-preview
Gemini 3.1 Flash-Lite Preview is Google’s fastest and most cost-efficient Gemini 3.1 model for high-volume workloads. It is optimized for low-latency, large-scale tasks where responsiveness and cost control are critical.
Some other noteworthy features of Gemini 3.1 Flash-Lite Preview include configurable thinking levels, strong instruction-following for production pipelines, and multimodal support suitable for translation, moderation, UI generation, dashboards, and simulation-style workflows.
| Metric | Value |
|---|---|
| Parameter Count | Unknown |
| Mixture of Experts | Unknown |
| Context Length | 1,048,576 tokens |
| Multilingual | Yes |
| Quantized* | Unknown |
Example request
- Minimal
- Basic parameters
- All parameters
Fetch model details
The models endpoint returns the full model object, including itsjson_request_schema.