Use this file to discover all available pages before exploring further.
Try Gemini 3.1 Flash-Lite in the Workbench
Run this model interactively, tune parameters, and compare outputs.
Model ID:gemini-3-1-flash-lite-previewGemini 3.1 Flash-Lite Preview is Google’s fastest and most cost-efficient Gemini 3.1 model for high-volume workloads. It is optimized for low-latency, large-scale tasks where responsiveness and cost control are critical.Some other noteworthy features of Gemini 3.1 Flash-Lite Preview include configurable thinking levels, strong instruction-following for production pipelines, and multimodal support suitable for translation, moderation, UI generation, dashboards, and simulation-style workflows.
Metric
Value
Parameter Count
Unknown
Mixture of Experts
Unknown
Context Length
1,048,576 tokens
Multilingual
Yes
Quantized*
Unknown
*Quantization is specific to the inference provider and the model may be offered with different quantization levels by other providers.