Try GPT 4.1 nano in the Workbench
Run this model interactively, tune parameters, and compare outputs.
gpt-4-1-nano-2025-04-14
GPT 4.1 Nano is an LLM designed for tasks requiring low latency such as classification or autocompletion.
It excels in delivering fast responses with minimal cost while maintaining impressive capabilities, featuring the full 1 million token context window despite its lightweight nature.
Some other noteworthy use cases of GPT 4.1 Nano include high-volume operations, content tagging, and powering real-time AI agents where speed and efficiency are critical.
| Metric | Value |
|---|---|
| Parameter Count | Unknown |
| Mixture of Experts | Unknown |
| Context Length | 1,047,576 tokens |
| Multilingual | Yes |
| Quantized* | Unknown |
Example request
- Minimal
- Basic parameters
- All parameters
Fetch model details
The models endpoint returns the full model object, including itsjson_request_schema.