openai
gpt-4.1-nano-2025-04-14
Context
128K
Input
text, image
Output
text
Tool calling
Supported
About this model
OpenAI's most compact and cost-efficient model in the GPT-4.1 series, released in April 2025. Designed for high-volume, latency-sensitive applications where the full capabilities of larger models are not required. It offers strong reasoning for its size and supports multimodal text and image inputs.
Capabilities
text
Available through the unified API
image
Available through the unified API
Quick start
View API docscurl https://api.oneinfer.ai/v1/chat/completions \
-H "Authorization: Bearer $ONEINFER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "b77daa107bc84db6af20b3f817962bf3",
"messages": [
{ "role": "user", "content": "Hello!" }
]
}'Providers
Available routing options for this model through OneInfer.
openai
b77daa107bc84db6af20b3f817962bf3
Input
$0.100 / 1M
Output
$0.400 / 1M
Routing
OneInfer optimized
Pricing
Current OneInfer pricing for this model.
| Usage | Price |
|---|---|
| Input tokens | $0.100 / 1M |
| Output tokens | $0.400 / 1M |
Performance
Published evaluation results associated with this model.
MMLU
score78.5
GPQA Diamond
score59.2
HumanEval
score72.3
API example
curl https://api.oneinfer.ai/v1/chat/completions \
-H "Authorization: Bearer $ONEINFER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "b77daa107bc84db6af20b3f817962bf3",
"messages": [
{ "role": "user", "content": "Hello!" }
]
}'