openai

gpt-4.1-nano-2025-04-14

Context: 128K
Input: text, image
Output: text
Tool calling: Supported

About this model

gpt-4.1-nano is OpenAI's most compact and cost-efficient model in the GPT-4.1 series, released in April 2025. It is designed for high-volume, latency-sensitive applications that do not need the full capabilities of larger models. It offers strong reasoning for its size and accepts both text and image inputs.

Capabilities

text: Available through the unified API
image: Available through the unified API
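
Since the model accepts image input, a request body can carry an image next to the text prompt. The sketch below builds such a body in Python. It assumes the endpoint accepts OpenAI-style content parts (`type: "text"` and `type: "image_url"` with a base64 data URL), which this page does not confirm; `build_image_payload` is a hypothetical helper name, and the model identifier is copied from the Quick start request.

```python
import base64
import json

# Model identifier copied from the Quick start request on this page.
MODEL_ID = "b77daa107bc84db6af20b3f817962bf3"


def build_image_payload(prompt: str, image_bytes: bytes, mime_type: str = "image/png") -> dict:
    """Build a chat request body with one text part and one inline image part.

    Assumption: the endpoint accepts OpenAI-style content parts
    ("text" and "image_url"); this page does not confirm that format.
    """
    encoded = base64.b64encode(image_bytes).decode("ascii")
    data_url = f"data:{mime_type};base64,{encoded}"
    return {
        "model": MODEL_ID,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": prompt},
                    {"type": "image_url", "image_url": {"url": data_url}},
                ],
            }
        ],
    }


if __name__ == "__main__":
    # Placeholder bytes stand in for a real file read with open(path, "rb").
    payload = build_image_payload("Describe this image.", b"placeholder image bytes")
    print(json.dumps(payload, indent=2))
```

The resulting dictionary would be serialized as the JSON body of a POST to the chat completions endpoint shown in the Quick start.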

Quick start

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "b77daa107bc84db6af20b3f817962bf3",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
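
The same request can be sketched in Python using only the standard library. The URL, headers, model identifier, and message come from the curl command above; the helper names `build_request` and `send` are hypothetical, and the shape of the response is not documented on this page, so the sketch returns the decoded JSON without interpreting it.

```python
import json
import os
import urllib.request

API_URL = "https://api.oneinfer.ai/v1/chat/completions"
MODEL_ID = "b77daa107bc84db6af20b3f817962bf3"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build the POST request that mirrors the curl command (no network access)."""
    body = json.dumps(
        {"model": MODEL_ID, "messages": [{"role": "user", "content": prompt}]}
    ).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(prompt: str) -> dict:
    """Send the prompt and return the decoded JSON response (makes a network call)."""
    request = build_request(prompt, os.environ["ONEINFER_API_KEY"])
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.load(response)
```

With `ONEINFER_API_KEY` set in the environment, `send("Hello!")` would issue the request. Keeping `build_request` separate makes the payload easy to inspect without calling the API.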

Providers

Available routing options for this model through OneInfer.

Provider	Model ID	Status	Input	Output	Routing
openai	b77daa107bc84db6af20b3f817962bf3	Available	$0.100 / 1M	$0.400 / 1M	OneInfer optimized

Pricing

Current OneInfer pricing for this model.

Usage	Price
Input tokens	$0.100 / 1M
Output tokens	$0.400 / 1M
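
As a worked example of these rates, the sketch below estimates the cost of a workload from its token counts. It assumes that "1M" means one million tokens and that billing is strictly linear with no minimums, rounding, or discounts, none of which this page states; `estimate_cost` is a hypothetical helper name.

```python
from decimal import Decimal

# Rates from the pricing table, in US dollars per one million tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.100")
OUTPUT_PRICE_PER_MILLION = Decimal("0.400")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in dollars, assuming purely linear billing."""
    input_cost = INPUT_PRICE_PER_MILLION * input_tokens / TOKENS_PER_MILLION
    output_cost = OUTPUT_PRICE_PER_MILLION * output_tokens / TOKENS_PER_MILLION
    return input_cost + output_cost


if __name__ == "__main__":
    # 2,000,000 input tokens cost $0.20 and 500,000 output tokens cost $0.20.
    print(estimate_cost(2_000_000, 500_000))
```

Under these assumptions, a job with 2,000,000 input tokens and 500,000 output tokens comes to $0.40 in total. `Decimal` is used so that the arithmetic on currency amounts stays exact.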

Performance

Published evaluation results associated with this model.

Benchmark	Score
MMLU	78.5
GPQA Diamond	59.2
HumanEval	72.3
