openai

gpt-5-mini-2025-08-07

Context

256K

Input

text

Output

text

Tool calling

Supported

About this model

OpenAI's most compact and cost-effective GPT-5 model, released August 7, 2025. Designed for high-volume, latency-sensitive applications where the full capabilities of larger models are not required, while maintaining strong performance on common tasks.

Capabilities

text

Available through the unified API

Quick start

View API docs
curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "e98fe549bd2243a2bf9d987b93b12360",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'

Providers

Available routing options for this model through OneInfer.

openai

e98fe549bd2243a2bf9d987b93b12360

Available

Input

$0.250 / 1M

Output

$2.000 / 1M

Routing

OneInfer optimized

Pricing

Current OneInfer pricing for this model.

UsagePrice
Input tokens$0.250 / 1M
Output tokens$2.000 / 1M

Performance

Published evaluation results associated with this model.

No benchmark data is listed for this model.

API example

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "e98fe549bd2243a2bf9d987b93b12360",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'