openai

gpt-4.1-mini-2025-04-14

Context

128K

Input

text, image

Output

text

Tool calling

Supported

About this model

GPT-4.1 Mini is a distilled 400B-parameter version of GPT-4.1 optimized for cost-efficient deployment. It features the same 256-expert mixture-of-experts (MoE) architecture with selective expert activation and delivers 80% of GPT-4.1's capability at 30% of the inference cost.

Capabilities

text

Available through the unified API

image

Available through the unified API

Quick start

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "f76daa821ecd4e968d22ace905d749aa",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
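
The same request can be sketched in Python using only the standard library. This is a minimal sketch, not an official client: the endpoint, headers, and model identifier are copied from the curl command above, while the helper names `build_request` and `send_chat` are hypothetical, and the response shape read in `send_chat` (`choices[0].message.content`) is an assumption based on the OpenAI-style endpoint path.

```python
import json
import urllib.request

API_URL = "https://api.oneinfer.ai/v1/chat/completions"
MODEL_ID = "f76daa821ecd4e968d22ace905d749aa"  # model identifier from the curl example


def build_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the same POST request that the curl command sends (hypothetical helper)."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send_chat(api_key: str, prompt: str, timeout: float = 30.0) -> str:
    """Send the request and return the reply text (hypothetical helper).

    Assumption: the response follows the OpenAI chat-completions shape,
    with the reply under choices[0].message.content.
    """
    request = build_request(api_key, prompt)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        body = json.load(response)
    return body["choices"][0]["message"]["content"]
```

To try it, read the key from the environment and call `send_chat(os.environ["ONEINFER_API_KEY"], "Hello!")`. Keeping request construction separate from sending makes the payload easy to inspect without a network call.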

Providers

Available routing options for this model through OneInfer.

openai

f76daa821ecd4e968d22ace905d749aa

Available

Input

$0.400 / 1M

Output

$1.600 / 1M

Routing

OneInfer optimized

Pricing

Current OneInfer pricing for this model.

Usage          Price
Input tokens   $0.400 / 1M
Output tokens  $1.600 / 1M
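
To make the per-token rates concrete, the sketch below estimates the cost of a single request from its token counts. The rates are the ones listed above; the function name `estimate_cost` and the example token counts are illustrative assumptions, and the sketch ignores any discounts, minimums, or rounding that actual billing may apply. For 10,000 input tokens and 2,000 output tokens the arithmetic is 10,000 × $0.400 / 1M = $0.0040 plus 2,000 × $1.600 / 1M = $0.0032, for a total of $0.0072.

```python
INPUT_PRICE_PER_MILLION = 0.400   # USD per 1M input tokens, from the pricing above
OUTPUT_PRICE_PER_MILLION = 1.600  # USD per 1M output tokens, from the pricing above


def estimate_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the USD cost of one request from its token counts (hypothetical helper)."""
    input_cost = input_tokens * INPUT_PRICE_PER_MILLION / 1_000_000
    output_cost = output_tokens * OUTPUT_PRICE_PER_MILLION / 1_000_000
    return input_cost + output_cost


# Example token counts are illustrative, not measured usage.
example_cost = estimate_cost(input_tokens=10_000, output_tokens=2_000)
print(f"${example_cost:.4f}")
```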

Performance

Published evaluation results associated with this model.

Enterprise Applications

Business Analyst Sim       88.2
Financial Report Accuracy  90.7
Legal Contract Review      83.9

Core Capabilities

MMLU        82.4
TruthfulQA  84.1
GSM8K       92.7

Tool Integration

API Call Success          92.3
Data Extraction Accuracy  89.6
Spreadsheet Automation    86.4
