openai
gpt-4.1-mini-2025-04-14
Context: 128K
Input: text, image
Output: text
Tool calling: Supported
About this model
GPT-4.1 Mini is a distilled 400B-parameter version of GPT-4.1, optimized for cost-efficient deployment. It uses the same 256-expert mixture-of-experts (MoE) architecture with selective expert activation, delivering 80% of GPT-4.1's capability at 30% of the inference cost.
Capabilities
text: Available through the unified API
image: Available through the unified API
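Since image input is listed alongside text, a request body carrying both might look like the sketch below. This page does not document the image message format, so the content-part layout (`type`, `text`, `image_url`) is an assumption borrowed from the OpenAI-style chat completions convention that the `/v1/chat/completions` path suggests. `build_image_payload` and the example URL are hypothetical names used for illustration only.

```python
import json

# Model identifier as shown in this listing's request examples.
MODEL_ID = "f76daa821ecd4e968d22ace905d749aa"


def build_image_payload(question: str, image_url: str) -> dict:
    """Build a chat payload with one text part and one image part.

    ASSUMPTION: the content-part schema below follows the OpenAI-style
    convention; this page does not confirm it for the unified API.
    """
    return {
        "model": MODEL_ID,
        "messages": [
            {
                "role": "user",
                "content": [
                    {"type": "text", "text": question},
                    {"type": "image_url", "image_url": {"url": image_url}},
                ],
            }
        ],
    }


# Hypothetical image URL, used only to show the resulting JSON shape.
payload = build_image_payload(
    "What is in this image?", "https://example.com/cat.png"
)
print(json.dumps(payload, indent=2))
```

If the API accepts this shape, the printed JSON can be sent as the request body in place of the text-only payload; check the API docs before relying on it.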
Quick start
curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "f76daa821ecd4e968d22ace905d749aa",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
Providers
Available routing options for this model through OneInfer.
openai
f76daa821ecd4e968d22ace905d749aa
Input: $0.400 / 1M tokens
Output: $1.600 / 1M tokens
Routing: OneInfer optimized
Pricing
Current OneInfer pricing for this model.
| Usage | Price |
|---|---|
| Input tokens | $0.400 / 1M |
| Output tokens | $1.600 / 1M |
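To make the per-million rates concrete, the following sketch estimates the cost of a single request from its token counts. The two rates are taken from the table above; `estimate_cost` is a hypothetical helper, and the token counts in the usage line are made-up illustration values. It assumes billing is strictly linear in tokens, with no minimums, rounding, or other charges.

```python
from decimal import Decimal

# Rates from the pricing table above, in US dollars per 1M tokens.
INPUT_PRICE_PER_MILLION = Decimal("0.400")
OUTPUT_PRICE_PER_MILLION = Decimal("1.600")
TOKENS_PER_MILLION = Decimal(1_000_000)


def estimate_cost(input_tokens: int, output_tokens: int) -> Decimal:
    """Return the estimated cost in dollars for one request.

    ASSUMPTION: cost scales linearly with token counts.
    """
    input_cost = Decimal(input_tokens) * INPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    output_cost = Decimal(output_tokens) * OUTPUT_PRICE_PER_MILLION / TOKENS_PER_MILLION
    return input_cost + output_cost


# Illustrative request: 10,000 input tokens and 2,000 output tokens.
print(estimate_cost(10_000, 2_000))
```

For that illustrative request, the input side costs $0.004 and the output side $0.0032, for a total of $0.0072. `Decimal` is used instead of floats so that small per-request amounts sum without binary rounding error.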
Performance
Published evaluation results associated with this model.
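Enterprise Applications
| Benchmark | Score |
|---|---|
| Business Analyst Sim | 88.2 |
| Financial Report Accuracy | 90.7 |
| Legal Contract Review | 83.9 |
Core Capabilities
| Benchmark | Score |
|---|---|
| MMLU | 82.4 |
| TruthfulQA | 84.1 |
| GSM8K | 92.7 |
Tool Integration
| Benchmark | Score |
|---|---|
| API Call Success | 92.3 |
| Data Extraction Accuracy | 89.6 |
| Spreadsheet Automation | 86.4 |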
API example
curl https://api.oneinfer.ai/v1/chat/completions \
-H "Authorization: Bearer $ONEINFER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "f76daa821ecd4e968d22ace905d749aa",
"messages": [
{ "role": "user", "content": "Hello!" }
]
}'
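The same request can be issued from Python using only the standard library. This is a minimal sketch that mirrors the curl call above: the URL, headers, model identifier, and message come from that example, while `build_request` is a hypothetical helper name. The response schema is not shown on this page, so the script prints the returned JSON as-is rather than assuming any particular fields.

```python
import json
import os
import urllib.request

# Endpoint and model identifier copied from the curl example above.
API_URL = "https://api.oneinfer.ai/v1/chat/completions"
MODEL_ID = "f76daa821ecd4e968d22ace905d749aa"


def build_request(prompt: str, api_key: str) -> urllib.request.Request:
    """Build the same POST request that the curl example sends."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


if __name__ == "__main__":
    api_key = os.environ.get("ONEINFER_API_KEY")
    if api_key is None:
        print("Set ONEINFER_API_KEY to send the request.")
    else:
        request = build_request("Hello!", api_key)
        with urllib.request.urlopen(request, timeout=30) as response:
            # Response fields are not documented here, so dump the raw JSON.
            print(json.dumps(json.load(response), indent=2))
```

Building the request separately from sending it keeps the payload easy to inspect or log before any network call is made.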