novita
google/gemma-4-31b-it
Context
256K
Input
text, image
Output
text
Tool calling
Supported
About this model
Gemma 4 31B is a state-of-the-art, open-weights multimodal dense model from Google DeepMind. It features a 256K token context window, excels at reasoning, coding, and agentic workflows, and supports interleaved text and image inputs.
Capabilities
text
Available through the unified API
image
Available through the unified API
Quick start
View API docscurl https://api.oneinfer.ai/v1/chat/completions \
-H "Authorization: Bearer $ONEINFER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "d4f78f062753458395a727c93348ec67",
"messages": [
{ "role": "user", "content": "Hello!" }
]
}'Providers
Available routing options for this model through OneInfer.
novita
d4f78f062753458395a727c93348ec67
Input
$0.140 / 1M
Output
$0.400 / 1M
Routing
OneInfer optimized
Pricing
Current OneInfer pricing for this model.
| Usage | Price |
|---|---|
| Input tokens | $0.140 / 1M |
| Output tokens | $0.400 / 1M |
Performance
Published evaluation results associated with this model.
Reasoning
MMLU Pro85.2
GPQA Diamond85.7
AIME 202689.2
Vision
MMMU Pro76.9
MATH-Vision85.6
Coding
LiveCodeBench v680
API example
curl https://api.oneinfer.ai/v1/chat/completions \
-H "Authorization: Bearer $ONEINFER_API_KEY" \
-H "Content-Type: application/json" \
-d '{
"model": "d4f78f062753458395a727c93348ec67",
"messages": [
{ "role": "user", "content": "Hello!" }
]
}'