novita

google/gemma-4-31b-it

Context

256K

Input

text, image

Output

text

Tool calling

Supported

About this model

Gemma 4 31B is a state-of-the-art, open-weights multimodal dense model from Google DeepMind. It features a 256K token context window, excels at reasoning, coding, and agentic workflows, and supports interleaved text and image inputs.

Capabilities

text

Available through the unified API

image

Available through the unified API

Quick start

View API docs

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "d4f78f062753458395a727c93348ec67",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'

Providers

Available routing options for this model through OneInfer.

novita

d4f78f062753458395a727c93348ec67

Available

Input

$0.140 / 1M

Output

$0.400 / 1M

Routing

OneInfer optimized

Pricing

Current OneInfer pricing for this model.

Usage	Price
Input tokens	$0.140 / 1M
Output tokens	$0.400 / 1M

Performance

Published evaluation results associated with this model.

Reasoning

MMLU Pro85.2

GPQA Diamond85.7

AIME 202689.2

Vision

MMMU Pro76.9

MATH-Vision85.6

Coding

LiveCodeBench v680

API example

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "d4f78f062753458395a727c93348ec67",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'