groq

meta-llama/llama-4-scout-17b-16e-instruct

Context

128K

Input

text

Output

text

Tool calling

Supported

About this model

Llama-4-Scout-17B-16E is Meta's efficiency-optimized MoE model featuring 16 experts with ~2B active parameters per token. Designed for exploratory tasks with enhanced web navigation, research synthesis, and information discovery capabilities at low resource cost.

Capabilities

text

Available through the unified API

Quick start

View API docs

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "60b4145e4fe14923bdce3fcdc1a115e9",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'

Providers

Available routing options for this model through OneInfer.

groq

60b4145e4fe14923bdce3fcdc1a115e9

Available

Input

$0.110 / 1M

Output

$0.340 / 1M

Routing

OneInfer optimized

Pricing

Current OneInfer pricing for this model.

Usage	Price
Input tokens	$0.110 / 1M
Output tokens	$0.340 / 1M

Performance

Published evaluation results associated with this model.

Information Discovery

WebSRC87.3

HotpotQA84.9

MultiHopQA82.7

FactVerification91.5

Research Synthesis

SciFact88.6

EvidenceInference85.2

TechReportQA83.4

Web Interaction

WebShop78.9

Mind2Web76.3

WebLINX81.5

General Reasoning

MMLU77.2

GSM8K84.7

HumanEval68.9

API example

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "60b4145e4fe14923bdce3fcdc1a115e9",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'