groq

llama-3.3-70b-versatile

Context	128K
Input	text
Output	text
Tool calling	Supported

About this model

Llama-3.3-70B-Instruct is Meta's flagship 70B-parameter model, featuring enhanced reasoning, a 128K-token context window, and enterprise-grade instruction following. It is a significant step up from Llama 3.1, with improved tool integration, safety alignment, and complex task handling.

Capabilities

text

Available through the unified API

Quick start

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "41d777609f63458e8deb0e32f634d21d",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
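
The same request can be assembled from Python using only the standard library. The sketch below mirrors the curl call above and optionally attaches tool definitions, since this page lists tool calling as supported. The `tools` field and the shape of `GET_WEATHER_TOOL` are assumptions based on the OpenAI-style chat completions format; this page does not document the tool schema, so check the API docs before relying on it. The function and constant names are illustrative.

```python
import json
import urllib.request

API_URL = "https://api.oneinfer.ai/v1/chat/completions"  # from the curl example
MODEL_ID = "41d777609f63458e8deb0e32f634d21d"  # model ID from the curl example


def build_request(prompt, api_key, tools=None):
    """Build the POST request from the curl example without sending it."""
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    if tools:
        # ASSUMPTION: OpenAI-style "tools" field; not confirmed by this page.
        payload["tools"] = tools
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request, timeout=30):
    """Send the request and return the decoded JSON response body."""
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return json.loads(response.read().decode("utf-8"))


# Hypothetical tool definition, for illustration only.
GET_WEATHER_TOOL = {
    "type": "function",
    "function": {
        "name": "get_weather",
        "description": "Look up the current weather for a city.",
        "parameters": {
            "type": "object",
            "properties": {"city": {"type": "string"}},
            "required": ["city"],
        },
    },
}
```

With `ONEINFER_API_KEY` set in the environment, `send(build_request("Hello!", os.environ["ONEINFER_API_KEY"]))` (after `import os`) would issue the same call as the curl command. Keeping request construction separate from sending makes the payload easy to inspect before any network traffic occurs.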

Providers

Available routing options for this model through OneInfer.

Provider	Model ID	Status	Input	Output	Routing
groq	41d777609f63458e8deb0e32f634d21d	Available	$0.590 / 1M	$0.790 / 1M	OneInfer optimized

Pricing

Current OneInfer pricing for this model.

Usage	Price
Input tokens	$0.590 / 1M
Output tokens	$0.790 / 1M
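
As a rough illustration of how these rates translate into per-request cost, the sketch below multiplies token counts by the listed prices. It assumes billing is strictly linear in tokens, with no minimum charge, rounding, or routing surcharge, none of which this page specifies. The function name is illustrative.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, taken from the pricing table above.
INPUT_PRICE_PER_M = Decimal("0.590")
OUTPUT_PRICE_PER_M = Decimal("0.790")
TOKENS_PER_M = Decimal(1_000_000)


def estimate_cost(input_tokens, output_tokens):
    """Return the estimated USD cost of one request as a Decimal.

    ASSUMPTION: cost is linear in token counts, with no minimums or fees.
    """
    input_cost = INPUT_PRICE_PER_M * input_tokens / TOKENS_PER_M
    output_cost = OUTPUT_PRICE_PER_M * output_tokens / TOKENS_PER_M
    return input_cost + output_cost
```

For example, a request with 2,000 input tokens and 500 output tokens comes to $0.00118 + $0.000395 = $0.001575 under these assumptions. `Decimal` is used instead of floats so that small per-request amounts add up without binary rounding drift.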

Performance

Published evaluation results associated with this model.

General Reasoning

MMLU	82.7
ARC-Challenge	89.4
HellaSwag	87.9
Winogrande	84.3

Mathematical Reasoning

GSM8K	92.8
MATH	53.6

Coding

HumanEval	76.4
MBPP	79.2
CRUXEval	71.8

Long-Context

Needle-128K	99.3
PG-22 Recall	95.1
GovReport-QA	92.8

Safety & Alignment

ToxiGen	96.2
TruthfulQA	73.5
Harmfulness	0.4
