novita

meta-llama/llama-3-8b-instruct

Context: 8K

Input: text

Output: text

Tool calling: Supported

About this model

Meta-Llama-3-8B-Instruct is the instruction-tuned variant of Meta's 8-billion-parameter Llama 3 transformer, optimized for dialogue and task completion. The base model was pretrained on more than 15T tokens, and the instruct variant was then aligned with supervised fine-tuning and RLHF. It performs well for its size on reasoning, coding, and multilingual tasks, and it is small enough for single-GPU deployment. Compared with the base model, it follows instructions more reliably and hallucinates less.

Capabilities

text

Available through the unified API

Quick start

curl https://api.oneinfer.ai/v1/chat/completions \
  -H "Authorization: Bearer $ONEINFER_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "2dcb0b7ca03747a5a2beb6a01ecb6937",
    "messages": [
      { "role": "user", "content": "Hello!" }
    ]
  }'
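
For callers who prefer Python, the curl command above can be sketched with the standard library alone. The helper names `build_chat_request` and `send` are hypothetical and not part of any OneInfer SDK; the URL, headers, and model ID are copied from the curl example. The response schema is not documented on this page, so the sketch only decodes the JSON body and does not assume any particular fields.

```python
import json
import urllib.request

# Endpoint and model ID copied from the curl example on this page.
API_URL = "https://api.oneinfer.ai/v1/chat/completions"
MODEL_ID = "2dcb0b7ca03747a5a2beb6a01ecb6937"


def build_chat_request(api_key: str, prompt: str) -> urllib.request.Request:
    """Build the same POST request as the curl command, without sending it.

    Hypothetical helper for illustration only.
    """
    payload = {
        "model": MODEL_ID,
        "messages": [{"role": "user", "content": prompt}],
    }
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def send(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON body.

    The response fields are an unknown here; inspect the returned dict.
    """
    with urllib.request.urlopen(request, timeout=30) as response:
        return json.loads(response.read().decode("utf-8"))
```

Calling `send(build_chat_request(api_key, "Hello!"))` with the value of `ONEINFER_API_KEY` should issue the same request as the curl command. Keeping request construction separate from sending makes the payload easy to inspect before any network call is made.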

Providers

Available routing options for this model through OneInfer.

Provider: novita

Model ID: 2dcb0b7ca03747a5a2beb6a01ecb6937

Status: Available

Input: $0.040 / 1M

Output: $0.040 / 1M

Routing: OneInfer optimized

Pricing

Current OneInfer pricing for this model.

Usage            Price
Input tokens     $0.040 / 1M
Output tokens    $0.040 / 1M
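
To make the per-token rates concrete, here is a minimal sketch of a cost estimate, assuming the listed prices apply linearly per token with no minimum charge or rounding (the page does not say either way). The function name `estimate_cost_usd` is hypothetical.

```python
# Prices copied from the pricing table on this page (USD per 1M tokens).
INPUT_PRICE_PER_MILLION_USD = 0.040
OUTPUT_PRICE_PER_MILLION_USD = 0.040


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost of one request in USD.

    Assumes simple linear pricing; actual billing rules may differ.
    """
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (
        input_tokens * INPUT_PRICE_PER_MILLION_USD
        + output_tokens * OUTPUT_PRICE_PER_MILLION_USD
    )
    return total / 1_000_000
```

Under these assumptions, a request with 2,000 input tokens and 500 output tokens comes to about $0.0001, and one million tokens each of input and output comes to about $0.08.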

Performance

Published evaluation results associated with this model.

General Reasoning

MMLU (5-shot)      68.4
ARC-Challenge      59
HellaSwag          82.2
Winogrande         74.6

Mathematical Reasoning

GSM8K (8-shot)     84.9
MATH (4-shot)      30

Code Generation

HumanEval (0-shot) 62.2
MBPP (3-shot)      61

Instruction Following

IFEval (prompt adherence)  81.3
