Meta · Llama 3.3

Llama 3.3 70B Instruct

CurrentVerified · Jun 27, 2026

Text-only instruction-tuned workhorse still served on the Llama API. Confirmed current on llama.developer.meta.com/docs/models: "Model ID: `Llama-3.3-70B-Instruct`" — "A text-only instruction-tuned model with enhanced performance relative to Llama 3.1 70B, and relative to Llama 3.2 90B when used for

profile normalized against the 70-model field

Context window· 131K of 10M1%

Max output—

Output speed—

Affordability—

Capability breadth· 1 of 119%

Capability switches · 1 of 11

Reasoning mode

Tool / function use

Streaming

JSON mode

Structured outputs

Prompt caching

Fine-tuning

Web search

Code execution

Vision input

Audio input

Specifications

Every value carries a primary source and a verification date.

Capacity

Context window

131K

Capabilities

Reasoning mode

Tool / function use

Yes

Vision input

API

API model ID

Llama-3.3-70B-Instruct

General

Knowledge cutoff

December 2023

Release date

December 6, 2024

Benchmarks

Sourced evaluation scores, each verified against its primary source.

MMLU (CoT, 0-shot)

MMLU (CoT) | 0 | macro\_avg/acc | 73.0 | 86.0 | 86.0 | 88.6

86 macro_avg/accVerified

MMLU Pro (CoT, 5-shot)

MMLU Pro (CoT) | 5 | macro\_avg/acc | 48.3 | 66.4 | 68.9 | 73.3

68.9 macro_avg/accVerified

IFEval

IFEval | | | 80.4 | 87.5 | 92.1 | 88.6

92.1 accuracyVerified

GPQA Diamond (CoT, 0-shot)

GPQA Diamond (CoT) | 0 | acc | 31.8 | 48.0 | 50.5 | 49.0

50.5 accuracyVerified

HumanEval (0-shot)

HumanEval | 0 | pass@1 | 72.6 | 80.5 | 88.4 | 89.0

88.4 pass@1Verified

MBPP EvalPlus (base, 0-shot)

MBPP EvalPlus (base) | 0 | pass@1 | 72.8 | 86.0 | 87.6 | 88.6

87.6 pass@1Verified

MATH (CoT, 0-shot)

MATH (CoT) | 0 | sympy\_intersection\_score | 51.9 | 68.0 | 77.0 | 73.8

77 sympy_intersection_scoreVerified

BFCL v2 (0-shot)

BFCL v2 | 0 | overall\_ast\_summary/macro\_avg/valid | 65.4 | 77.5 | 77.3 | 81.1

77.3 overall_ast_summary/macro_avg/validVerified

MGSM (0-shot)

MGSM | 0 | em | 68.9 | 86.9 | 91.1 | 91.6

91.1 exact_matchVerified

Loading…