DeepSeek's flagship V4 mixture-of-experts (MoE) model (1.6T total / 49B active parameters), released in April 2026, with frontier reasoning, coding, and agentic performance and a 1M-token context. Open weights.
Every value carries a primary source and a verification date.
Sourced evaluation scores, each verified against its primary source.
MMLU-Pro (EM) | 89.1 | 87.5
GPQA Diamond (Pass@1)
SimpleQA-Verified (Pass@1)
Humanity's Last Exam (HLE, Pass@1)
LiveCodeBench (Pass@1)
Codeforces (Rating)
HMMT 2026 Feb (Pass@1)
IMOAnswerBench (Pass@1)
Apex Shortlist (Pass@1)
Terminal Bench 2.0 (Acc)
SWE-bench Verified (Resolved)
SWE-bench Pro (Resolved)
SWE-bench Multilingual (Resolved)
BrowseComp (Pass@1)
Chinese-SimpleQA (Pass@1)