Loading…
Loading…
Moonshot's latest general-purpose flagship on the Kimi API platform; selectable model id in the Create Chat Completion API enum (kimi-k2.6). Platform docs (platform.moonshot.ai/docs/guide/kimi-k2-6-model) verbatim: 'Kimi K2.6 is Kimi's latest and most intelligent model, possessing stronger and more
Every value carries a primary source and a verification date.
Sourced evaluation scores, each verified against its primary source.
SWE-bench Verified
<td align="center" style="vertical-align: middle">SWE-Bench Verified</td><td align="center" style="vertical-align: middle">80.2</td>
SWE-bench Multilingual
<td align="center" style="vertical-align: middle">SWE-Bench Multilingual</td><td align="center" style="vertical-align: middle">76.7</td>
SWE-bench Pro
<td align="center" style="vertical-align: middle">SWE-Bench Pro</td><td align="center" style="vertical-align: middle">58.6</td>
LiveCodeBench v6
<td align="center" style="vertical-align: middle">LiveCodeBench (v6)</td><td align="center" style="vertical-align: middle">89.6</td>
GPQA Diamond
<td align="center" style="vertical-align: middle">GPQA-Diamond</td><td align="center" style="vertical-align: middle">90.5</td>
AIME 2026
<td align="center" style="vertical-align: middle">AIME 2026</td><td align="center" style="vertical-align: middle">96.4</td>
HMMT 2026 (Feb)
<td align="center" style="vertical-align: middle">HMMT 2026 (Feb)</td><td align="center" style="vertical-align: middle">92.7</td>
MMMU-Pro
<td align="center" style="vertical-align: middle">MMMU-Pro</td><td align="center" style="vertical-align: middle">79.4</td>
Terminal-Bench 2.0 (Terminus-2)
<td align="center" style="vertical-align: middle">Terminal-Bench 2.0<br>(Terminus-2)</td><td align="center" style="vertical-align: middle">66.7</td>
HLE-Full (w/ tools)
<td align="center" style="vertical-align: middle">HLE-Full<br>(w/ tools)</td><td align="center" style="vertical-align: middle">54.0</td>
BrowseComp
<td align="center" style="vertical-align: middle">BrowseComp</td><td align="center" style="vertical-align: middle">83.2</td>