GLM-4.7

CurrentVerified · Jun 27, 2026

Current GLM-4 coding series (siblings: GLM-4.7, GLM-4.7-FlashX, GLM-4.7-Flash). docs.z.ai/guides/llm/glm-4.7 model id verbatim: "model": "glm-4.7" and overview: 'GLM-4.7 Series are Z.AI's models, featuring upgrades in two key areas: enhanced programming capabilities and more stable multi-step reason

profile normalized against the 70-model field

Context window—

Max output· 131K of 384K34%

Output speed—

Affordability· $2.20 / Mtok out99%

Capability breadth· 2 of 1118%

Capability switches · 2 of 11

Reasoning mode

Tool / function use

Streaming

JSON mode

Structured outputs

Prompt caching

Fine-tuning

Web search

Code execution

Vision input

Audio input

Specifications

Every value carries a primary source and a verification date.

Capacity

Max output

131K

Pricing

Input $/Mtok

$0.60 / 1M input tokens USD per 1M tokens

Cached input $/Mtok

$0.11 / Mtok USD/Mtok

Output $/Mtok

$2.20 / 1M output tokens USD per 1M tokens

Capabilities

Reasoning mode

Yes

Tool / function use

Yes

API

API model ID

glm-4.7

Benchmarks

Sourced evaluation scores, each verified against its primary source.

MMLU-Pro

| MMLU-Pro | 84.3 | 83.2 | 84.6 | 85.0 | 90.1 | 88.2 | 87.5 | 87.0 |

84.3 %Verified

GPQA Diamond

| GPQA-Diamond | 85.7 | 81.0 | 84.5 | 82.4 | 91.9 | 83.4 | 85.7 | 88.1 |

85.7 %Verified

Humanity's Last Exam

| HLE | 24.8 | 17.2 | 23.9 | 25.1 | 37.5 | 13.7 | 26.3 | 25.7 |

24.8 %Verified

Humanity's Last Exam (w/ Tools)

| HLE w/ Tools | 42.8 | 30.4 | 44.9 | 40.8 | 45.8 | 32.0 | 35.2 | 42.7 |

42.8 %Verified

AIME 2025

| AIME 2025 | 95.7 | 93.9 | 94.5 | 93.1 | 95.0 | 87.0 | 94.6 | 94.0 |

95.7 %Verified

HMMT Feb 2025

| HMMT Feb. 2025 | 97.1 | 89.2 | 89.4 | 92.5 | 97.5 | 79.2 | 88.3 | 96.3 |

97.1 %Verified

HMMT Nov 2025

| HMMT Nov. 2025 | 93.5 | 87.7 | 89.2 | 90.2 | 93.3 | 81.7 | 89.2 | - |

93.5 %Verified

IMOAnswerBench

| IMOAnswerBench | 82.0 | 73.5 | 78.6 | 78.3 | 83.3 | 65.8 | 76.0 | - |

82 %Verified

LiveCodeBench v6

| LiveCodeBench-v6 | 84.9 | 82.8 | 83.1 | 83.3 | 90.7 | 64.0 | 87.0 | 87.0 |

84.9 %Verified

SWE-bench Verified

| SWE-bench Verified | 73.8 | 68.0 | 71.3 | 73.1 | 76.2 | 77.2 | 74.9 | 76.3 |

73.8 %Verified

SWE-bench Multilingual

| SWE-bench Multilingual | 66.7 | 53.8 | 61.1 | 70.2 | - | 68.0 | 55.3 | - |

66.7 %Verified

Terminal Bench Hard

| Terminal Bench Hard | 33.3 | 23.6 | 30.6 | 35.4 | 39.0 | 33.3 | 30.5 | 43.0 |

33.3 %Verified

Terminal Bench 2.0

| Terminal Bench 2.0 | 41.0 | 24.5 | 35.7 | 46.4 | 54.2 | 42.8 | 35.2 | 47.6 |

41 %Verified

BrowseComp

| BrowseComp | 52.0 | 45.1 | - | 51.4 | - | 24.1 | 54.9 | 50.8 |

52 %Verified

BrowseComp-ZH

| BrowseComp-ZH | 66.6 | 49.5 | 62.3 | 65.0 | - | 42.4 | 63.0 | - |

66.6 %Verified

tau2-Bench

| τ²-Bench | 87.4 | 75.2 | 74.3 | 85.3 | 90.7 | 87.2 | 82.4 | 82.7 |

87.4 %Verified

Loading…

GLM-4.7

CurrentVerified · Jun 27, 2026

Specifications

Every value carries a primary source and a verification date.

Capacity

Max output

131K

Pricing

Input $/Mtok

$0.60 / 1M input tokens USD per 1M tokens

Cached input $/Mtok

$0.11 / Mtok USD/Mtok

Output $/Mtok

$2.20 / 1M output tokens USD per 1M tokens

Capabilities

Reasoning mode

Yes

Tool / function use

Yes

API

API model ID

glm-4.7

Benchmarks

Sourced evaluation scores, each verified against its primary source.

MMLU-Pro

| MMLU-Pro | 84.3 | 83.2 | 84.6 | 85.0 | 90.1 | 88.2 | 87.5 | 87.0 |

84.3 %Verified

GPQA Diamond

| GPQA-Diamond | 85.7 | 81.0 | 84.5 | 82.4 | 91.9 | 83.4 | 85.7 | 88.1 |

85.7 %Verified

Humanity's Last Exam

| HLE | 24.8 | 17.2 | 23.9 | 25.1 | 37.5 | 13.7 | 26.3 | 25.7 |

24.8 %Verified

Humanity's Last Exam (w/ Tools)

| HLE w/ Tools | 42.8 | 30.4 | 44.9 | 40.8 | 45.8 | 32.0 | 35.2 | 42.7 |

42.8 %Verified

AIME 2025

| AIME 2025 | 95.7 | 93.9 | 94.5 | 93.1 | 95.0 | 87.0 | 94.6 | 94.0 |

95.7 %Verified

HMMT Feb 2025

| HMMT Feb. 2025 | 97.1 | 89.2 | 89.4 | 92.5 | 97.5 | 79.2 | 88.3 | 96.3 |

97.1 %Verified

HMMT Nov 2025

| HMMT Nov. 2025 | 93.5 | 87.7 | 89.2 | 90.2 | 93.3 | 81.7 | 89.2 | - |

93.5 %Verified

IMOAnswerBench

| IMOAnswerBench | 82.0 | 73.5 | 78.6 | 78.3 | 83.3 | 65.8 | 76.0 | - |

82 %Verified

LiveCodeBench v6

| LiveCodeBench-v6 | 84.9 | 82.8 | 83.1 | 83.3 | 90.7 | 64.0 | 87.0 | 87.0 |

84.9 %Verified

SWE-bench Verified

| SWE-bench Verified | 73.8 | 68.0 | 71.3 | 73.1 | 76.2 | 77.2 | 74.9 | 76.3 |

73.8 %Verified

SWE-bench Multilingual

| SWE-bench Multilingual | 66.7 | 53.8 | 61.1 | 70.2 | - | 68.0 | 55.3 | - |

66.7 %Verified

Terminal Bench Hard

| Terminal Bench Hard | 33.3 | 23.6 | 30.6 | 35.4 | 39.0 | 33.3 | 30.5 | 43.0 |

33.3 %Verified

Terminal Bench 2.0

| Terminal Bench 2.0 | 41.0 | 24.5 | 35.7 | 46.4 | 54.2 | 42.8 | 35.2 | 47.6 |

41 %Verified

BrowseComp

| BrowseComp | 52.0 | 45.1 | - | 51.4 | - | 24.1 | 54.9 | 50.8 |

52 %Verified

BrowseComp-ZH

| BrowseComp-ZH | 66.6 | 49.5 | 62.3 | 65.0 | - | 42.4 | 63.0 | - |

66.6 %Verified

tau2-Bench

| τ²-Bench | 87.4 | 75.2 | 74.3 | 85.3 | 90.7 | 87.2 | 82.4 | 82.7 |

87.4 %Verified