Stable Gemini 2.5 Flash model, a fast and low-cost prior-generation Flash model with a ~1M token context window; still offered alongside the Gemini 3 family.
profile normalized against the 47-model field
Context window· 1.05M of 1.05M100%
Max output· 66K of 272K24%
Output speed—
Affordability· $2.50 / Mtok out99%
Capability breadth· 2 of 1118%
Capability switches · 2 of 11
Reasoning mode
Tool / function use
Streaming
JSON mode
Structured outputs
Prompt caching
Fine-tuning
Web search
Code execution
Vision input
Audio input
Specifications
Every value carries a primary source and a verification date.
Capacity
Context window
1.05M
Max output
66K
Pricing
Input $/Mtok
$0.30 / Mtok input (text/image/video) USD/Mtok
Output $/Mtok
$2.50 / Mtok output USD/Mtok
Capabilities
Reasoning mode
true
Tool / function use
true
Vision input
Inputs: Text, images, video, audio
API
API model ID
gemini-2.5-flash
General
Knowledge cutoff
January 2025
Benchmarks
Sourced evaluation scores, each verified against its primary source.
GPQA Diamond
Science GPQA diamond single attempt (pass@1) ... Gemini 2.5 Flash GA Thinking 82.8%
82.8 %Verified
Humanity's Last Exam (no tools)
Reasoning & knowledge Humanity's Last Exam (no tools) ... Gemini 2.5 Flash GA Thinking 11.0%