Legacy fast, low-cost model from the Gemini 2.0 generation. Popular for high-volume multimodal tasks with a 1M-token context window; superseded by Gemini 2.5 Flash and Flash-Lite but kept for backward compatibility.
Profile normalized against the 47-model field
Context window · 1.05M of 1.05M · 100%
Max output · 8K of 272K · 3%
Output speed · 250 tok/s · 78%
Affordability · $0.40 / Mtok out · 100%
Capability breadth · 8 of 11 · 73%
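The "X of Y" rows in the profile are consistent with a plain ratio against the field maximum. The sketch below reproduces those three percentages under that assumption; the page does not state its formula, the helper name `normalized_pct` is hypothetical, and the output-speed and affordability scores are left out because their scaling cannot be inferred from the figures shown.

```python
def normalized_pct(value: float, field_max: float) -> int:
    """Return value as a rounded percentage of the field maximum (assumed formula)."""
    return round(100 * value / field_max)


# Figures taken from the profile rows; units cancel in the ratio.
print(normalized_pct(1.05, 1.05))  # context window, in millions of tokens
print(normalized_pct(8, 272))      # max output, in thousands of tokens
print(normalized_pct(8, 11))       # capability switches enabled
```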
Capability switches · 8 of 11
Reasoning mode
Tool / function use
Streaming
JSON mode
Structured outputs
Prompt caching
Fine-tuning
Web search
Code execution
Vision input
Audio input
Specifications
Every value carries a primary source and a verification date.
Capacity
Context window
1.05M
Max output
8K
Pricing
Input $/Mtok
$0.10 / Mtok input (USD)
Output $/Mtok
$0.40 / Mtok output (USD)
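To turn the per-Mtok prices into a per-request figure, a minimal sketch follows. It assumes flat per-token billing at the two listed rates and ignores any prompt-caching discount or modality-specific pricing, neither of which is specified here; the function and constant names are illustrative only.

```python
INPUT_USD_PER_MTOK = 0.10   # listed input price
OUTPUT_USD_PER_MTOK = 0.40  # listed output price


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Estimate request cost in USD, assuming flat per-token pricing."""
    total = input_tokens * INPUT_USD_PER_MTOK + output_tokens * OUTPUT_USD_PER_MTOK
    return total / 1_000_000


# Example: a 200,000-token prompt with a 2,000-token reply.
print(f"${estimate_cost_usd(200_000, 2_000):.4f}")
```

At these rates the input side dominates the cost of long-context requests, since the 8K output cap limits how much the output price can contribute.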
Performance
Output speed
250 tok/s
Time to first token
300 ms
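The two performance figures can be combined into a rough end-to-end latency estimate. This is a back-of-the-envelope sketch that assumes a constant generation rate after the first token and no network or queueing overhead; real latency will vary with load and prompt size. The function name is hypothetical.

```python
TIME_TO_FIRST_TOKEN_S = 0.300  # listed time to first token (300 ms)
OUTPUT_TOKENS_PER_S = 250.0    # listed output speed


def estimate_latency_s(output_tokens: int) -> float:
    """Estimate total response time in seconds under a constant-rate model."""
    return TIME_TO_FIRST_TOKEN_S + output_tokens / OUTPUT_TOKENS_PER_S


# Example: a 500-token reply, and a reply near the 8K output cap
# (8,000 is used as a round illustrative figure).
print(f"{estimate_latency_s(500):.1f} s")
print(f"{estimate_latency_s(8_000):.1f} s")
```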
Capabilities
Input modalities
audio, image, text, video
Output modalities
text
Reasoning mode
No
Tool / function use
Yes
Streaming
Yes
JSON mode
Yes
Structured outputs
Yes
Prompt caching
Yes
Web search
Yes
Vision input
Yes
Audio input
Yes
API
API model ID
gemini-2.0-flash
General
Knowledge cutoff
August 2024
Benchmarks
Sourced evaluation scores, each verified against its primary source.