Fast multimodal model from the Gemini 3.5 family, optimized for agentic coding, long context and advanced reasoning with low latency.
Context window
1M
tokens
Max output
65,536
tokens
Access:APIHostedDeployment:โ Cloud
Overview
Applications
Access & deployment
APIHosted
Cloud
Weights: Closed
Key parameters
๐ Context: 1M
โ Tools
๐ฅ Input: text, image, audio, videoโฆ
Technical specification
Context window
1M
tokens
Max output tokens
65,536
tokens per response
Knowledge cutoff
1 Jan 2025
Knowledge boundary
License
proprietary
Hardware requirements
Available only through Google cloud infrastructure (Gemini API, Vertex AI, Google AI Studio).
Features:โ Tool use
Modalities
โฌ Input
textimageaudiovideodocuments
โฌ Output
textcode
Capabilities and applications
Native model capabilities
Reasoning
Category: reasoning
Multi-step reasoning
Category: reasoning
Long context
Category: reasoning
Multimodal understanding
Category: multimodal
Coding
Category: coding
Function Calling
Category: planning
Structured output
Category: structured_generation
Audio understanding
Category: audio
Image understanding
Category: vision
Video Understanding
Category: video
Chart understanding
Category: vision
OCR
Category: vision
Multilingual
Category: language
Planning
Category: planning
Interleaved Multimodal Input
Category: reasoning
Benchmark results
14 benchmarks
Terminal-bench 2.1
accuracy ยท Terminus-2 harness
76.2%%
๐ deepmind.google/models/gemini/flash
SWE-Bench Pro (Public)
accuracy ยท Single attempt
55.1%%
๐ deepmind.google/models/gemini/flash
MCP Atlas
accuracy
83.6%%
๐ deepmind.google/models/gemini/flash
Toolathlon
accuracy
56.5%%
๐ deepmind.google/models/gemini/flash
OSWorld-Verified
accuracy
78.4%%
๐ deepmind.google/models/gemini/flash
Finance Agent v2
accuracy
57.9%%
๐ deepmind.google/models/gemini/flash
GDPval-AA
Elo ยท Economically valuable knowledge work
1656
๐ deepmind.google/models/gemini/flash
CharXiv Reasoning
accuracy ยท No tools
84.2%%
๐ deepmind.google/models/gemini/flash
MMMU-Pro
accuracy ยท No tools
83.6%%
๐ deepmind.google/models/gemini/flash
Blueprint-Bench 2
normalized score
33.6%%
๐ deepmind.google/models/gemini/flash
MRCR v2 (8-needle) 128k
accuracy ยท Long context, average
77.3%%
๐ deepmind.google/models/gemini/flash
MRCR v2 (8-needle) 1M
accuracy ยท Pointwise
26.6%%
๐ deepmind.google/models/gemini/flash
Humanity's Last Exam
accuracy ยท Full set, text + MM
40.2%%
๐ deepmind.google/models/gemini/flash
ARC-AGI-2
accuracy
72.1%%
๐ deepmind.google/models/gemini/flash
Technical architecture
Core Architecture
Model Form
Deployment and security
Sources and related pages
2 sources
