Gemini 3 Flash is a multimodal language model by Google DeepMind from the Gemini 3 family, designed for fast inference and low cost while retaining reasoning capabilities comparable to Gemini 3 Pro.
Context window: 1M tokens
Max output: 65,536 tokens
Release date: 17 December 2025
Access: API, Hosted. Deployment: Cloud
Access & deployment
Access: API, Hosted
Deployment: Cloud
Weights: Closed
Key parameters
Context: 1M
Tools: supported
Input: text, image, audio, video…
Platforms
Technical specification
Context window: 1M tokens
Max output tokens: 65,536 tokens per response
Knowledge cutoff: 1 Jan 2025
License: proprietary
Hardware requirements: Available only through Google cloud infrastructure (Gemini API, Vertex AI, Google AI Studio).
Features: Tool use
Modalities
Input: text, image, audio, video, documents
Output: text, code
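The limits and input modalities listed in the technical specification can be turned into a simple pre-flight check. The sketch below is illustrative only: it reads "1M" as 1,000,000 tokens and treats the context window as the cap on input size, both of which are assumptions not stated in the source. `RequestPlan` and `check_request` are hypothetical names and are not part of any Google SDK.

```python
from dataclasses import dataclass

# Figures copied from the specification; "1M" is read as 1,000,000 (an assumption).
CONTEXT_WINDOW_TOKENS = 1_000_000
MAX_OUTPUT_TOKENS = 65_536
INPUT_MODALITIES = {"text", "image", "audio", "video", "documents"}


@dataclass
class RequestPlan:
    """Hypothetical description of a planned request (not an SDK type)."""
    input_tokens: int
    requested_output_tokens: int
    modalities: set[str]


def check_request(plan: RequestPlan) -> list[str]:
    """Return a list of problems; an empty list means the plan fits the listed limits."""
    problems = []
    if plan.input_tokens > CONTEXT_WINDOW_TOKENS:
        excess = plan.input_tokens - CONTEXT_WINDOW_TOKENS
        problems.append(f"input exceeds context window by {excess} tokens")
    if plan.requested_output_tokens > MAX_OUTPUT_TOKENS:
        excess = plan.requested_output_tokens - MAX_OUTPUT_TOKENS
        problems.append(f"requested output exceeds max output by {excess} tokens")
    unsupported = sorted(plan.modalities - INPUT_MODALITIES)
    if unsupported:
        problems.append("unsupported input modalities: " + ", ".join(unsupported))
    return problems


# A plan well inside the listed limits produces no problems.
print(check_request(RequestPlan(500_000, 8_192, {"text", "image"})))  # prints []
```

How tokens are actually counted depends on the tokenizer and on the modality, so a real check would take its token counts from the provider rather than from an estimate.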
Capabilities and applications
Native model capabilities
Reasoning (category: reasoning)
Multi-step reasoning (category: reasoning)
Long context (category: reasoning)
Multimodal understanding (category: multimodal)
Coding (category: coding)
Function calling (category: planning)
Structured output (category: structured_generation)
Audio understanding (category: audio)
Image understanding (category: vision)
Video understanding (category: video)
Chart understanding (category: vision)
Diagram reasoning (category: reasoning)
OCR (category: vision)
Multilingual (category: language)
Planning (category: planning)
Streaming output (category: reasoning)
Interleaved multimodal input (category: reasoning)
Benchmark results
15 benchmarks. All results are accuracy scores for Gemini 3 Flash Thinking, as reported at https://deepmind.google/models/gemini/flash/.
Humanity's Last Exam (no tools): 33.7%. Full set (text + MM).
Humanity's Last Exam (with search and code execution): 43.5%. Full set (text + MM).
GPQA Diamond (no tools): 90.4%. Scientific knowledge.
ARC-AGI-2 (ARC Prize Verified): 33.6%. Abstract reasoning puzzles.
AIME 2025 (no tools): 95.2%. Mathematics.
AIME 2025 (with code execution): 99.7%. Mathematics.
MMMU-Pro (no tools): 81.2%. Multimodal understanding and reasoning.
SWE-Bench Verified (single attempt): 78.0%. Agentic coding.
MMMLU: 91.8%. Multilingual Q&A.
Video-MMMU: 86.9%. Knowledge acquisition from videos.
FACTS Benchmark Suite: 61.9%. Factuality benchmark across grounding, parametric knowledge, search, and MM.
SimpleQA Verified: 68.7%. Parametric knowledge.
τ2-bench: 90.2%. Agentic tool use.
Toolathlon: 49.4%. Long-horizon real-world software tasks.
MCP Atlas: 57.4%. Multi-step workflows using MCP.
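Two of the benchmarks above are reported both with and without tools, so the gain from tool access can be read off as a difference in percentage points. The snippet below is a small worked calculation using only the scores listed above; the function name `tool_uplift` and the dictionary layout are illustrative choices, not part of any published evaluation harness.

```python
# Accuracy scores (%) copied from the benchmark list.
# "with_tools" means search and code execution for Humanity's Last Exam,
# and code execution only for AIME 2025.
SCORES = {
    "Humanity's Last Exam": {"no_tools": 33.7, "with_tools": 43.5},
    "AIME 2025": {"no_tools": 95.2, "with_tools": 99.7},
}


def tool_uplift(scores: dict) -> dict:
    """Return the gain in percentage points from enabling tools, per benchmark."""
    return {
        name: round(s["with_tools"] - s["no_tools"], 1)
        for name, s in scores.items()
    }


for name, gain in tool_uplift(SCORES).items():
    print(f"{name}: +{gain} percentage points")
```

This gives +9.8 points on Humanity's Last Exam and +4.5 points on AIME 2025. The two figures are not directly comparable, since the tool sets differ and AIME 2025 is already close to its ceiling without tools.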
Pricing
Technical architecture
Core Architecture
Model Form
Training Techniques
Deployment and security
☁ Available on platforms
🔒 Security / Enterprise
✓ Verified enterprise information
Gemini 3 Flash is available in Vertex AI and Gemini Enterprise. The model has undergone safety evaluations in line with Google DeepMind's Frontier Safety Framework. The model card is publicly available.
Updated: 1 May 2026
