Robots Atlas>ROBOTS ATLAS
Gemini 3.1 Flash-Lite

Gemini 3.1 Flash-Lite

3.1 Flash-Lite · Family: Gemini
Gemini 3.1 Flash-Lite is the most cost-efficient thinking model in Google DeepMind's Gemini 3 series, designed for high throughput and low latency while retaining reasoning quality.
⏳ Preview⏳ Limited accessLLMMultimodalReasoning modelTool-using model📁 Gemini
Context window
1M
tokens
Max output
65,536
tokens
Release date
29 April 2026
Access:APIHostedDeployment:☁ Cloud

Overview

Gemini 3.1 Flash-Lite is an AI model developed by Google DeepMind, announced on April 29, 2026 as part of the Gemini 3.1 family. It is a scalable thinking model designed for high-volume tasks at low cost and latency.

The model supports flexible reasoning levels, allowing users to select the level of thinking to apply per task. Context window: 1M tokens, max output: 64,000 tokens. Supports function calling, structured output, search as a tool, and code execution.

Available via Google AI Studio, Gemini API, and Vertex AI. Knowledge cutoff: January 2025. Output speed: 363 tokens/s. Lowest price in the series 3: input $0.25/1M, output $1.50/1M tokens.

Classification
LLMMultimodalReasoning modelTool-using model
Family: Gemini
Access & deployment
APIHosted
Cloud
Weights: Closed
Key parameters
📏 Context: 1M
Tools
📥 Input: text, image, audio, video
Platforms

Technical specification

Context window
1M
tokens
Max output tokens
65,536
tokens per response
Knowledge cutoff
1 Jan 2025
Knowledge boundary
License
proprietary
Hardware requirements
Available only through Google cloud infrastructure (Gemini API, Vertex AI, Google AI Studio).
Features:Tool use
Modalities
⬇ Input
textimageaudiovideodocuments
⬆ Output
textcode

Capabilities and applications

Native model capabilities
Reasoning
Category: reasoning
Multi-step reasoning
Category: reasoning
Long context
Category: reasoning
Multimodal understanding
Category: multimodal
Coding
Category: coding
Function Calling
Category: planning
Structured output
Category: structured_generation
Audio understanding
Category: audio
Image understanding
Category: vision
Video Understanding
Category: video
Chart understanding
Category: vision
Multilingual
Category: language
Streaming output
Category: reasoning

Benchmark results

11 benchmarks
Humanity's Last Exam
accuracy · No tools, Gemini 3.1 Flash-Lite High
16.0%%
📄 https://deepmind.google/models/gemini/flash-lite/
Full set (text + MM). No tools.
GPQA Diamond
accuracy · No tools, Gemini 3.1 Flash-Lite High
86.9%%
📄 https://deepmind.google/models/gemini/flash-lite/
Scientific knowledge, no tools.
MMMU-Pro
accuracy · No tools, Gemini 3.1 Flash-Lite High
76.8%%
📄 https://deepmind.google/models/gemini/flash-lite/
Multimodal understanding and reasoning.
CharXiv Reasoning
accuracy · Gemini 3.1 Flash-Lite High
73.2%%
📄 https://deepmind.google/models/gemini/flash-lite/
Information synthesis from complex charts.
Video-MMMU
accuracy · Gemini 3.1 Flash-Lite High
84.8%%
📄 https://deepmind.google/models/gemini/flash-lite/
Knowledge acquisition from videos.
SimpleQA Verified
accuracy · Gemini 3.1 Flash-Lite High
43.3%%
📄 https://deepmind.google/models/gemini/flash-lite/
Parametric knowledge.
FACTS Benchmark Suite
accuracy · Gemini 3.1 Flash-Lite High
40.6%%
📄 https://deepmind.google/models/gemini/flash-lite/
Factuality across grounding, parametric knowledge, search, and MM.
MMMLU
accuracy · Gemini 3.1 Flash-Lite High
88.9%%
📄 https://deepmind.google/models/gemini/flash-lite/
Multilingual Q&A.
LiveCodeBench
accuracy · UI: 1/1/2025-5/1/2025, Gemini 3.1 Flash-Lite High
72.0%%
📄 https://deepmind.google/models/gemini/flash-lite/
Code generation.
MRCR v2 (8-needle, 128k)
accuracy · 128k average, Gemini 3.1 Flash-Lite High
60.1%%
📄 https://deepmind.google/models/gemini/flash-lite/
Long context performance.
MRCR v2 (8-needle, 1M)
accuracy · 1M pointwise, Gemini 3.1 Flash-Lite High
12.3%%
📄 https://deepmind.google/models/gemini/flash-lite/
Very long context performance (1M tokens).

Pricing

Technical architecture

Deployment and security

☁ Available on platforms
🔒 Security / Enterprise
✓ Verified enterprise information

Model card dostępny publicznie.

Updated: 1 May 2026↗ Security documentation