Active

Mar 25, 2025

Producer: Google DeepMind

Family: Gemini

API · Hosted UI · Cloud

Gemini 2.5 Pro

LLM · Multimodal · Reasoning · Tool-Using

Gemini 2.5 Pro is Google DeepMind's flagship reasoning model, generally available since June 17, 2025. It is built on a sparse Mixture-of-Experts (MoE) architecture, supports a context window of up to 1M tokens, accepts text, audio, image, and video input, and has an integrated thinking mode.

Technical specification

Context window

up to 1M tokens

Parameters

undisclosed

Max output

License

proprietary

Tools

Yes

Fine-tuning

Weights access

Closed

Knowledge cutoff

Jan 2025

Hardware requirements

Access via Google Cloud infrastructure (Vertex AI / Gemini API)

Last updated: Apr 20, 2026

Modalities

Input

Text

Image

Audio

Video

Documents

Structured data

URLs

Output

Text

Code

Structured data

Summaries

Analytical reports

Image

Capabilities

Reasoning★ (Reasoning)
Multi-step reasoning★ (Reasoning)
Long context★ (Reasoning)
Coding★ (Coding)
Function Calling (Planning)
Structured output★ (Structured gen.)
Audio understanding (Audio)
Image understanding★ (Vision)
Video Understanding (Other)
Chart understanding (Vision)
Diagram reasoning (Reasoning)
OCR★ (Vision)
Multilingual★ (Language)
Planning★ (Planning)
Streaming output (Reasoning)
Interleaved Multimodal Input (Reasoning)
Multimodal understanding★ (Multimodality)

Applications

Pricing

Jun 17, 2025

Public · USD · per 1M tokens

Two-tier pricing based on context length. Prompts ≤200K tokens: $1.25/MTok input, $10.00/MTok output (thinking tokens counted as output). Prompts >200K tokens: $2.50/MTok input, $15.00/MTok output. Context caching: $0.31/MTok (≤200K), $0.625/MTok (>200K), storage $4.50/MTok/h. Batch API: ~50% discount. Free tier available in Google AI Studio (data used for product training).
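The two-tier scheme above can be sketched as a small estimator. This is a minimal illustration, not an official billing formula: the function name `estimate_cost` and its parameters are hypothetical, it assumes the whole request is billed at the tier selected by prompt length, and it treats the "~50%" batch discount as exactly 0.5.

```python
from dataclasses import dataclass

# Rates in USD per 1M tokens, copied from the pricing paragraph above.
LONG_PROMPT_THRESHOLD = 200_000  # prompts above this size use the higher tier


@dataclass(frozen=True)
class Tier:
    input_per_mtok: float
    output_per_mtok: float


SHORT_TIER = Tier(input_per_mtok=1.25, output_per_mtok=10.00)
LONG_TIER = Tier(input_per_mtok=2.50, output_per_mtok=15.00)
BATCH_DISCOUNT = 0.5  # assumption: the page only says "~50%"


def estimate_cost(prompt_tokens: int, output_tokens: int,
                  thinking_tokens: int = 0, batch: bool = False) -> float:
    """Estimate the USD cost of one request (hypothetical helper)."""
    # Assumption: the tier is chosen by prompt length and applies to the
    # whole request, input and output alike.
    tier = LONG_TIER if prompt_tokens > LONG_PROMPT_THRESHOLD else SHORT_TIER
    # Thinking tokens are billed at the output rate.
    billed_output = output_tokens + thinking_tokens
    cost = (prompt_tokens * tier.input_per_mtok
            + billed_output * tier.output_per_mtok) / 1_000_000
    return cost * BATCH_DISCOUNT if batch else cost
```

With 10,000 prompt tokens and 10,000 output tokens, this estimator returns about $0.1125, which agrees with the page's calculator total for 10K tokens.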

Pricing · Calculator

Token volume: 10K

Input: $1.2500 per 1M tokens, $0.0125 for 10K tokens

Output: $10.0000 per 1M tokens, $0.1000 for 10K tokens

Cache: $0.3100 per 1M tokens, $0.00310 for 10K tokens

Total for 10K tokens (input + output): $0.1125

Price per 1M tokens · USD

Standard

All plans (3)

Text · Standard (prompts ≤200K tokens), per 1M tokens

Input: $1.2500

Cache: $0.3100

Output: $10.0000

Output includes thinking tokens. Context caching: $0.31/MTok read, storage $4.50/MTok/h.

Text · Standard (prompts >200K tokens), per 1M tokens

Input: $2.5000

Cache: $0.6250

Output: $15.0000

Higher rate applies for long contexts exceeding 200K tokens.

Text · Batch, per 1M tokens

Input: $0.6250

Output: $5.0000

Batch API (~50% discount). Asynchronous processing.
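To see when the context caching rates listed above pay off, the comparison below is a rough sketch under stated assumptions. The helper names are hypothetical, the tier is chosen by the cached prefix length alone, and the cost of the first (cache-creating) request and of any non-cached tokens is ignored.

```python
# Rates in USD per 1M tokens, taken from the price list above.
LONG_THRESHOLD = 200_000
CACHE_READ = {"short": 0.31, "long": 0.625}
INPUT = {"short": 1.25, "long": 2.50}
STORAGE_PER_MTOK_HOUR = 4.50


def _tier(prefix_tokens: int) -> str:
    # Assumption: tier is selected by the cached prefix length alone.
    return "long" if prefix_tokens > LONG_THRESHOLD else "short"


def cached_prefix_cost(prefix_tokens: int, reuses: int, hours: float) -> float:
    """USD cost of reading a cached prefix `reuses` times and storing it for `hours`."""
    reads = prefix_tokens * reuses * CACHE_READ[_tier(prefix_tokens)] / 1_000_000
    storage = prefix_tokens * hours * STORAGE_PER_MTOK_HOUR / 1_000_000
    return reads + storage


def uncached_prefix_cost(prefix_tokens: int, reuses: int) -> float:
    """USD cost of sending the same prefix as ordinary input `reuses` times."""
    return prefix_tokens * reuses * INPUT[_tier(prefix_tokens)] / 1_000_000
```

Under these assumptions, a 100K-token prefix reused 10 times within one hour costs about $0.76 cached versus $1.25 uncached; with only a couple of reuses the hourly storage fee can outweigh the saving.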

Prices apply to the paid tier of the Gemini Developer API (Google AI Studio). Vertex AI has separate pricing (cloud.google.com/vertex-ai/generative-ai/pricing), which is broadly similar. Thinking tokens are counted as output tokens. Grounding with Google Search: 1,500 free requests per day, then $35 per 1,000 requests.
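The grounding fee in the note above can be estimated with a short sketch. The function name is hypothetical, and it assumes paid requests are billed pro rata per request beyond the daily free quota; actual billing granularity may differ.

```python
FREE_REQUESTS_PER_DAY = 1_500   # free Grounding with Google Search quota
USD_PER_1000_PAID = 35.0        # rate beyond the free quota


def grounding_cost_per_day(requests: int) -> float:
    """Estimate the daily USD cost of grounded requests (hypothetical helper)."""
    # Only requests above the free daily quota are charged.
    paid = max(0, requests - FREE_REQUESTS_PER_DAY)
    # Assumption: pro rata billing, i.e. $0.035 per paid request.
    return paid * USD_PER_1000_PAID / 1000
```

For instance, 2,500 grounded requests in one day would leave 1,000 paid requests, or about $35 under this assumption.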

Security and enterprise

The model was evaluated for cybersecurity, CBRN, autonomy, and other risks in accordance with Google DeepMind's Frontier Safety Framework. Detailed safety assessments are included in the technical report and model card. Advanced mitigations against indirect prompt injection have been implemented.

Technical information

The technical report includes full safety evaluations covering cybersecurity, CBRN, Machine Learning R&D, and Deceptive Alignment. A model card is available at modelcards.withgoogle.com. At Google I/O 2025, Google announced significant improvements to protection against indirect prompt injection attacks, describing Gemini 2.5 as the "most secure model family to date". Deep Think mode underwent additional safety evaluations before broad release. Training data was subject to safety filtering. The paid API tier does not use data for model training, unlike the free tier.

Official Privacy Center · Updated: June 17, 2025