ChatGPT Images 2.0

gpt-image-2 · Family: GPT

OpenAI's image generation and editing model with built-in O-series reasoning, up to 2K resolution, non-Latin text rendering, and batch generation of up to 8 images per prompt. Available via API as gpt-image-2.

✓ Active · ✓ Public access · Image generation · Reasoning model · Multimodal · 📁 GPT

Release date

21 April 2026

🏢 Producer: OpenAI

Access: API, Hosted · Deployment: ☁ Cloud

Overview

Model and availability

API model name: gpt-image-2.

Replaces: DALL‑E 2, DALL‑E 3, gpt-image-1.5.

Planned deprecation of previous models: 12 May 2026.

Available in: ChatGPT (web, mobile, desktop), the Codex environment, and via the OpenAI API.

Production snapshot: gpt-image-2-2026-04-21.

Alias: chatgpt-image-latest points to the current default image model.
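
The difference between the dated snapshot and the floating alias can be illustrated with a minimal sketch that only builds a request body and performs no network call. It assumes gpt-image-2 accepts the same `model`, `prompt`, and `n` JSON fields as earlier GPT Image models on the image generation endpoint, and that the limit of 8 images per prompt applies to `n`; neither is confirmed by this page. The helper name `build_image_request` is hypothetical.

```python
import json

# Model identifiers as listed on this page.
PINNED_SNAPSHOT = "gpt-image-2-2026-04-21"  # dated snapshot: behaviour stays fixed
FLOATING_ALIAS = "chatgpt-image-latest"     # alias: follows the current default model


def build_image_request(prompt: str, pin_snapshot: bool = True, n: int = 1) -> dict:
    """Return a JSON-serializable request body (hypothetical helper, no network I/O).

    Assumption: the field names match those used by earlier GPT Image models.
    """
    if not prompt.strip():
        raise ValueError("prompt must not be empty")
    if not 1 <= n <= 8:  # assumed mapping of the "up to 8 images per prompt" limit
        raise ValueError("n must be between 1 and 8")
    model = PINNED_SNAPSHOT if pin_snapshot else FLOATING_ALIAS
    return {"model": model, "prompt": prompt, "n": n}


if __name__ == "__main__":
    body = build_image_request("A bilingual street sign at dusk", n=2)
    print(json.dumps(body, indent=2))
```

Pinning the snapshot is the usual choice for production pipelines that need reproducible behaviour, while the alias suits prototypes that should always track the newest default model.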

Architecture and key innovation

Built-in reasoning layer based on the O-series model family.

Before generating, the model:

analyzes the prompt,

plans composition and element layout,

models spatial relationships,

can search the web to retrieve current information,

verifies its own output after generation.

Difference from earlier diffusion models: earlier models operated reactively, without an explicit planning step.

Operating modes

Instant mode:

Available free of charge to all ChatGPT users.

Provides better generation quality than previous models, but without a reasoning step.

Thinking mode:

Available to: ChatGPT Plus, Pro, Business, Enterprise.

Features:

web search,

generation of up to 8 coherent images from a single prompt,

output verification.

Technical generation parameters

Resolutions: up to 2K; sizes above 2560×1440 px are experimental.

Aspect ratios: from 3:1 to 1:3.

Output formats: PNG, JPEG, WebP.

Transparent backgrounds are not supported in Responses API mode.
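
The limits listed in this section can be checked on the client before a request is sent. The sketch below is illustrative only: the function name `check_output_spec` is hypothetical, and treating the 2560×1440 threshold as a total pixel count is an assumption, since the page does not say how that boundary is measured.

```python
from fractions import Fraction

ALLOWED_FORMATS = {"png", "jpeg", "webp"}  # output formats listed on this page
MAX_RATIO = Fraction(3, 1)                 # widest allowed aspect ratio, 3:1
MIN_RATIO = Fraction(1, 3)                 # tallest allowed aspect ratio, 1:3
EXPERIMENTAL_PIXELS = 2560 * 1440          # assumption: threshold compared by pixel count


def check_output_spec(width: int, height: int, fmt: str) -> list[str]:
    """Return human-readable notes; an empty list means no issue was found."""
    if width <= 0 or height <= 0:
        return ["width and height must be positive"]
    notes = []
    ratio = Fraction(width, height)  # exact arithmetic, so 3:1 itself passes
    if not MIN_RATIO <= ratio <= MAX_RATIO:
        notes.append(f"aspect ratio {width}:{height} is outside 3:1 to 1:3")
    if fmt.lower() not in ALLOWED_FORMATS:
        notes.append(f"unsupported output format: {fmt}")
    if width * height > EXPERIMENTAL_PIXELS:
        notes.append("size exceeds 2560x1440 px and is treated as experimental")
    return notes


if __name__ == "__main__":
    print(check_output_spec(3200, 1000, "gif"))
```

Using `Fraction` instead of floating-point division keeps the boundary ratios exact, so a 3072×1024 request is accepted rather than rejected by a rounding error.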

Text in images

Significantly improved text rendering within images.

Support for Latin and non-Latin scripts (including Japanese, Korean, Chinese, Hindi, Bengali).

Declared text generation accuracy: ~99%.

Image Arena results (arena.ai)

The model achieved an Elo score of 1512 in the text-to-image category.

Margin over second place (Google Gemini 3.1 Flash Image): +242 Elo points.

Ranking positions:

text-to-image: 1512 (1st place),

single-image edit: 1513 (1st place),

multi-image edit: 1464 (1st place).
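
To give the Elo margin a more intuitive reading, the sketch below converts a rating gap into an expected head-to-head preference rate using the standard logistic Elo formula with a 400-point scale. Whether Image Arena uses exactly this scale is an assumption, so the result is indicative only.

```python
def expected_win_rate(rating_a: float, rating_b: float) -> float:
    """Standard Elo expectation that A is preferred over B (400-point logistic scale)."""
    return 1.0 / (1.0 + 10 ** ((rating_b - rating_a) / 400.0))


if __name__ == "__main__":
    top_score = 1512  # text-to-image score from this page
    margin = 242      # reported margin over second place
    p = expected_win_rate(top_score, top_score - margin)
    print(f"expected preference rate: {p:.3f}")  # about 0.80
```

Under that assumption, a 242-point gap corresponds to the leading model being preferred in roughly four out of five blind comparisons against the runner-up.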

API pricing (token-based)

Input images: $8 / 1M tokens (or $2 / 1M cached).

Output images: $30 / 1M tokens.

Input text: $5 / 1M tokens.

Output text: $10 / 1M tokens.

Batch API: 50% discount for asynchronous processing within 24 hours.

Estimated cost per 1024×1024 image:

low quality: approx. $0.006,

medium quality: approx. $0.053,

high quality: approx. $0.211.
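
The token rates above can be turned into a small cost estimator. This is a sketch under stated assumptions: the function names are hypothetical, the $2 cached rate is taken to apply to cached image input, and `implied_output_tokens` assumes the per-image estimates consist of output image tokens only, which the page does not state.

```python
# USD per 1M tokens, as listed on this page.
USD_PER_MILLION = {
    "input_text": 5.00,
    "input_image": 8.00,
    "cached_input_image": 2.00,  # assumption: cached rate applies to image input
    "output_image": 30.00,
    "output_text": 10.00,
}
BATCH_DISCOUNT = 0.50  # Batch API: 50% off for asynchronous processing


def request_cost(tokens: dict[str, int], batch: bool = False) -> float:
    """Estimate the USD cost of one request from token counts per billing category."""
    total = sum(USD_PER_MILLION[kind] * count / 1_000_000 for kind, count in tokens.items())
    return total * (1 - BATCH_DISCOUNT) if batch else total


def implied_output_tokens(cost_per_image: float) -> int:
    """Back out the output-token count implied by a per-image cost estimate."""
    return round(cost_per_image / USD_PER_MILLION["output_image"] * 1_000_000)


if __name__ == "__main__":
    for quality, usd in [("low", 0.006), ("medium", 0.053), ("high", 0.211)]:
        print(quality, implied_output_tokens(usd), "output tokens (implied)")
    print(request_cost({"input_text": 1000, "output_image": 7033}, batch=True))
```

An unknown billing category raises `KeyError`, which is deliberate here: silently pricing a misspelled category at zero would understate the estimate.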

Classification

Image generation · Reasoning model · Multimodal

Family: GPT

Applications

Content generation

Access & deployment

API, Hosted

Cloud

Weights: Closed

Key parameters

📥 Input: text, image

Platforms

OpenAI API

Technical specification

Knowledge cutoff

31 Dec 2025

Knowledge boundary

License

Proprietary / Commercial

Hardware requirements

Not applicable — the model is available exclusively via the OpenAI API and ChatGPT (closed cloud). Self-hosting and weight downloads are not supported.

Modalities

⬇ Input

text, image

⬆ Output

image

Capabilities and applications

Native model capabilities

Reasoning

Category: reasoning

Multi-step reasoning

Category: reasoning

Multilingual

Category: language

Planning

Category: planning

Application domains

Content generation

Benchmark results

5 benchmarks

Image Arena — Text-to-Image

Elo score · Evaluated in medium quality mode. Leads the second-ranked model (Nano Banana 2 / Google Gemini 3.1 Flash Image, score 1271) by a reported +242 Elo points, the largest recorded gap between #1 and #2 in the history of the Image Arena platform.

1512 Elo points

📅 19 Apr 2026 · 📄 Arena.ai / Image Arena leaderboard (crowdsourced, blind human preference voting)

Arena.ai is an independent crowdsourcing platform — results may change as new votes are submitted. The score refers to medium quality, not high quality.

Image Arena — Single-Image Edit

Elo score · Ranked first. Margin of +125 points over model no. 2 (Nano Banana Pro).

1513 Elo points

📅 21 Apr 2026 · 📄 Arena.ai / Image Arena leaderboard

Blind human preference voting. Results may evolve over time.

Image Arena — Multi-Image Edit

Elo score · Ranked first. Margin of +90 points over model no. 2 (Nano Banana 2).

1464 Elo points

📅 21 Apr 2026 · 📄 Arena.ai / Image Arena leaderboard

Blind human preference voting.

Image Arena — Text Rendering (sub-category)

Elo improvement vs GPT-Image-1.5 High Fidelity · Largest gain in the text rendering category among all sub-categories. GPT Image 2 ranked #1 in all 7 Text-to-Image sub-categories.

+316 Elo points (improvement over the predecessor)

📅 21 Apr 2026 · 📄 Arena.ai / Image Arena category breakdown; officechai.com report

Data from media coverage of the Arena sub-category review. For reference only.

Internal text rendering benchmark (OpenAI)

Text accuracy · Accuracy of text rendering as declared by OpenAI. The previous model, gpt-image-1.5, achieved 90–95%. The methodology has not been publicly disclosed.

~99% (percent)

📅 21 Apr 2026 · 📄 OpenAI press release at launch, 21 Apr 2026

Manufacturer's declaration — not independently audited.

Pricing

Technical architecture

Model Form

Native Multimodal

Training Techniques

RLHF · Instruction Tuning · CoT

Deployment and security

☁ Available on platforms

☁ OpenAI API (Platform)

🔒 Security / Enterprise

✓ Verified enterprise information

Model available exclusively through OpenAI's cloud infrastructure (closed weights). Thinking mode and advanced features are restricted to paid plans (Plus, Pro, Business, Enterprise). API access requires OpenAI developer account verification; organizational verification may be required for full access to GPT Image models via API.

The model enforces its content policy at generation time: requests that violate the policy return a 400 error (BadRequestError) with a content_policy message. AI-generated content is tagged with AI metadata. Transparent backgrounds (PNG with alpha channel) are not supported in the Responses API tool option; use gpt-image-1.5 for that purpose. Free-tier access is limited to standard/instant mode with a restricted number of generations (approximately 2 images per day, according to tester reports). Streaming, function calling, and structured outputs are not supported by the gpt-image-2 API (confirmed on the model's page).
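
Client code usually needs to tell a policy rejection apart from other 400 errors so that it does not retry a prompt that will never succeed. The sketch below is a rough illustration: the function name is hypothetical, and the response shape (an `error` object with `code`, `type`, and `message` fields) is an assumption based on OpenAI's usual error envelope, not something this page documents for gpt-image-2.

```python
def is_content_policy_rejection(status_code: int, body: dict) -> bool:
    """Return True when a 400 response looks like a content policy rejection.

    Assumption: the error details live under body["error"] and mention
    "content_policy" in the code, type, or message field.
    """
    if status_code != 400:
        return False
    error = body.get("error")
    if not isinstance(error, dict):
        return False
    fields = (error.get("code"), error.get("type"), error.get("message"))
    return any("content_policy" in str(value) for value in fields if value)


if __name__ == "__main__":
    sample = {"error": {"code": "content_policy_violation", "message": "Request rejected."}}
    print(is_content_policy_rejection(400, sample))  # True
```

A caller would typically surface such a rejection to the user and skip retries, while other 400 errors point to a malformed request that can be fixed in code.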

Updated: 26 Apr 2026

Sources and related pages

12 sources

Web · Introducing ChatGPT Images 2.0: OpenAI official announcement · openai.com

Docs · GPT Image 2 Model: OpenAI API Docs · developers.openai.com

Docs · Image Generation Guide: OpenAI API (gpt-image-2 parameters, quality, sizes) · developers.openai.com

Docs · OpenAI API Pricing: official token billing rates for gpt-image-2 · openai.com

Docs · OpenAI API Changelog: gpt-image-2 release entry · developers.openai.com

Docs · GPT Image Generation Models Prompting Guide: OpenAI Cookbook (resolution constraints, quality tiers) · developers.openai.com

Blog · TechCrunch: ChatGPT Images 2.0 is surprisingly good at generating text · techcrunch.com

Blog · The Decoder: ChatGPT Images 2.0 thinks before it generates · the-decoder.com

Blog · Engadget: ChatGPT Images 2.0 is better at rendering non-Latin text · engadget.com

Blog · Wikipedia: GPT Image (model family, autoregressive architecture) · en.wikipedia.org

Blog · Neurohive: ChatGPT Images 2.0: Image Arena scores, API specs, benchmark breakdown · neurohive.io

Blog · OfficeChai: ChatGPT Images 2.0 Tops Arena With Big Jump Over Nano Banana 2 (sub-categories, scoring) · officechai.com
