Robots Atlas>ROBOTS ATLAS
ChatGPT Images 2.0

ChatGPT Images 2.0

gpt-image-2 · Family: GPT
OpenAI's image generation and editing model with built-in O-series reasoning, up to 2K resolution, non-Latin text rendering, and batch generation of up to 8 images per prompt. Available via API as gpt-image-2.
✓ Active✓ Public accessImage generationReasoning modelMultimodal📁 GPT
Release date
21 April 2026
Access:APIHostedDeployment:☁ Cloud

Overview

The information provided about ChatGPT Images 2.0 / gpt-image-2 conflicts with my current knowledge (my knowledge cutoff is October 2024, while the description refers to events from 2026). I am therefore unable to verify or confirm it as fact.

I can, however, summarize and organize what the content describes:

Model and availability

API model name: gpt-image-2.

Replaces: DALL‑E 2, DALL‑E 3, gpt-image-1.5.

Planned deprecation of previous models: May 12, 2026.

Available in: ChatGPT (web, mobile, desktop), the Codex environment, and via the OpenAI API.

Production snapshot: gpt-image-2-2026-04-21.

Alias: chatgpt-image-latest points to the current default image model.

Architecture and key innovation

Built-in reasoning layer based on the O-series model family.

Before generating, the model:

analyzes the prompt,

plans composition and element layout,

models spatial relationships,

can search the web to retrieve current information,

verifies its own output after generation.

Difference from earlier diffusion models: earlier models operated reactively, without an explicit planning step.

Operating modes

Instant mode:

Available free of charge to all ChatGPT users.

Provides better generation quality than previous models, but without a reasoning step.

Thinking mode:

Available to: ChatGPT Plus, Pro, Business, Enterprise.

Features:

web search,

generation of up to 8 coherent images from a single prompt,

output verification.

Technical generation parameters

Resolutions: up to 2K, experimentally above 2560×1440 px.

Aspect ratios: from 3:1 to 1:3.

Output formats: PNG, JPEG, WebP.

Transparent background not supported in Responses API mode.

Text in images

Significantly improved text rendering within images.

Support for Latin and non-Latin scripts (including Japanese, Korean, Chinese, Hindi, Bengali).

Declared text generation accuracy: ~99%.

Image Arena results (arena.ai)

The model achieved an Elo score of 1512 in the text-to-image category.

Margin over second place (Google Gemini 3.1 Flash Image): +242 Elo points.

Ranking positions:

text-to-image: 1512 (1st place),

single-image edit: 1513 (1st place),

multi-image edit: 1464 (1st place).

API pricing (token-based)

Input images: $8 / 1M tokens (or $2 / 1M cached).

Output images: $30 / 1M tokens.

Input text: $5 / 1M tokens.

Output text: $10 / 1M tokens.

Batch API: 50% discount for asynchronous processing within 24 hours.

Estimated cost per 1024×1024 image:

low quality: approx. $0.006,

medium quality: approx. $0.053,

high quality: approx. $0.211.

Since I cannot verify information from the future, I can only treat this as a hypothetical description or a preliminary specification. If you wish, I can use it as a basis for a comparison with earlier models, example prompts, cost estimates for specific use cases, or a draft of API documentation.

Classification
Image generationReasoning modelMultimodal
Family: GPT
Applications
Access & deployment
APIHosted
Cloud
Weights: Closed
Key parameters
📥 Input: text, image
Platforms

Technical specification

Knowledge cutoff
31 Dec 2025
Knowledge boundary
License
Proprietary / Commercial
Hardware requirements
Not applicable — the model is available exclusively via the OpenAI API and ChatGPT (closed cloud). Self-hosting and weight downloads are not supported.
Modalities
⬇ Input
textimage
⬆ Output
image

Capabilities and applications

Native model capabilities
Reasoning
Category: reasoning
Multi-step reasoning
Category: reasoning
Multilingual
Category: language
Planning
Category: planning
Application domains

Benchmark results

5 benchmarks
Image Arena — Text-to-Image
Elo score · Evaluated in medium quality mode. Leads the second-ranked model by +242 Elo points over model #2 (Nano Banana 2 / Google Gemini 3.1 Flash Image, score 1271). The largest recorded gap between #1 and #2 in the history of the Image Arena platform.
1512punkty Elo
📅 19 Apr 2026📄 Arena.ai / Image Arena leaderboard (crowdsourced, blind human preference voting)
Arena.ai is an independent crowdsourcing platform — results may change as new votes are submitted. The score refers to medium quality, not high quality.
Image Arena — Single-Image Edit
Elo score · Ranked first. Margin of +125 points over model no. 2 (Nano Banana Pro).
1513punkty Elo
📅 21 Apr 2026📄 Arena.ai / Image Arena leaderboard
Blind human preference voting. Results may evolve over time.
Image Arena — Multi-Image Edit
Elo score · Ranked first. Margin of +90 points over model no. 2 (Nano Banana 2).
1464punkty Elo
📅 21 Apr 2026📄 Arena.ai / Image Arena leaderboard
Blind human preference voting.
Image Arena — Text Rendering (sub-kategoria)
Elo improvement vs GPT-Image-1.5 High Fidelity · Largest gain in the text rendering category among all sub-categories. GPT Image 2 ranked #1 in all 7 Text-to-Image sub-categories.
+316punkty Elo (poprawa względem poprzednika)
📅 21 Apr 2026📄 Arena.ai / Image Arena category breakdown; raport officechai.com
Data from media coverage of the Arena sub-category review. For reference only.
Wewnętrzny benchmark text rendering (OpenAI)
Text accuracy · Accuracy of text rendering as declared by OpenAI. The previous model, gpt-image-1.5, achieved 90–95%. The methodology has not been publicly disclosed.
~99%procent
📅 21 Apr 2026📄 OpenAI press release przy premierze 21.04.2026
Manufacturer's declaration — not independently audited.

Pricing

Technical architecture

Deployment and security

☁ Available on platforms
🔒 Security / Enterprise
✓ Verified enterprise information

Model available exclusively through OpenAI's cloud infrastructure (closed weights). Thinking mode and advanced features are restricted to paid plans (Plus, Pro, Business, Enterprise). API access requires OpenAI developer account verification; organizational verification may be required for full access to GPT Image models via API.

The model enforces content policy at generation time — requests that violate the policy return a 400 error (BadRequestError) with a content_policy message. AI-generated content is tagged with AI metadata. Transparent background (PNG with alpha channel) is not supported in the Responses API tool option — use gpt-image-1.5 for that purpose. Free tier access is limited to standard/instant mode with a restricted number of generations (approximately 2 images/day per tester reports). Streaming, function calling, and structured outputs are not supported by the gpt-image-2 API (confirmed on the model's page).
Updated: 26 Apr 2026↗ Security documentation