OpenAI o3

o3 · Family: OpenAI o-series
OpenAI reasoning model released on April 16, 2025, with full tool access in ChatGPT, the ability to think with images, and a 200K-token context window. Succeeded by GPT-5.
✓ Active · ✓ Public access · Reasoning model · Multimodal · LLM · 📁 OpenAI o-series
Context window: 200K tokens
Max output: 100,000 tokens
Release date: 16 April 2025
Access: API, Hosted · Deployment: ☁ Cloud

Overview

OpenAI o3 is a reasoning model in the o-series, released on April 16, 2025 alongside o4-mini. It was the first o-series model with full agentic access to every tool inside ChatGPT (web search, Python interpreter, image generation, and file analysis), and it was trained via reinforcement learning to decide when and how to use them. The model also introduced "thinking with images": images become part of the chain of thought and can be manipulated (rotated, zoomed) during reasoning. In the API, o3 has a 200,000-token context window, a 100,000-token maximum output, and a June 1, 2024 knowledge cutoff. The API identifier is o3 (snapshot o3-2025-04-16). Pricing: USD 2 per 1M input tokens (USD 0.50 per 1M cached input tokens) and USD 8 per 1M output tokens. The model has been succeeded by GPT-5 but remains available via the API. An o3-pro variant was also released in June 2025.
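The per-token prices quoted above can be turned into a rough per-request cost estimate. The following is a minimal sketch, not an official billing formula: the function name `estimate_cost_usd` and its parameters are hypothetical, and it assumes that cached input tokens are a subset of the total input tokens and are billed at the cached rate instead of the standard input rate.

```python
from decimal import Decimal

# Prices in USD per 1M tokens, as listed in the overview.
PRICE_INPUT = Decimal("2.00")
PRICE_CACHED_INPUT = Decimal("0.50")
PRICE_OUTPUT = Decimal("8.00")
PER_MILLION = Decimal(1_000_000)


def estimate_cost_usd(
    input_tokens: int,
    output_tokens: int,
    cached_input_tokens: int = 0,
) -> Decimal:
    """Estimate the cost of one request in USD.

    Assumption: `input_tokens` is the total prompt size and
    `cached_input_tokens` is the part of it served from cache.
    """
    if min(input_tokens, output_tokens, cached_input_tokens) < 0:
        raise ValueError("token counts must be non-negative")
    if cached_input_tokens > input_tokens:
        raise ValueError("cached tokens cannot exceed total input tokens")

    uncached_input_tokens = input_tokens - cached_input_tokens
    total = (
        uncached_input_tokens * PRICE_INPUT
        + cached_input_tokens * PRICE_CACHED_INPUT
        + output_tokens * PRICE_OUTPUT
    )
    return total / PER_MILLION


# 50,000 prompt tokens (20,000 of them cached) and 10,000 output tokens:
# 30,000 * 2 + 20,000 * 0.50 + 10,000 * 8 = 150,000, divided by 1M = 0.15 USD
print(estimate_cost_usd(50_000, 10_000, cached_input_tokens=20_000))
```

`Decimal` is used here so that the arithmetic on currency amounts stays exact; plain floats would also work for a rough estimate.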

Classification: Reasoning model, Multimodal, LLM
Access & deployment: API, Hosted, Cloud
Weights: Closed
Key parameters
📏 Context: 200K
Tools
📥 Input: text, image

Technical specification

Context window: 200K tokens
Max output tokens: 100,000 tokens per response
Knowledge cutoff (knowledge boundary): 1 Jun 2024
Features: Tool use
Modalities
⬇ Input: text, image
⬆ Output: text, code
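The two limits in this specification interact: a long prompt leaves less room for the response. The sketch below shows one way to compute a safe output budget. It assumes that prompt and output tokens share the 200,000-token context window, which is the usual convention but is not stated on this page; the function name `max_output_budget` is hypothetical.

```python
CONTEXT_WINDOW = 200_000     # tokens, from the specification above
MAX_OUTPUT_TOKENS = 100_000  # tokens per response, from the specification above


def max_output_budget(prompt_tokens: int) -> int:
    """Return the largest output size that still fits alongside the prompt.

    Assumption: prompt and output tokens are counted against the same
    context window.
    """
    if prompt_tokens < 0:
        raise ValueError("prompt_tokens must be non-negative")
    remaining = CONTEXT_WINDOW - prompt_tokens
    # Capped by the per-response limit and never negative.
    return max(0, min(MAX_OUTPUT_TOKENS, remaining))


print(max_output_budget(50_000))   # the per-response cap applies
print(max_output_budget(150_000))  # the context window is the tighter limit
print(max_output_budget(250_000))  # the prompt alone exceeds the window
```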

Capabilities and applications

Native model capabilities
Reasoning
Category: reasoning
Multi-step reasoning
Category: reasoning
Coding
Category: coding
Long context
Category: reasoning
Multilingual
Category: language
Image understanding
Category: vision
Multimodal understanding
Category: multimodal
Function Calling
Category: planning
Parallel Tool Calls
Ability to invoke multiple external tools simultaneously while generating a response.
Category: reasoning
Planning
Category: planning
Agentic capability
The model's ability to autonomously plan and execute multi-step tasks by sequentially using tools, maintaining context, and adapting to intermediate results.
Category: planning
Computer use
The model's ability to operate a computer interface by interpreting screenshots and generating actions such as clicks, typing, and navigating applications.
Category: planning
Structured output
Category: structured_generation
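The "Parallel Tool Calls" capability listed above has a client-side counterpart: when a model requests several tool calls in one turn, the application can run them concurrently and return the results together. The following is an illustrative sketch only. The tools `get_weather` and `get_time`, the `TOOLS` registry, and the shape of the call dictionaries are all hypothetical and do not reproduce any provider's wire format.

```python
import asyncio


# Hypothetical tools; real ones would call external services.
async def get_weather(city: str) -> str:
    await asyncio.sleep(0.01)  # stand-in for network latency
    return f"weather for {city}"


async def get_time(city: str) -> str:
    await asyncio.sleep(0.01)
    return f"time in {city}"


TOOLS = {"get_weather": get_weather, "get_time": get_time}


async def run_tool_calls(calls: list[dict]) -> list[dict]:
    """Run every requested tool call concurrently.

    Each call is a dict with hypothetical keys "id", "name", "arguments".
    Results come back in the same order as the requests.
    """

    async def run_one(call: dict) -> dict:
        tool = TOOLS[call["name"]]
        output = await tool(**call["arguments"])
        return {"id": call["id"], "output": output}

    # asyncio.gather preserves the order of its inputs.
    return await asyncio.gather(*(run_one(call) for call in calls))


requested = [
    {"id": "call_1", "name": "get_weather", "arguments": {"city": "Oslo"}},
    {"id": "call_2", "name": "get_time", "arguments": {"city": "Oslo"}},
]
results = asyncio.run(run_tool_calls(requested))
print(results)
```

Keying each result by the call `id` lets the application match outputs back to the requests even when the tools finish in a different order than they started.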

Benchmark results

6 benchmarks
Codeforces
Elo rating · High reasoning effort, with tools
2727 points
📅 16 Apr 2025 · 📄 OpenAI announcement (Introducing OpenAI o3 and o4-mini)
SWE-bench
accuracy · SWE-bench Verified, fixed subset n=477, no custom scaffold
69.1%
📅 16 Apr 2025 · 📄 OpenAI announcement (Introducing OpenAI o3 and o4-mini)
MMMU
accuracy · Multimodal understanding, high reasoning effort
82.9%
📅 16 Apr 2025 · 📄 OpenAI announcement (Introducing OpenAI o3 and o4-mini)
AIME 2025
pass@1 · AIME 2025 with tool access (Python). The score without tools is lower, so this figure is not comparable to results from models evaluated without tool access.
98.4%
📅 16 Apr 2025 · 📄 OpenAI announcement (Introducing OpenAI o3 and o4-mini)
GPQA
accuracy · GPQA Diamond, high reasoning effort
83.3%
📅 16 Apr 2025 · 📄 OpenAI announcement (Introducing OpenAI o3 and o4-mini)
Humanity's Last Exam (HLE)
accuracy · Humanity's Last Exam, no tools
20.32%
📅 16 Apr 2025 · 📄 OpenAI announcement (Introducing OpenAI o3 and o4-mini)

Pricing

Deployment and security