OpenAI flagship video-audio model: improved physics and controllability over Sora 1, synchronised dialogue and sound-effect generation, and a "characters" feature.
Release date
30 September 2025
Access:HostedAPIDeployment:โ Cloud
Overview
Access & deployment
HostedAPI
Cloud
Weights: Closed
Key parameters
๐ฅ Input: text, image, video
Technical specification
Modalities
โฌ Input
textimagevideo
โฌ Output
videoaudio
Capabilities and applications
Native model capabilities
Video generation
The model's ability to generate video clips from a text prompt, image or another video, with control over length, resolution and visual characteristics.
Category: video
Image-to-video
The model's ability to animate a static input image โ extending it in time into a consistent video clip according to a description of motion or action.
Category: video
Video understanding
The model's ability to analyse and interpret video content โ recognising actions, motion, events and relationships between objects over time.
Category: video
Technical architecture
Core Architecture
Model Form
