Streaming speech-to-speech translation model served via OpenAI's dedicated Realtime translation endpoint for live multilingual audio.
Context window
16K tokens
tokens
Max output
2,000
tokens
Access:APIDeployment:☁ Cloud
Overview
Access & deployment
API
Cloud
Weights: Closed
Key parameters
📏 Context: 16K tokens
📥 Input: audio
Technical specification
Context window
16K tokens
tokens
Max output tokens
2,000
tokens per response
Knowledge cutoff
30 Sept 2024
Knowledge boundary
Modalities
⬇ Input
audio
⬆ Output
audiotext
Capabilities and applications
Native model capabilities
Live Translation
Real-time speech translation between multiple languages without interrupting the audio stream.
Category: speech
Streaming Speech-to-Text
Real-time conversion of speech to text with immediate output as the speaker is talking.
Category: speech
Technical architecture
Core Architecture
