DeepSeek-V3
Open-weight Mixture-of-Experts language model with 671B total parameters (37B activated per token), developed by DeepSeek AI and released in December 2024.
Technical specification
Modalities
Capabilities (9), listed as capability: category
Reasoning★: Reasoning
Multi-step reasoning★: Reasoning
Long context★: Reasoning
Coding★: Coding
Function Calling: Planning
Structured output★: Structured gen.
Multilingual★: Language
Planning★: Planning
Streaming output: Reasoning
