Flagship open-source agentic model from Z.ai (Zhipu AI), based on a Mixture-of-Experts architecture with 744B total parameters and 40B active parameters.
Context window
200K
tokens
Parameters
744B total (40B active per token)
parameters
Max output
128
tokens
Release date
7 April 2026
Access:APIDownloadDeployment:💻 Local☁ Cloud
Overview
Access & deployment
APIDownload
LocalCloud
Weights: Open weights
Key parameters
📏 Context: 200K
🧩 Parameters: 744B total (40B active per token)
✓ Tools
📥 Input: text
Platforms
Technical specification
Context window
200K
tokens
Parameters
744B total (40B active per token)
parameters
Max output tokens
128
tokens per response
License
MIT
Hardware requirements
Trained on Huawei Ascend 910B (no Nvidia). Local deployment requires an enterprise GPU cluster. Full BF16 model ~1.49 TB.
Features:✓ Tool use
Modalities
⬇ Input
text
⬆ Output
textcodestructured_data
Capabilities and applications
Native model capabilities
Coding
Generating, analysing and modifying source code.
Category: coding
Reasoning
The model's ability to reason logically and solve complex problems.
Category: reasoning
Multi-step reasoning
Carrying out multi-step chains of reasoning across long, complex tasks.
Category: reasoning
Multilingual
Understanding and generating text in many languages.
Category: language
Planning
Forming and executing action plans for complex tasks.
Category: planning
Structured output
Producing data in structured formats such as JSON.
Category: structured_generation
Function Calling
Category: planning
Long context
Maintaining coherence and focus across very long input context.
Category: language
Streaming output
Category: reasoning
Application domains
Benchmark results
7 benchmarks
SWE-bench
58.4%
📅 7 Apr 2026📄 Z.ai (self-reported)
Self-reported by Z.ai. Ranked first among all models on SWE-Bench Pro at the date of publication. Result has not been independently verified.
GPQA
86.2%
📅 7 Apr 2026📄 Z.ai (self-reported)
Result self-reported by Z.ai from the official model card on HuggingFace.
HLE (Humanity's Last Exam)
accuracy · without tools
31.0%
📅 7 Apr 2026📄 Z.ai / zai-org (self-reported, official HuggingFace model card)
Result from the benchmark table in the official model card on HuggingFace (zai-org/GLM-5.1). Self-reported by Z.ai.
HLE (Humanity's Last Exam) with Tools
accuracy · with tools
52.3%
📅 7 Apr 2026📄 Z.ai / zai-org (self-reported, official HuggingFace model card)
Result from the benchmark table in the official model card on HuggingFace (zai-org/GLM-5.1). Self-reported by Z.ai.
AIME 2026
accuracy · competition math
95.3%
📅 7 Apr 2026📄 Z.ai / zai-org (self-reported, official HuggingFace model card)
Result from the benchmark table in the official model card on HuggingFace (zai-org/GLM-5.1). Self-reported by Z.ai.
BrowseComp
accuracy · without context management
68.0%
📅 7 Apr 2026📄 Z.ai / zai-org (self-reported, official HuggingFace model card)
Result from the benchmark table in the official model card on HuggingFace (zai-org/GLM-5.1). Self-reported by Z.ai.
CyberGym
accuracy
68.7%
📅 7 Apr 2026📄 Z.ai / zai-org (self-reported, official HuggingFace model card)
Top result among models in the table. Self-reported by Z.ai.
Pricing
Technical architecture
Deployment and security
☁ Available on platforms
