Base pretrained DBRX model without instruction tuning. 132B total parameters, 36B active (MoE 16 experts, top-4). Pretrained on 12T tokens, 32K context window.
Context window
32K
tokens
Parameters
132B total / 36B active
parameters
Max output
32,000
tokens
Release date
27 March 2024
Access:APIDownloadDeployment:โ Cloud๐ป Local
Overview
Applications
Access & deployment
APIDownload
CloudLocal
Weights: Open weights
Key parameters
๐ Context: 32K
๐งฉ Parameters: 132B total / 36B active
โ Fine-tuning
๐ฅ Input: text
Platforms
Technical specification
Context window
32K
tokens
Parameters
132B total / 36B active
parameters
Max output tokens
32,000
tokens per response
Knowledge cutoff
1 Dec 2023
Knowledge boundary
License
Databricks Open Model License
Hardware requirements
Training: 3,072x NVIDIA H100 connected by 3.2 Tbps InfiniBand. Inference: enterprise-class GPUs (e.g. 8x H100 or A100) with TensorRT-LLM; 8-bit quantization supported.
Features:โ Fine-tuning
Modalities
โฌ Input
text
โฌ Output
textcode
Capabilities and applications
Native model capabilities
Coding
Generating, analysing and modifying source code.
Category: coding
Reasoning
The model's ability to reason logically and solve complex problems.
Category: reasoning
Long context
Maintaining coherence and focus across very long input context.
Category: language
Multilingual
Understanding and generating text in many languages.
Category: language
Application domains
Benchmark results
1 benchmark
MMLU
accuracy ยท 5-shot
73.7%
๐ Databricks DBRX blog (2024-03-27)
Score from Table 1 in the DBRX blog (DBRX Instruct). DBRX Base scores not separately reported.
Deployment and security
โ Available on platforms
