Mid-size DBRX family member: 23.5B total parameters, 6.6B active. Used to study MoE training efficiency. Achieves 45.5% on the Databricks Gauntlet with 1.7x fewer FLOPs than LLaMA2-13B (13B active parameters).
Context window
32K
tokens
Parameters
23.5B total / 6.6B active
parameters
Release date
27 March 2024
Access:APIDeployment:โ Cloud
Overview
Applications
Access & deployment
API
Cloud
Weights: Closed
Key parameters
๐ Context: 32K
๐งฉ Parameters: 23.5B total / 6.6B active
๐ฅ Input: text
Technical specification
Context window
32K
tokens
Parameters
23.5B total / 6.6B active
parameters
License
Databricks internal / research
Hardware requirements
Internal Databricks research model; no public checkpoint available.
Modalities
โฌ Input
text
โฌ Output
textcode
Capabilities and applications
Application domains
Benchmark results
1 benchmark
Databricks Model Gauntlet v0.3
avg score ยท composite avg of 30+ tasks
45.5%
๐ Databricks DBRX blog (2024-03-27)
Technical architecture
Core Architecture
Model Form
Sources and related pages
1 source
Browse related topics
