Second generation of the Mamba architecture (Selective SSM) with the SSD layer, 2–8× faster than Mamba while remaining competitive with Transformers.
Parameters
130M – 2.7B
parameters
Release date
31 May 2024
Access:DownloadDeployment:💻 Local
Overview
Access & deployment
Download
Local
Weights: Open source
Key parameters
🧩 Parameters: 130M – 2.7B
📥 Input: text
Technical specification
Parameters
130M – 2.7B
parameters
License
Apache-2.0
Modalities
⬇ Input
text
⬆ Output
text
Capabilities and applications
Native model capabilities
Language modeling
Ability to predict subsequent tokens and generate coherent natural-language text based on the preceding context.
Category: language
Long context
The model's ability to handle long context and maintain coherence over a large amount of input data.
Category: reasoning
