Open-weight dense language model by Meta AI with 405 billion parameters, 128K token context window, and tool use support, released July 23, 2024.
Context window
128K tokenów
tokens
Parameters
405B
parameters
Release date
23 July 2024
Access:APIDownloadDeployment:☁ Cloud💻 Local
Overview
Access & deployment
APIDownload
CloudLocal
Weights: Open weights
Key parameters
📏 Context: 128K tokenów
🧩 Parameters: 405B
✓ Tools · ✓ Fine-tuning
📥 Input: text, image
Technical specification
Context window
128K tokenów
tokens
Parameters
405B
parameters
License
Llama 3.1 Community License
Hardware requirements
No official minimum requirements are specified.
Local deployment requires a datacenter-class GPU cluster (e.g., NVIDIA H100 / A100).
Features:✓ Tool use✓ Fine-tuning
Modalities
⬇ Input
textimage
⬆ Output
textcodestructured_data
Capabilities and applications
Native model capabilities
Reasoning
The model's ability to reason logically and solve complex problems.
Category: reasoning
Multi-step reasoning
Carrying out multi-step chains of reasoning across long, complex tasks.
Category: reasoning
Long context
Maintaining coherence and focus across very long input context.
Category: language
Coding
Generating, analysing and modifying source code.
Category: coding
Function Calling
Category: planning
Structured output
Producing data in structured formats such as JSON.
Category: structured_generation
Image understanding
Analysing and interpreting the content of images.
Category: vision
Chart understanding
Reading and interpreting charts, tables and diagrams.
Category: vision
OCR
Recognising text within images and documents.
Category: vision
Multilingual
Understanding and generating text in many languages.
Category: language
Planning
Forming and executing action plans for complex tasks.
Category: planning
Streaming output
Category: reasoning
Benchmark results
7 benchmarks
MMLU
88.6
📄 Meta AI – Llama 3.1 Technical Report
GSM8K
96.8
📄 Meta AI – Llama 3.1 Technical Report
HumanEval
89.0
📄 Meta AI – Llama 3.1 Technical Report
MATH
73.8
📄 Meta AI – Llama 3.1 Technical Report
GPQA
0-shot accuracy · Instruction-tuned model (Llama 3.1 405B Instruct), 0-shot
50.7%
📄 Meta Llama 3.1 model card (Hugging Face), July 2024
MMLU-Pro
5-shot accuracy · Instruction-tuned model, 5-shot
73.3%
📄 Meta Llama 3.1 model card (Hugging Face), July 2024
DROP
3-shot F1 · Pre-trained base model (Llama 3.1 405B Base)
84.8%
📄 IBM analysis referencing Meta Llama 3.1 model card, July 2024
Result applies to the base (pre-trained) model, not the instruction-tuned variant.
Sources and related pages
8 sources
ReportThe Llama 3.1 Model FamilyWebIntroducing Llama 3.1Repometa-llama modelsDocsLlama 3.1 405B – Hugging Face Model Card (base)DocsLlama 3.1 405B Instruct – Hugging Face Model CardBlogIntroducing Llama 3.1 – Meta AI BlogWebLlama 3.1 Official Page – llama.comRepoLlama 3.1 Eval Details – meta-llama/llama-models GitHub
Browse related topics
