Evolutionary Model Merge

Method using evolutionary algorithms to automatically discover effective weight and layer combinations of open-source foundation models.

🔬 Research🔬 Research only⚖ Open weightsSpecialized AIMultimodalLLM

Parameters

7B / 10B

parameters

Release date

21 March 2024

🏢Sakana AIProducer

Access:DownloadHostedDeployment:💻 Local☁ Cloud

Overview

Evolutionary Model Merge is a method introduced by Sakana AI on March 21, 2024 that automates foundation model development by combining existing open-source models using evolutionary algorithms (CMA-ES). The method operates in two complementary spaces: Parameter Space (PS) — evolutionary mixing of weights from multiple models — and Data Flow Space (DFS) — evolutionary selection and ordering of layers from different source models. Both approaches can be combined to discover non-trivial, counter-intuitive combinations that human experts would unlikely find manually.

The first release produced three Japanese foundation models: EvoLLM-JP (a Japanese math LLM in 7B and 10B variants, merged from Shisa Gamma 7B v1, WizardMath-7B-V1.1 and Abel-7B-002, plus an EvoLLM-JP-A variant using Arithmo2-Mistral-7B), EvoVLM-JP (a 7B Japanese vision-language model merged from LLaVA-1.6-Mistral-7B and Shisa Gamma 7B v1), and EvoSDXL-JP (a Japanese-aware 4-step SDXL diffusion model).

EvoLLM-JP scored 52.0% (7B) and 55.6% (10B) on MGSM-JA, surpassing all Japanese LLMs below 70B parameters and previous 70B SOTA models. EvoVLM-JP achieved 19.70 ROUGE-L on JA-VG-VQA-500 and 51.25 on JA-VLM-Bench-In-the-Wild, beating the base LLaVA-1.6-Mistral-7B and prior Japanese VLMs. The work was published in Nature Machine Intelligence on January 27, 2025.

Classification

Specialized AIMultimodalLLM

Access & deployment

DownloadHosted

LocalCloud

Weights: Open weights

Key parameters

🧩 Parameters: 7B / 10B

📥 Input: text, image

Technical specification

Parameters

7B / 10B

parameters

License

Apache 2.0 (code, EvoLLM-JP-A-v1-7B, EvoVLM-JP-v1-7B); Microsoft Research License (EvoLLM-JP-v1-7B/10B — research-only)

Modalities

⬇ Input

textimage

⬆ Output

text

Benchmark results

6 benchmarks

MGSM-JA

Accuracy · EvoLLM-JP-v1-7B, Japanese math word problems

52.0%

📄 Sakana AI / arXiv:2403.13187

MGSM-JA

Accuracy · EvoLLM-JP-v1-10B

55.6%

📄 Sakana AI / arXiv:2403.13187

MGSM-JA

Accuracy · EvoLLM-JP-A-v1-7B (Apache 2.0 variant)

52.4%

📄 Sakana AI / GitHub README

Japanese lm-evaluation-harness (avg of 9 tasks)

Average score · EvoLLM-JP-v1-7B; exceeds prior 70B Japanese SOTA

70.5

📄 Sakana AI / arXiv:2403.13187

JA-VG-VQA-500

ROUGE-L · EvoVLM-JP-v1-7B Japanese visual question answering

19.70

📄 Sakana AI / arXiv:2403.13187

JA-VLM-Bench-In-the-Wild

ROUGE-L · EvoVLM-JP-v1-7B; beats LLaVA-1.6-Mistral-7B (41.10) and Japanese Stable VLM (40.50)

51.25

📄 Sakana AI / arXiv:2403.13187

Sources and related pages

8 sources

WebEvolutionary Model Merge - Sakana AIsakana.ai RepoEvoLLM GitHubgithub.com PaperEvolutionary Optimization of Model Merging Recipes (arXiv:2403.13187)arxiv.org PaperEvolutionary optimization of model merging recipes — Nature Machine Intelligence (2025)nature.com RepoSakanaAI/EvoLLM-JP-v1-7B (Hugging Face)huggingface.co RepoSakanaAI/EvoLLM-JP-v1-10B (Hugging Face)huggingface.co RepoSakanaAI/EvoLLM-JP-A-v1-7B (Hugging Face)huggingface.co RepoSakanaAI/EvoVLM-JP-v1-7B (Hugging Face)huggingface.co

Browse related topics

All specialized model models All multimodal model models