Robots Atlas>ROBOTS ATLAS
Evolutionary Model Merge

Evolutionary Model Merge

v1
Method using evolutionary algorithms to automatically discover effective weight and layer combinations of open-source foundation models.
๐Ÿ”ฌ Research๐Ÿ”ฌ Research onlyโš– Open weightsSpecialized AIMultimodalLLM
Parameters
7B / 10B
parameters
Release date
21 March 2024
Access:DownloadHostedDeployment:๐Ÿ’ป Localโ˜ Cloud

Overview

Evolutionary Model Merge is a method introduced by Sakana AI on March 21, 2024 that automates foundation model development by combining existing open-source models using evolutionary algorithms (CMA-ES). The method operates in two complementary spaces: Parameter Space (PS) โ€” evolutionary mixing of weights from multiple models โ€” and Data Flow Space (DFS) โ€” evolutionary selection and ordering of layers from different source models. Both approaches can be combined to discover non-trivial, counter-intuitive combinations that human experts would unlikely find manually.

The first release produced three Japanese foundation models: EvoLLM-JP (a Japanese math LLM in 7B and 10B variants, merged from Shisa Gamma 7B v1, WizardMath-7B-V1.1 and Abel-7B-002, plus an EvoLLM-JP-A variant using Arithmo2-Mistral-7B), EvoVLM-JP (a 7B Japanese vision-language model merged from LLaVA-1.6-Mistral-7B and Shisa Gamma 7B v1), and EvoSDXL-JP (a Japanese-aware 4-step SDXL diffusion model).

EvoLLM-JP scored 52.0% (7B) and 55.6% (10B) on MGSM-JA, surpassing all Japanese LLMs below 70B parameters and previous 70B SOTA models. EvoVLM-JP achieved 19.70 ROUGE-L on JA-VG-VQA-500 and 51.25 on JA-VLM-Bench-In-the-Wild, beating the base LLaVA-1.6-Mistral-7B and prior Japanese VLMs. The work was published in Nature Machine Intelligence on January 27, 2025.

Classification
Specialized AIMultimodalLLM
Access & deployment
DownloadHosted
LocalCloud
Weights: Open weights
Key parameters
๐Ÿงฉ Parameters: 7B / 10B
๐Ÿ“ฅ Input: text, image

Technical specification

Parameters
7B / 10B
parameters
License
Apache 2.0 (code, EvoLLM-JP-A-v1-7B, EvoVLM-JP-v1-7B); Microsoft Research License (EvoLLM-JP-v1-7B/10B โ€” research-only)
Modalities
โฌ‡ Input
textimage
โฌ† Output
text

Benchmark results

6 benchmarks
MGSM-JA
Accuracy ยท EvoLLM-JP-v1-7B, Japanese math word problems
52.0%
๐Ÿ“„ Sakana AI / arXiv:2403.13187
MGSM-JA
Accuracy ยท EvoLLM-JP-v1-10B
55.6%
๐Ÿ“„ Sakana AI / arXiv:2403.13187
MGSM-JA
Accuracy ยท EvoLLM-JP-A-v1-7B (Apache 2.0 variant)
52.4%
๐Ÿ“„ Sakana AI / GitHub README
Japanese lm-evaluation-harness (avg of 9 tasks)
Average score ยท EvoLLM-JP-v1-7B; exceeds prior 70B Japanese SOTA
70.5
๐Ÿ“„ Sakana AI / arXiv:2403.13187
JA-VG-VQA-500
ROUGE-L ยท EvoVLM-JP-v1-7B Japanese visual question answering
19.70
๐Ÿ“„ Sakana AI / arXiv:2403.13187
JA-VLM-Bench-In-the-Wild
ROUGE-L ยท EvoVLM-JP-v1-7B; beats LLaVA-1.6-Mistral-7B (41.10) and Japanese Stable VLM (40.50)
51.25
๐Ÿ“„ Sakana AI / arXiv:2403.13187