RL
RLWRLD foundation model for dexterous manipulation, built on the Multi-Stream Action Transformer (MSAT) architecture with dedicated streams for vision, tactile, torque, and memory.
โ Activeโ Public accessโ Open weightsRobotics foundation modelVision-Language-Action modelMultimodal
Parameters
8.1B (mid-trained)
parameters
Release date
7 May 2026
Access:DownloadDeployment:๐ป Local๐ฑ On-device
Overview
Classification
Robotics foundation modelVision-Language-Action modelMultimodal
Access & deployment
Download
LocalOn-device
Weights: Open weights
Key parameters
๐งฉ Parameters: 8.1B (mid-trained)
โ Fine-tuning
๐ฅ Input: image, video, text, robot sensorsโฆ
Robotics
Dexterous manipulationBimanual manipulationRobot manipulation
Technical specification
Parameters
8.1B (mid-trained)
parameters
License
Open weights (Hugging Face โ RLWRLD)
Hardware requirements
Inference optimized for NVIDIA RTX 5090 + Intel Core Ultra 7 265K class hardware (p50 latency ~43 ms for the all-modality variant via static graph + CUDA Graph + kernel fusion).
Features:โ Fine-tuning
Modalities
โฌ Input
imagevideotextrobot_sensorsrobot_state_data
โฌ Output
robot_actionsmotion_trajectoriesmanipulator_controlrobot_commands
Capabilities and applications
Native model capabilities
Image understanding
Category: vision
Video Understanding
Category: video
Multimodal understanding
Category: multimodal
Planning
Category: planning
Reasoning
Category: reasoning
Multi-step reasoning
Category: reasoning
Robotics
Dexterous manipulationBimanual manipulationRobot manipulation
Benchmark results
10 benchmarks
LIBERO
average success rate ยท RLDX-1-PT, simulation
97.8%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
RoboCasa Kitchen
average success rate ยท RLDX-1-PT vs GR00T N1.6 66.2 / ฯโ.โ
62.1
70.6%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
RoboCasa GR-1 Tabletop
average success rate ยท RLDX-1-PT, humanoid suite (+10.7%p vs GR00T N1.5 48.0)
58.7%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
RoboCasa 365
average success rate ยท RLDX-1-PT, long-horizon multi-stage (+5.2%p vs GR00T N1.6 26.9)
32.1%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
SIMPLER Google-VM
average success rate ยท RLDX-1-PT, simulation
81.5%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
LIBERO-Plus
total robustness ยท RLDX-1-PT vs GR00T N1.6 72.6 / ฯโ-FAST 64.2
86.7%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
ALLEX Conveyor Pick-and-Place
success rate ยท RLDX-1-MT-ALLEX, real-world
87.5%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
ALLEX Object-in-Box Selection
success rate ยท RLDX-1-MT-ALLEX, real-world
91.7%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
ALLEX Pot-to-Cup Pouring
success rate ยท RLDX-1-MT-ALLEX, real-world
70.8%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
DROID Shell Game (memory)
success rate ยท RLDX-1-MT-DROID, Franka Research 3 + AnySkin
91.7%
๐
7 May 2026๐ RLWRLD Tech Report (arXiv:2605.03269)
Technical architecture
Core Architecture
Training Techniques