Robots Atlas>ROBOTS ATLAS
RL

RLDX-1

1
RLWRLD foundation model for dexterous manipulation, built on the Multi-Stream Action Transformer (MSAT) architecture with dedicated streams for vision, tactile, torque, and memory.
โœ“ Activeโœ“ Public accessโš– Open weightsRobotics foundation modelVision-Language-Action modelMultimodal
Parameters
8.1B (mid-trained)
parameters
Release date
7 May 2026
Access:DownloadDeployment:๐Ÿ’ป Local๐Ÿ“ฑ On-device

Overview

RLDX-1 is a dexterity-first foundation model for robotic hands developed by RLWRLD and introduced in May 2026. The model uses the proprietary Multi-Stream Action Transformer (MSAT) architecture, in which each modality (vision, language, proprioception, memory, tactile, torque) is processed in its own dedicated stream and joint self-attention fuses them before action decoding.

RLDX-1 uses a fine-tuned Qwen3-VL 8B as its vision-language backbone (RLDX-1-VLM). The model integrates a Motion Module (multi-frame compression through cognition video tokens), a Physics Module (tactile and torque stream with future-contact-state prediction) and a Cognition Interface with 64 cognition tokens that doubles as the substrate for the long-horizon Memory Module.

RLDX-1 ships in three checkpoints: RLDX-1-PT (pre-trained, embodiment-agnostic) and two 8.1B mid-trained variants โ€” RLDX-1-MT-ALLEX (ALLEX humanoid) and RLDX-1-MT-DROID (Franka Research 3 with AnySkin tactile). The training pipeline includes pre-training, mid-training for the target embodiment, and post-training with DAgger plus Progress-Aware RL driven by a dedicated VLM-critic. Weights and code are released on Hugging Face and GitHub.

In simulation, RLDX-1 reaches 97.8 on LIBERO, 70.6 on RoboCasa Kitchen, 58.7 on RoboCasa GR-1 Tabletop and 32.1 on RoboCasa 365, outperforming ฯ€โ‚€.โ‚…, ฯ€โ‚€-FAST and GR00T N1.5/N1.6. On the real ALLEX benchmark (Conveyor Pick-and-Place, Object-in-Box Selection, Pot-to-Cup Pouring) RLDX-1-MT-ALLEX scores 87.5%, 91.7% and 70.8% respectively, versus below 30% for the baselines.

Classification
Robotics foundation modelVision-Language-Action modelMultimodal
Access & deployment
Download
LocalOn-device
Weights: Open weights
Key parameters
๐Ÿงฉ Parameters: 8.1B (mid-trained)
โœ“ Fine-tuning
๐Ÿ“ฅ Input: image, video, text, robot sensorsโ€ฆ
Robotics
Dexterous manipulationBimanual manipulationRobot manipulation

Technical specification

Parameters
8.1B (mid-trained)
parameters
License
Open weights (Hugging Face โ€” RLWRLD)
Hardware requirements
Inference optimized for NVIDIA RTX 5090 + Intel Core Ultra 7 265K class hardware (p50 latency ~43 ms for the all-modality variant via static graph + CUDA Graph + kernel fusion).
Features:โœ“ Fine-tuning
Modalities
โฌ‡ Input
imagevideotextrobot_sensorsrobot_state_data
โฌ† Output
robot_actionsmotion_trajectoriesmanipulator_controlrobot_commands

Capabilities and applications

Native model capabilities
Image understanding
Category: vision
Video Understanding
Category: video
Multimodal understanding
Category: multimodal
Planning
Category: planning
Reasoning
Category: reasoning
Multi-step reasoning
Category: reasoning
Robotics
Dexterous manipulationBimanual manipulationRobot manipulation

Benchmark results

10 benchmarks
LIBERO
average success rate ยท RLDX-1-PT, simulation
97.8%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
RoboCasa Kitchen
average success rate ยท RLDX-1-PT vs GR00T N1.6 66.2 / ฯ€โ‚€.โ‚… 62.1
70.6%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
RoboCasa GR-1 Tabletop
average success rate ยท RLDX-1-PT, humanoid suite (+10.7%p vs GR00T N1.5 48.0)
58.7%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
RoboCasa 365
average success rate ยท RLDX-1-PT, long-horizon multi-stage (+5.2%p vs GR00T N1.6 26.9)
32.1%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
SIMPLER Google-VM
average success rate ยท RLDX-1-PT, simulation
81.5%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
LIBERO-Plus
total robustness ยท RLDX-1-PT vs GR00T N1.6 72.6 / ฯ€โ‚€-FAST 64.2
86.7%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
ALLEX Conveyor Pick-and-Place
success rate ยท RLDX-1-MT-ALLEX, real-world
87.5%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
ALLEX Object-in-Box Selection
success rate ยท RLDX-1-MT-ALLEX, real-world
91.7%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
ALLEX Pot-to-Cup Pouring
success rate ยท RLDX-1-MT-ALLEX, real-world
70.8%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)
DROID Shell Game (memory)
success rate ยท RLDX-1-MT-DROID, Franka Research 3 + AnySkin
91.7%
๐Ÿ“… 7 May 2026๐Ÿ“„ RLWRLD Tech Report (arXiv:2605.03269)