
AgiBot's generalist embodied foundation model (launched March 10, 2025) — ViLLA (Vision-Language-Latent-Action) architecture combining a VLM, Latent Planner and Action Expert in a single policy driving heterogeneous robot platforms.
✓ Active🏢 Enterprise★ FeaturedRobotics foundation modelVision-Language-Action model
Release date
10 March 2025
Deployment:📱 On-device☁ Cloud
Overview
Classification
Robotics foundation modelVision-Language-Action model
Applications
Access & deployment
On-deviceCloud
Weights: Closed
Key parameters
✓ Fine-tuning
📥 Input: image, video, text, robot sensors…
Robotics
Robot manipulationBimanual manipulationDexterous manipulationRobot controlScene understandingEmbodied task planning
Technical specification
License
Proprietary (closed)
Hardware requirements
Deployed locally on NVIDIA Jetson Thor T5000 (2,070 TFLOPS FP4, control latency <10 ms) in the AGIBOT G2 humanoid. Training requires data-center class GPU clusters.
Features:✓ Fine-tuning
Modalities
⬇ Input
imagevideotextrobot_sensorsrobot_state_data
⬆ Output
robot_actionsrobot_commandsmotion_trajectoriesmanipulator_control
Capabilities and applications
Native model capabilities
Cross-embodiment transfer
The ability of a single model to control robots with different morphologies (humanoids, dual-arm rigs, mobile platforms) without training a separate model per platform. Intelligence is decoupled from embodiment, so the same policy runs on hardware with different kinematics and dynamics.
Category: robotics
Vision-language-action grounding
The ability of a VLA model to ground visual perception and a language instruction into a concrete physical robot action. The model understands the scene and intent, then generates an executable action sequence, closing the loop from observation to motion.
Category: robotics
Planning
Forming and executing action plans for complex tasks.
Category: planning
Reasoning
The model's ability to reason logically and solve complex problems.
Category: reasoning
Multimodal understanding
Category: multimodal
Robotics
Robot manipulationBimanual manipulationDexterous manipulationRobot controlScene understandingEmbodied task planning
Application domains
Technical architecture
Core Architecture
Training Techniques
Deployment and security
🤖 Related robots
Sources and related pages
5 sources
PaperAgiBot GO-1 White Paper (PDF)BlogNewsfile / 41Caijing — AgiBot Innovates Robotics with the Launch of Genie Operator-1 (GO-1) (10.03.2025)WebAgiBot World Colosseo — OpenDriveLab (dataset 1M+ real robot demonstrations, GO-1 reference)LinkAgiBot GO-1 Official Launch Video (YouTube)BlogPR Newswire — Agibot Unveils Next-Gen Industrial Embodied Robot G2 (powered by GO-1, 16.10.2025)