Robots Atlas>ROBOTS ATLAS
KAI World Model

Perception · Runtime & Infrastructure

KAI World Model

0.5.0·Kinetix AI

Beta
CATEGORYPerception · Runtime & Infrastructure
READINESSTRL 5
ADOPTION SCALEResearch / Prototype
LICENSESLicenseRef-Proprietary
FIRST RELEASE2025

KAI World Model is a foundation world model developed by Kinetix AI, serving as an internal simulator/predictor for control policies on the KAI humanoid. The model learns the conditional distribution P(image_t+1, state_t+1 | image_t, state_t, action_t) from millions of hours of robot footage and synthetic data from simulators (Isaac Sim, MuJoCo). Once trained, it acts as a neural physics engine in which RL policies can be trained without using the real robot.

The architecture is based on a diffusion transformer (DiT) with conditioning state and cross-attention to language embeddings. The prediction resolution is 256×256 over 8 future timesteps. The model also generates an implicit contact-dynamics representation, crucial for object manipulation and locomotion on uneven terrain.

KAI World Model is a component of the proprietary Kinetix AI stack and is not publicly available. Internally it is used both for offline RL (on collected data) and for online imagination-based planning (Dreamer-style model-based RL). A research API release is announced for 2026.

Type & Roles
Software types
Runtime

A Runtime is the environment or execution layer used to run code, load libraries, manage dependencies, and operate applications or services — either in real time or during normal system operation. In robotics this includes real-time operating system (RTOS) runtimes, ROS 2 executor runtimes, containerised execution environments (Docker, podman), and embedded C++ runtimes on microcontrollers.

Perception Stack

A Perception Stack encompasses the software layers that process data from cameras, LiDARs, IMUs, microphones, and other sensors in order to recognise the surrounding environment, perform localisation, detect and track objects, and interpret the scene. It is typically the first processing stage in an autonomous robot's data pipeline, feeding its outputs to planning and control stacks.

Select an item to see its description.
Main category
Runtime & InfrastructurePerception & Vision Software
Roles in robotics ecosystem
PerceptionMotion PlanningDeveloper Enablement
Software family
Family
Kinetix AI Stack
Maturity & Adoption
5 / 9
Demonstration phase
ResearchPrototypeProduction
Adoption scaleResearch / Prototype
Maintenance statusInternal / Proprietary – Not Public
First release2025
Last update28 April 2026
Deployments

KAI humanoid (Kinetix AI) — internal use for training locomotion and manipulation policies. Publications at ICRA 2025 and CoRL 2025.

Community

No open-source community. Three scientific publications in 2025 and a few conference demos.

ROS supportCompatibility with ROS / ROS 2 ecosystem
Brak wsparcia ROSBrak jakiejkolwiek integracji z ekosystemem ROS
System capabilities
Open source
Source code is publicly available under an open-source license — enables security audits, custom modifications, and integration without licensing barriers.
×
Real-time capable
Designed with timing-determinism guarantees — meets the requirements of control loops, safety systems, and tasks demanding low, predictable latency.
×
⟨/⟩
API available
The software exposes a programmable interface (REST, gRPC, SDK, or language bindings) that enables automation and integration with other systems.
×
📦
Pre-built / binary
Distributed as ready-to-use binary packages, container images, or installers — no need to build from source.
×
Programming languages
PythonCUDA
Operating systems
Ubuntu 22.04JetPack Linux
Minimum hardware requirements
Minimum hardware requirements
CPUAMD EPYC / Intel Xeon (training), Jetson Thor (inference)
RAM (GB)64
GPUTraining: 8× NVIDIA H100. Inference: Jetson Thor / RTX 4090.
Disk (GB)500

Diffusion transformer ~3B parameters. One rollout generation ~120 ms on Thor.

Packaging & distribution
Package managers
Docker / Docker HubPrebuilt Binary (direct download)
CPU architectures
x86_64 (AMD64)NVIDIA Jetson – AArch64 (JetPack)ARM64 / AArch64
Installation difficulty
LevelExpert only
Protocols and interfaces
Communication protocols
gRPCREST API (HTTP/HTTPS)Shared Memory (POSIX / mmap)
Hardware interfaces
PCIe 4.0PCIe 5.0
Latency classes
Soft Real-Time (100–500 ms)Batch / Offline (> 1 min)
Deployment types
On RobotCloudHybrid
Supported simulators
NVIDIA Isaac Sim
NVIDIA Isaac Lab
MuJoCo
Official Docker images
nvcr.io/nvidia/isaac-sim
Licenses
LicenseRef-ProprietaryProprietary – All Rights Reserved

License family: Proprietary – Commercial

Version history
0.5.0Sept 2025

Scaled to 3B parameters, added language conditioning.

0.1.0Mar 2025

First experimental version presented at ICRA 2025.