Robots AtlasRobots Atlas
Logo

MiAI Environment Voice & Semantic Recognition Engine

PrototypeReal-time

MiAI Environment Voice & Semantic Recognition Engine is Xiaomi’s in-house voice and semantic stack, identified by Xiaomi on August 11, 2022 as the layer powering voice interaction on the CyberOne humanoid prototype. It consists of two components named by Xiaomi: the “MiAI environment voice engine” and the “MiAI semantic recognition engine”. According to Xiaomi’s official announcement, the system can recognise 85 environmental sound categories and 45 classifications of human emotion. Xiaomi has not released a public API or developer documentation — the engine exists publicly only as an internal AI layer of the CyberOne prototype.

Maturity and adoption
Technology readiness level and adoption scale
TRL 4
Demonstration phase
13579
First release11 August 2022
Last update1 May 2026
Organizations
Companies involved in software
Main category
Perception
Software types
Software classification by purpose

Perception Stack

A Perception Stack encompasses the software layers that process data from cameras, LiDARs, IMUs, microphones, and other sensors in order to recognise the surrounding environment, perform localisation, detect and track objects, and interpret the scene. It is typically the first processing stage in an autonomous robot's data pipeline, feeding its outputs to planning and control stacks.

Aliases:perception frameworksensor processing stack
Select a type to see the full manifest.
Categories (CMS)
Thematic groups in the content management system
Perception SoftwareVision SoftwareSensor Processing Software
Software roles
Functions performed in the robotics ecosystem
Role

Perception

No additional description for this role.

sensor perceptionscene understandingperception stack
Select a role to see details.
Target robot platforms
Robot platforms it works with