
Unitree G1
Bipedal humanoid robot by Unitree Robotics, designed as a compact research, development, and developer platform.
- Research
- Home Assistance

Perception · Perception & Vision Software
OpenCV 4.13.0·
OpenCV (Open Source Computer Vision Library) is the flagship library of computer vision algorithms, used in nearly every robotic project involving image processing — from a cheap educational mobile robot to autonomous driving systems. Initiated in 1999 by Gary Bradski at Intel Research (Santa Clara), with the goal of providing open-source vision infrastructure. First public release in 2000, version 1.0 in 2006. Since 2012 stewarded by the non-profit OpenCV Foundation; today actively supported by OpenCV.ai. Apache 2.0 license (since 4.5.0 in 2020; previously BSD-3).
OpenCV offers > 4000 optimized functions in 12+ modules: core (matrices, data types), imgproc (filters, geometric transforms), features2d (SIFT, SURF, ORB, AKAZE), calib3d (camera calibration, stereo, fundamental matrix), video (Optical Flow Lucas-Kanade, Farnebäck, DIS), objdetect (Haar cascades, HOG, DNN-based YOLO), dnn (deep neural network inference for Caffe, TensorFlow, ONNX), gapi (Graph API for pipelines), and the CUDA module for GPU acceleration.
Bindings: C++ (native), Python (cv2 — most popular), Java (Android), JavaScript (opencv.js). Cross-platform support: Linux, Windows, macOS, Android, iOS, embedded (Jetson, Raspberry Pi). OpenCV is the foundation of every ROS perception node — built into cv_bridge, image_pipeline, vision_msgs.
A Perception Stack encompasses the software layers that process data from cameras, LiDARs, IMUs, microphones, and other sensors in order to recognise the surrounding environment, perform localisation, detect and track objects, and interpret the scene. It is typically the first processing stage in an autonomous robot's data pipeline, feeding its outputs to planning and control stacks.
An API Library is a software package that exposes programmatic interfaces for communicating with a device, service, or system. In robotics it typically forms a lightweight integration layer built on top of the manufacturer's official API or an open-source project, abstracting low-level protocol details and providing language-native bindings (Python, C++, Java, etc.).
A family of open-source robotic perception libraries: Visual SLAM (ORB-SLAM, RTAB-Map), computer vision (OpenCV), point cloud processing (PCL, Open3D), detection and segmentation (YOLO, Detectron2).
Every ROS perception node, Tesla Autopilot (earlier), Boston Dynamics Spot (cv_bridge), Roomba (iRobot), Skydio drones (in the past), AWS DeepLens, Stripe/PayPal OCR. The standard in 95% of university vision courses. > 50 million pip downloads per year.
github.com/opencv/opencv ~82k★, ~55k forks, > 50k contributors. forum.opencv.org ~30k registered users. > 30 million downloads per year. Books: Learning OpenCV (Bradski/Kaehler) — over 100k copies sold.

Bipedal humanoid robot by Unitree Robotics, designed as a compact research, development, and developer platform.

Unitree H1 is a full-size general-purpose humanoid robot (~180 cm, ~47 kg). Bipedal, 5 DOF per leg + 4 DOF per arm, 3.3 m/s walking speed, 360° perception via 3D LiDAR + depth camera, Unitree M107 PMSM joint motors with ~360 N·m peak knee torque. Standard compute: Intel Core i5/i7; optional NVIDIA Jetson Orin NX.

Boston Dynamics bipedal humanoid robot. The fully electric generation unveiled in 2024 succeeds the hydraulic Atlas that was retired after more than a decade of research.

Figure 03 is the third-generation humanoid robot from Figure AI, designed for Helix, home environments, and scalable mass production.
The lightest perception library — runs even on ESP32-CAM (microcontroller).
License family: Permissive
Current stable release — Python 3.13 support, better fp16.
Full ONNX 1.16 support, DNN improvements for YOLO v8.
License change to Apache 2.0. Added RISC-V support.
C++11 requirement, GAPI (Graph API), DNN module.
Modular architecture, opencv_contrib, T-API (transparent OpenCL).
Introduction of C++ API — a fundamental paradigm shift.
First stable C-API release.