Skip to content

China's MOYA Robot and the New Humanoid AI Race

A
Alex Chen
June 22, 2026
11 min read
Science & Tech
China's MOYA Robot and the New Humanoid AI Race - Image from the article

Quick Summary

From China's eerily lifelike MOYA robot to Boston Dynamics Atlas and Alibaba's Qwen Robot, the humanoid AI revolution is accelerating fast. Here's what matters.

In This Article

The Humanoid Robot Race Just Got Uncomfortably Real

Something shifted recently in the world of humanoid robotics — and it wasn't subtle. China's MOYA robot, built by Shanghai-based startup Droidup, arrived with warm silicone skin, mirrored facial expressions, and a price tag of $173,000. Around the same time, Boston Dynamics published data showing Atlas can now be trained on millions of simulation hours in a single day. Alibaba launched a modular AI brain designed to connect China's sprawling hardware ecosystem. And a full-size bipedal robot played an autonomous table tennis match with no human in the loop.

This isn't a single breakthrough. It's a convergence — and the pace is moving faster than most analysts were comfortable predicting even two years ago. Here's a clear-eyed look at what's actually happening, what the numbers mean, and why this moment matters beyond the viral clips.

MOYA: What China's Most Unsettling Robot Actually Reveals

Droidup's MOYA humanoid robot is the obvious place to start, because it generated the most polarised reaction — and for good reason. She stands 5'5", weighs around 70 lbs, and is built on the Walker 3 platform, the same skeleton that carried a Droidup robot to third place in the world's first humanoid half-marathon. But MOYA isn't optimised for endurance. She's optimised for the moment you sit across from her and forget, however briefly, that she isn't human.

The engineering choices that drive that effect are deliberate. Internal cameras behind each eye track facial expressions in real time and trigger mirrored responses — smiles, nods, subtle narrowing of the eyes. The silicone skin runs at 90–97°F, which maps almost exactly to the surface temperature of a human hand. Underneath that skin, padding mimics the density distribution of human fat and muscle, structured around an artificial spine rather than rigid joints. That spine isn't cosmetic. Distributing torque through a flexible central column the way a human vertebral system does is a meaningful mechanical design choice, and it signals that Droidup is iterating toward biomechanical fidelity, not just surface aesthetics.

The company's headline claim — that MOYA walks with 92% accuracy compared to natural human gait — is their own unverified figure. Watching the footage critically, the movement is noticeably less robotic than most competitors, but Reddit's r/Singularity community called it accurately: it reads closer to a careful, deliberate elderly gait than a natural one. The word "natural" in the marketing is doing considerable work.

The financial picture adds another layer of complexity. Droidup is approximately two years old, has raised $28.5 million total, and has already spent nearly half on recruitment and payroll for 32 employees. Their lead investor holds a total valuation of around $50 million — which is thin runway for a company trying to build production-ready humanoids at $173,000 per unit. The promo video editing drew pointed criticism for concealing the robot's actual manipulation capability, and the hands — described by multiple reviewers as resembling wooden sticks — remain a conspicuous weak point.

None of that means MOYA is irrelevant. Unscripted footage from live public events, where real attendees approached and touched the robot, produced reactions that polished demos rarely capture. The target verticals — elder care, hospitals, museums, train stations — are exactly where the uncanny valley matters less and the warmth of presence matters more. If Droidup can hit their first production batch of 50 units with genuine reliability, the concept has legs regardless of the hype cycle around the launch.

Boston Dynamics Atlas: When Simulation Becomes a Weapon

If MOYA represents the ambitious fringe, Atlas represents what happens when serious industrial capital meets robotics research at scale. A recent analysis from KB Securities analyst Kang Sun Jin argues that Atlas is approaching the autonomy threshold actually required for factory deployment — and the technical underpinning of that claim is worth understanding.

Boston Dynamics, operating as a Hyundai Motor Group subsidiary, can now run simulation workloads equivalent to millions of training hours in a single day. Skills developed in simulation transfer to a physical Atlas unit in approximately one hour. That compression of the development cycle — from weeks to hours — is genuinely discontinuous with where the field was in 2022. It's made possible by a strategic combination: Google DeepMind on the learning architecture side, NVIDIA supplying high-performance computing infrastructure for large-scale simulation.

Atlas's hardware design amplifies this advantage. Most humanoids are simulation nightmares — dozens of actuator types, complex asymmetric mechanical systems that are difficult to model accurately in virtual environments. Atlas runs on just two actuator types across the entire body, with symmetric arm and leg design that simplifies both the control system and the simulation model. Eliminating cables across joints enables continuous rotation and reduces the mechanical failure modes that inflate maintenance costs in industrial settings.

China's MOYA Robot and the New Humanoid AI Race

The practical output of this approach showed up in a specific test: Atlas successfully moved a refrigerator weighing over 100 lbs despite only being trained on loads of 50–70 lbs. That generalisation beyond training distribution — handling novel weight and centre-of-mass distributions through whole-body coordination, balance anticipation, and force modulation — is precisely the capability gap that has kept humanoids out of real factory floors. The athletic demonstrations Atlas is known for, the handstands and backflips, aren't marketing stunts. They're specifically used to train slip recovery, dynamic balance, and endurance responses that transfer directly to unstructured industrial environments.

KB Securities projects Atlas could capture 15% of the total global humanoid robot market by 2035, with up to 60% of the premium industrial humanoid segment. Those are bold numbers — but they're not arbitrary. They're grounded in a technology stack that currently has no close peer.

Agibot's Table Tennis Test: Why This Benchmark Actually Matters

Agibot's Yuan Jing A3 completing a fully autonomous table tennis match didn't generate the same viral attention as MOYA — but it probably should have. Table tennis is one of the most demanding real-time benchmarks available to roboticists, and the reasons why make the achievement easier to appreciate.

A table tennis ball travels at over 5 metres per second with spin, trajectory deviation, and bounce variation occurring in fractions of a second. The robot must run a continuously closed perception-planning-execution loop: visual input, trajectory prediction, whole-body motion planning, precise paddle placement, and tactical switching between offensive and defensive play — all without pause. There's no opportunity to batch-process sensor data or pause for planning. The loop has to be essentially continuous.

Agibot solved this with a combination of Spike Ping Pong, described as the world's first table tennis motion control algorithm built specifically for bipedal humanoids, and a 20kHz high-frequency event camera developed in collaboration with Professor Hang Tun. That camera delivers visual response speeds approximately 10 times faster than conventional imaging solutions, with millimetre-level prediction accuracy for paddle positioning. The result is the first documented instance of a full-size bipedal humanoid completing an autonomous match from start to finish with no remote control or scripting.

This matters beyond the sport itself. The same closed-loop visual-motor integration required to return a spinning ball at 5 m/s is architecturally related to what's needed for a humanoid to catch objects thrown unpredictably on a factory floor, or respond to a patient losing balance in a care setting. Agibot is developing a transferable capability, not demonstrating a party trick.

Alibaba's Qwen Robot: Building the Brain for China's Hardware Army

Perhaps the most strategically significant development in this cluster is one that received the least consumer attention: Alibaba's launch of Qwen Robot, their first embodied AI model family, developed by Tongyi Lab.

The problem Qwen Robot addresses is fundamental. Vision-language models can understand a command like "go to the kitchen, find the red cup, and put it on the shelf" — but understanding a task and physically executing it are separated by a massive engineering gap. Robot training data arrives in formats incompatible with internet-scale training data. Mixing those sources carelessly creates conflicts in the model's learned representations rather than improvements. And gathering high-quality robot interaction data is expensive and slow.

Alibaba's solution is architectural segmentation. Qwen-Robot-Nav handles navigation — in testing on a Unitree GO2 quadruped running NVIDIA Jetson Thor hardware with a single low-resolution camera, it navigated an unfamiliar apartment across multiple rooms following spoken commands with no preloaded maps, maintaining an inference latency of 196 milliseconds. Qwen-Robot-Manip handles physical manipulation, trained on over 38,000 hours of open-source data, and recently topped the generalist category at the RoboVerse Challenge benchmark with a task success rate of 45%. Qwen-Robot-World acts as the predictive layer, modelling how environments change and letting the robot reason about outcomes before committing to actions.

Free Weekly Newsletter

Enjoying this guide?

Get the best articles like this one delivered to your inbox every week. No spam.

China's MOYA Robot and the New Humanoid AI Race

The strategic implication is larger than any individual model score. The United States has DeepMind, NVIDIA, Figure, Skild, and Physical Intelligence working on robot intelligence. China has an enormous and rapidly expanding hardware base — Unitree, Agibot, UBTECH, Xiaomi Robotics, Xpeng, Galbot, and more. Qwen Robot is Alibaba's attempt to become the AI operating layer that unifies all of that hardware into a coherent, interconnected ecosystem. If it succeeds, it changes the competitive structure of the entire industry.

What the Convergence Actually Means

Taken individually, each of these developments is interesting. Taken together, they describe something more significant: the simultaneous maturation of multiple layers of the humanoid robotics stack — hardware biomechanics, simulation infrastructure, real-time perception, and AI reasoning — at the same moment.

MOYA demonstrates that social and emotional presence in robots is no longer purely theoretical; it's a product you can order, even if the execution is still rough around the edges. Atlas demonstrates that industrial deployment is no longer a five-year horizon problem. Agibot demonstrates that real-time closed-loop motor intelligence is achievable in a full-size bipedal platform. And Qwen Robot demonstrates that the AI infrastructure layer needed to connect hardware to genuine task reasoning is being built right now, at scale.

The question worth asking isn't whether humanoid robots will reach broad deployment. It's which architecture, which AI stack, and which regulatory environment will define the terms of that deployment. The answer is being determined in engineering labs, factory floors, and elder care facilities right now — not in ten years.

Frequently Asked Questions

What is China's MOYA robot and what makes it different from other humanoids? MOYA is a humanoid robot developed by Shanghai-based startup Droidup. What distinguishes it from most humanoids is its focus on social realism rather than industrial performance — warm silicone skin running at 90–97°F, real-time facial expression mirroring via cameras behind each eye, an artificial spine for natural torso movement, and internal padding designed to mimic human fat and muscle density. It's designed to be present in social settings like elder care and hospitality, not to lift objects in a warehouse.

How does Boston Dynamics Atlas train so quickly compared to other robots? Atlas benefits from a simulation infrastructure — built in partnership with Google DeepMind and NVIDIA — that can compress millions of equivalent training hours into a single day. Its standardised hardware design, using only two actuator types across the whole body with symmetric limb architecture, makes it far easier to simulate accurately than most humanoids. Skills developed in simulation transfer to physical hardware in roughly one hour, a cycle time that was effectively impossible two years ago.

Why is table tennis considered a serious robotics benchmark? Table tennis requires a robot to run a fully closed perception-planning-execution loop in near-continuous real time. Balls travel at over 5 metres per second with variable spin and trajectory. The robot must predict landing points at millimetre precision, plan whole-body motion, execute precise strikes, and switch between offensive and defensive strategies — all without pausing the loop. The underlying capabilities — rapid visual processing, dynamic balance, precise motor control under uncertainty — are directly transferable to demanding real-world applications.

What is Alibaba's Qwen Robot and why does it matter strategically? Qwen Robot is Alibaba's modular embodied AI model family, designed to bridge the gap between language understanding and physical robot control. It comprises three specialised models: one for navigation, one for manipulation, and one for world modelling and outcome prediction. Strategically, it matters because China has a large and growing humanoid hardware base but has lacked a unified AI intelligence layer to connect it. Qwen Robot is Alibaba's attempt to become that layer — essentially the Android of Chinese robotics infrastructure.

Is MOYA's 92% human gait accuracy claim independently verified? No. The 92% figure is Droidup's own claim and has not been independently verified by any third-party research or benchmark body. Observers watching the demo footage generally describe the movement as noticeably smoother than most humanoids but more consistent with a careful, deliberate elderly gait than a naturally fluid human stride. The figure should be treated as a marketing claim until independent testing is published.

Frequently Asked Questions

The Humanoid Robot Race Just Got Uncomfortably Real

Something shifted recently in the world of humanoid robotics — and it wasn't subtle. China's MOYA robot, built by Shanghai-based startup Droidup, arrived with warm silicone skin, mirrored facial expressions, and a price tag of $173,000. Around the same time, Boston Dynamics published data showing Atlas can now be trained on millions of simulation hours in a single day. Alibaba launched a modular AI brain designed to connect China's sprawling hardware ecosystem. And a full-size bipedal robot played an autonomous table tennis match with no human in the loop.

This isn't a single breakthrough. It's a convergence — and the pace is moving faster than most analysts were comfortable predicting even two years ago. Here's a clear-eyed look at what's actually happening, what the numbers mean, and why this moment matters beyond the viral clips.

MOYA: What China's Most Unsettling Robot Actually Reveals

Droidup's MOYA humanoid robot is the obvious place to start, because it generated the most polarised reaction — and for good reason. She stands 5'5", weighs around 70 lbs, and is built on the Walker 3 platform, the same skeleton that carried a Droidup robot to third place in the world's first humanoid half-marathon. But MOYA isn't optimised for endurance. She's optimised for the moment you sit across from her and forget, however briefly, that she isn't human.

The engineering choices that drive that effect are deliberate. Internal cameras behind each eye track facial expressions in real time and trigger mirrored responses — smiles, nods, subtle narrowing of the eyes. The silicone skin runs at 90–97°F, which maps almost exactly to the surface temperature of a human hand. Underneath that skin, padding mimics the density distribution of human fat and muscle, structured around an artificial spine rather than rigid joints. That spine isn't cosmetic. Distributing torque through a flexible central column the way a human vertebral system does is a meaningful mechanical design choice, and it signals that Droidup is iterating toward biomechanical fidelity, not just surface aesthetics.

The company's headline claim — that MOYA walks with 92% accuracy compared to natural human gait — is their own unverified figure. Watching the footage critically, the movement is noticeably less robotic than most competitors, but Reddit's r/Singularity community called it accurately: it reads closer to a careful, deliberate elderly gait than a natural one. The word "natural" in the marketing is doing considerable work.

The financial picture adds another layer of complexity. Droidup is approximately two years old, has raised $28.5 million total, and has already spent nearly half on recruitment and payroll for 32 employees. Their lead investor holds a total valuation of around $50 million — which is thin runway for a company trying to build production-ready humanoids at $173,000 per unit. The promo video editing drew pointed criticism for concealing the robot's actual manipulation capability, and the hands — described by multiple reviewers as resembling wooden sticks — remain a conspicuous weak point.

None of that means MOYA is irrelevant. Unscripted footage from live public events, where real attendees approached and touched the robot, produced reactions that polished demos rarely capture. The target verticals — elder care, hospitals, museums, train stations — are exactly where the uncanny valley matters less and the warmth of presence matters more. If Droidup can hit their first production batch of 50 units with genuine reliability, the concept has legs regardless of the hype cycle around the launch.

Boston Dynamics Atlas: When Simulation Becomes a Weapon

If MOYA represents the ambitious fringe, Atlas represents what happens when serious industrial capital meets robotics research at scale. A recent analysis from KB Securities analyst Kang Sun Jin argues that Atlas is approaching the autonomy threshold actually required for factory deployment — and the technical underpinning of that claim is worth understanding.

Boston Dynamics, operating as a Hyundai Motor Group subsidiary, can now run simulation workloads equivalent to millions of training hours in a single day. Skills developed in simulation transfer to a physical Atlas unit in approximately one hour. That compression of the development cycle — from weeks to hours — is genuinely discontinuous with where the field was in 2022. It's made possible by a strategic combination: Google DeepMind on the learning architecture side, NVIDIA supplying high-performance computing infrastructure for large-scale simulation.

Atlas's hardware design amplifies this advantage. Most humanoids are simulation nightmares — dozens of actuator types, complex asymmetric mechanical systems that are difficult to model accurately in virtual environments. Atlas runs on just two actuator types across the entire body, with symmetric arm and leg design that simplifies both the control system and the simulation model. Eliminating cables across joints enables continuous rotation and reduces the mechanical failure modes that inflate maintenance costs in industrial settings.

The practical output of this approach showed up in a specific test: Atlas successfully moved a refrigerator weighing over 100 lbs despite only being trained on loads of 50–70 lbs. That generalisation beyond training distribution — handling novel weight and centre-of-mass distributions through whole-body coordination, balance anticipation, and force modulation — is precisely the capability gap that has kept humanoids out of real factory floors. The athletic demonstrations Atlas is known for, the handstands and backflips, aren't marketing stunts. They're specifically used to train slip recovery, dynamic balance, and endurance responses that transfer directly to unstructured industrial environments.

KB Securities projects Atlas could capture 15% of the total global humanoid robot market by 2035, with up to 60% of the premium industrial humanoid segment. Those are bold numbers — but they're not arbitrary. They're grounded in a technology stack that currently has no close peer.

Agibot's Table Tennis Test: Why This Benchmark Actually Matters

Agibot's Yuan Jing A3 completing a fully autonomous table tennis match didn't generate the same viral attention as MOYA — but it probably should have. Table tennis is one of the most demanding real-time benchmarks available to roboticists, and the reasons why make the achievement easier to appreciate.

A table tennis ball travels at over 5 metres per second with spin, trajectory deviation, and bounce variation occurring in fractions of a second. The robot must run a continuously closed perception-planning-execution loop: visual input, trajectory prediction, whole-body motion planning, precise paddle placement, and tactical switching between offensive and defensive play — all without pause. There's no opportunity to batch-process sensor data or pause for planning. The loop has to be essentially continuous.

Agibot solved this with a combination of Spike Ping Pong, described as the world's first table tennis motion control algorithm built specifically for bipedal humanoids, and a 20kHz high-frequency event camera developed in collaboration with Professor Hang Tun. That camera delivers visual response speeds approximately 10 times faster than conventional imaging solutions, with millimetre-level prediction accuracy for paddle positioning. The result is the first documented instance of a full-size bipedal humanoid completing an autonomous match from start to finish with no remote control or scripting.

This matters beyond the sport itself. The same closed-loop visual-motor integration required to return a spinning ball at 5 m/s is architecturally related to what's needed for a humanoid to catch objects thrown unpredictably on a factory floor, or respond to a patient losing balance in a care setting. Agibot is developing a transferable capability, not demonstrating a party trick.

Alibaba's Qwen Robot: Building the Brain for China's Hardware Army

Perhaps the most strategically significant development in this cluster is one that received the least consumer attention: Alibaba's launch of Qwen Robot, their first embodied AI model family, developed by Tongyi Lab.

The problem Qwen Robot addresses is fundamental. Vision-language models can understand a command like "go to the kitchen, find the red cup, and put it on the shelf" — but understanding a task and physically executing it are separated by a massive engineering gap. Robot training data arrives in formats incompatible with internet-scale training data. Mixing those sources carelessly creates conflicts in the model's learned representations rather than improvements. And gathering high-quality robot interaction data is expensive and slow.

Alibaba's solution is architectural segmentation. Qwen-Robot-Nav handles navigation — in testing on a Unitree GO2 quadruped running NVIDIA Jetson Thor hardware with a single low-resolution camera, it navigated an unfamiliar apartment across multiple rooms following spoken commands with no preloaded maps, maintaining an inference latency of 196 milliseconds. Qwen-Robot-Manip handles physical manipulation, trained on over 38,000 hours of open-source data, and recently topped the generalist category at the RoboVerse Challenge benchmark with a task success rate of 45%. Qwen-Robot-World acts as the predictive layer, modelling how environments change and letting the robot reason about outcomes before committing to actions.

The strategic implication is larger than any individual model score. The United States has DeepMind, NVIDIA, Figure, Skild, and Physical Intelligence working on robot intelligence. China has an enormous and rapidly expanding hardware base — Unitree, Agibot, UBTECH, Xiaomi Robotics, Xpeng, Galbot, and more. Qwen Robot is Alibaba's attempt to become the AI operating layer that unifies all of that hardware into a coherent, interconnected ecosystem. If it succeeds, it changes the competitive structure of the entire industry.

What the Convergence Actually Means

Taken individually, each of these developments is interesting. Taken together, they describe something more significant: the simultaneous maturation of multiple layers of the humanoid robotics stack — hardware biomechanics, simulation infrastructure, real-time perception, and AI reasoning — at the same moment.

MOYA demonstrates that social and emotional presence in robots is no longer purely theoretical; it's a product you can order, even if the execution is still rough around the edges. Atlas demonstrates that industrial deployment is no longer a five-year horizon problem. Agibot demonstrates that real-time closed-loop motor intelligence is achievable in a full-size bipedal platform. And Qwen Robot demonstrates that the AI infrastructure layer needed to connect hardware to genuine task reasoning is being built right now, at scale.

The question worth asking isn't whether humanoid robots will reach broad deployment. It's which architecture, which AI stack, and which regulatory environment will define the terms of that deployment. The answer is being determined in engineering labs, factory floors, and elder care facilities right now — not in ten years.

Frequently Asked Questions

What is China's MOYA robot and what makes it different from other humanoids? MOYA is a humanoid robot developed by Shanghai-based startup Droidup. What distinguishes it from most humanoids is its focus on social realism rather than industrial performance — warm silicone skin running at 90–97°F, real-time facial expression mirroring via cameras behind each eye, an artificial spine for natural torso movement, and internal padding designed to mimic human fat and muscle density. It's designed to be present in social settings like elder care and hospitality, not to lift objects in a warehouse.

How does Boston Dynamics Atlas train so quickly compared to other robots? Atlas benefits from a simulation infrastructure — built in partnership with Google DeepMind and NVIDIA — that can compress millions of equivalent training hours into a single day. Its standardised hardware design, using only two actuator types across the whole body with symmetric limb architecture, makes it far easier to simulate accurately than most humanoids. Skills developed in simulation transfer to physical hardware in roughly one hour, a cycle time that was effectively impossible two years ago.

Why is table tennis considered a serious robotics benchmark? Table tennis requires a robot to run a fully closed perception-planning-execution loop in near-continuous real time. Balls travel at over 5 metres per second with variable spin and trajectory. The robot must predict landing points at millimetre precision, plan whole-body motion, execute precise strikes, and switch between offensive and defensive strategies — all without pausing the loop. The underlying capabilities — rapid visual processing, dynamic balance, precise motor control under uncertainty — are directly transferable to demanding real-world applications.

What is Alibaba's Qwen Robot and why does it matter strategically? Qwen Robot is Alibaba's modular embodied AI model family, designed to bridge the gap between language understanding and physical robot control. It comprises three specialised models: one for navigation, one for manipulation, and one for world modelling and outcome prediction. Strategically, it matters because China has a large and growing humanoid hardware base but has lacked a unified AI intelligence layer to connect it. Qwen Robot is Alibaba's attempt to become that layer — essentially the Android of Chinese robotics infrastructure.

Is MOYA's 92% human gait accuracy claim independently verified? No. The 92% figure is Droidup's own claim and has not been independently verified by any third-party research or benchmark body. Observers watching the demo footage generally describe the movement as noticeably smoother than most humanoids but more consistent with a careful, deliberate elderly gait than a naturally fluid human stride. The figure should be treated as a marketing claim until independent testing is published.

Z

About Zeebrain Editorial

Our editorial team is dedicated to providing clear, well-researched, and high-utility content for the modern digital landscape. We focus on accuracy, practicality, and insights that matter.

More from Science & Tech

Related Guides

Keep exploring this topic

Explore More Categories

Keep browsing by topic and build depth around the subjects you care about most.