Robotics’ ‘ChatGPT moment’ could come within 2 years, founder of China’s Unitree says



The “ChatGPT moment” for the robotics industry could arrive in as little as two years if powerful artificial intelligence technology develops to propel robots’ movements, according to the founder of China’s industry leader Unitree Robotics.

Wang Xingxing defined this moment as the first time a robot could perform a task, such as cleaning a room or bringing a bottle of water to a targeted person, in a venue that it had never been to before.

“If things develop fast, it could happen in the next year or two, or maybe two to three years”, he said on Saturday at the World Robot Conference in Beijing.

Although both robot hardware, such as dexterous hands, and training data were good enough to enable the feat, the crucial element of “AI for embodied intelligence is completely inadequate”, he said.

He had “doubts” about whether popular vision language action (VLA) models, which used a rather “dumb” architecture, were up to the task, he said. Although Unitree also used such models, along with reinforcement learning to improve pre-trained VLAs in downstream tasks, the approach required a lot of optimisation, he said.

Another approach, generating a video or interactive model based on text prompts and making robots follow this to perform tasks, could have a “higher probability” of succeeding in robot motion control, Wang said.

He cited Google’s general-purpose Genie 3 “world model”, which was launched on Tuesday and is billed as capable of generating models of dynamic worlds that include information on physical properties, as an example of technology development in this area.



Source link

Scroll to Top