MIT debuts a large language model-inspired method for teaching robots new skills

MIT this week showcased a new model for training robots. Rather than the standard set of focused data used to teach robots new tasks, the method goes big, mimicking the massive troves of information used to train large language models (LLMs).

The researchers note that imitation learning — in which the agent learns by following an individual performing a task — can fail when small challenges are introduced. These could be things like lighting, a different setting, or new obstacles. In those scenarios, the robots simply don’t have enough data to draw upon in order to adapt.

The team looked to models like GPT-4 for a kind of brute force data approach to problem solving.

“In the language domain, the data are all just sentences,” says Lirui Wang, the new paper’s lead author. “In robotics, given all the heterogeneity in the data, if you want to pretrain in a similar manner, we need a different architecture.”

The team introduced a new architecture called Heterogeneous Pretrained Transformers (HPT), which pulls together information from different sensors and different environments. A transformer was then used to pull together the data into training models. The larger the transformer, the better the output.

Users then input the robot design, configuration, and the job they want done.

“Our dream is to have a universal robot brain that you could download and use for your robot without any training at all,” CMU associate professor David Held said of the research. “While we are just in the early stages, we are going to keep pushing hard and hope scaling leads to a breakthrough in robotic policies, like it did with large language models.”

The research was founded, in part, by Toyota Research Institute. Last year at TechCrunch Disrupt, TRI debuted a method for training robots overnight. More recently, it struck a watershed partnership that will unite its robot learning research with Boston Dynamics hardware.

Source link

MIT debuts a large language model-inspired method for teaching robots new skills

Recent posts

JobGet, a ‘LinkedIn’ for hourly workers, acquires rival Snagajob

Waymo robotaxis will be on the Uber app in Austin, Atlanta in early 2025

Kirin offers a taste of its electric salt spoon at CES 2025

Intuitive Machines CEO: ‘We now have the platform for a lunar economy’

Fluid Truck files for Chapter 11 bankruptcy and pursues sale after leadership shakeup

This Mexican fintech isn’t too worried about Trump’s tariff threats

Consumer tech is bouncing back, and consumer founders like Brynn Putnam are bouncing back with it

Peak XV has reaped $1.2B in the year since it split from Sequoia

Energy Revolution Ventures’ $18M fund lays a bet on ‘new chemistry’ startups in energy and hydrogen

Ex-PayPal COO David Sacks is Trump’s new crypto and AI ‘czar’

Intenty nudges you to provide a reason every time you unlock your phone

Will Smith eating spaghetti and other weird AI benchmarks that took off in 2024

India’s Physics Wallah raises $210M at $2.8B valuation even as edtech funding remains scarce

‘Hospital at home’ startup Doccla raises $46 million for its European expansion

Autodesk CTO Raji Arasu calls for diversity in the teams building AI

Related articles

Meta, X approved ads containing violent anti-Muslim, antisemitic hate speech ahead of German election, study finds

Court filings show Meta staffers discussed using copyrighted content for AI training

Brian Armstrong says Coinbase spent $50M fighting SEC lawsuit – and beat it

iOS 18.4 will bring Apple Intelligence-powered ‘Priority Notifications’

Nvidia CEO Jensen Huang says market got it wrong about DeepSeek’s impact

Report: OpenAI plans to shift compute needs from Microsoft to SoftBank

Norway’s 1X is building a humanoid robot for the home

Sakana walks back claims that its AI can dramatically speed up model training

Company

Follow us