June 26, 2023 report

DeepMind unveils self-training RoboCat

by Peter Grad , Tech Xplore

An unknown admirer of felines once remarked, "Cats and computers both have one thing in common—they both rule the Internet."

At Google's DeepMind, researchers recently married artificial intelligence with a robot named RoboCat, and while it does not yet rule the Internet, it is expected to make a big leap into a future world of self-training automatons.

Utilizing the same technology behind large language models, the DeepMind team, comprised of more than 30 researchers, said it had made a breakthrough with a robotic cat that not only learns new tasks quickly but can improve its performance by constructing its own performance data.

"RoboCat has a virtuous cycle of training," DeepMind said in a paper published on the preprint server arXiv. "The more new tasks it learns, the better it gets at learning additional new tasks."

Up to now, robots generally have performed specific, pre-programmed tasks. With the introduction of large language models, robot skills sets began to broaden, though training on the massive volumes of data required enormous amounts of time.

DeepMind said Robocat, however, can quickly learn new tasks, such as fitting various-shaped puzzle pieces into the proper holes or placing fruit in a bowl. It then was able to progress and perform more complex tasks "based on a data set of millions of trajectories" from prior tasks and new self-generated data.

"These improvements were due to RoboCat's growing breadth of experience, similar to how people develop a more diverse range of skills as they deepen their learning in a given domain," researchers said.

As RoboCat improved its technique, its new learned behaviors were transferred to other robots that in turn built upon those skills.

The robot fine-tuned its performance on between 100 and 1,000 demonstrations from a human-controlled robotic arm. Spin-off models were then trained on specific tasks and that data was entered in the general instruction pool.

While RoboCat initially achieved a 36% success rate tackling tasks it had not previously learned, it improved its performance over time. Through self-training, its success rate doubled.

"RoboCat learns much faster than other state-of-the-art models," DeepMind researchers said. "It can pick up a new task with as few as 100 demonstrations because it draws from a large and diverse data set."

The development is seen as a major step towards accelerating robotics research, "as it reduces the need for human-supervised training, and is an important step towards creating a general-purpose robot."

The paper, "RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation," was published June 20.

Will such robots eventually dispense with the need for human intervention?

That question was addressed 100 years ago in the 1921 play "R.U.R.: Rossum's Universal Robots," a tale by Czech writer Karel Čapek.

The play imagined a factory that created synthetic humanoids that worked continuously and eventually shrank labor costs by 80%. The word "robot" was used for the first time in this play, after the Czech word "robota," which meant "forced labor by serfs."

In the end, the robots rebelled and extinguished humanity.

RoboCats, we can hope, will be friendlier.

Though we must also remember what humorist Will Rogers once said: "Letting the cat out of the bag is a whole lot easier than putting it back in."

More information: Konstantinos Bousmalis et al, RoboCat: A Self-Improving Foundation Agent for Robotic Manipulation, arXiv (2023). DOI: 10.48550/arxiv.2306.11706

DeepMind: www.deepmind.com/blog/robocat- … roving-robotic-agent

Journal information: arXiv

Citation: DeepMind unveils self-training RoboCat (2023, June 26) retrieved 30 June 2024 from https://techxplore.com/news/2023-06-deepmind-unveils-self-training-robocat.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

A robot that can autonomously explore real-world environments

168 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

DeepMind unveils self-training RoboCat

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

A robot that can autonomously explore real-world environments

Researchers expand ability of robots to learn from videos

Researchers develop algorithm to divvy up tasks for human-robot teams

DayDreamer: An algorithm to quickly teach robots new behaviors in the real world

Robots learn household tasks by watching humans

An imitation learning approach to train robots without the need for real human demonstrations

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

Phys.org

Medical Xpress

Science X

DeepMind unveils self-training RoboCat

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

A robot that can autonomously explore real-world environments

Researchers expand ability of robots to learn from videos

Researchers develop algorithm to divvy up tasks for human-robot teams

DayDreamer: An algorithm to quickly teach robots new behaviors in the real world

Robots learn household tasks by watching humans

An imitation learning approach to train robots without the need for real human demonstrations

Recommended for you

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

Your Privacy