March 18, 2024 feature

A new framework to collect training data and teach robots new manipulation policies

by Ingrid Fadelli , Tech Xplore

In recent years, roboticists and computer scientists have been trying to develop increasingly efficient methods to teach robots new skills. Many of the methods developed so far, however, require a large amount of training data, such as annotated human demonstrations of how to perform a task.

Researchers at Stanford University, Columbia University and Toyota Research Institute recently developed Universal Manipulation Interface (UMI), a framework to collect training data and transfer skills from human demonstrations in the wild to policies deployable on robots.

This framework, introduced in a paper posted to the preprint server arXiv, could contribute to the advancement of robotic systems, by speeding up and facilitating their training on new object manipulation tasks.

"In the last year, the robotics community saw huge advancement in robotic capability and task complexity, driven by wave of imitation learning algorithms including our prior work 'Diffusion Policy,'" Cheng Chi, co-author of the paper, told Tech Xplore.

"These algorithms take in human teleoperation datasets and produces an end-to-end deep neural network that drives robot actions directly from pixels. These methods are so powerful that we felt with sufficiently large and diverse demonstration datasets, there is no obvious ceiling on their capabilities.

"However, unlike other fields such as natural language processing (NLP) or computer vision (CV), there isn't widely available robotic data on the Internet, thus we have to collect data ourselves."

Compiling large datasets containing a wide range of demonstration data via teleoperation (i.e., the remote operation of physical robots) can be both expensive and time-consuming. Moreover, the logistics required to transport robots complicate the collection of varied data.

Chi and his colleagues set out to tackle these reported challenges of robot training in a scalable and efficient way. The key objective of their recent study was to develop a scalable method to collect real-world robotics training data in a wide range of environments.

Credit: Chi et al

"Back in 2020, our lab published a work called 'Grasping in the wild' that pioneered the idea of using a hand-held gripper device, combined with wrist-mounted camera, to collect data in the wild," Chi explained. "However, limited by the learning algorithms at the time as well as some hardware design flaws, the system is limited to simple tasks like object grasping."

Building on their previous works, Chi and his colleagues designed a new system to collect data and train robots. This system, dubbed UMI, includes a hand-held robotic gripper and a deep learning framework that combines the advantageous features of recently developed imitation learning algorithms, such as "Diffusion Policy."

"UMI is a data collection and policy learning framework that allows direct skill transfer from in-the-wild human demonstrations to deployable robot policies," Chi explained. "It consists of two components. The first is a physical interface (i.e., the 3D printed grippers mounted with GoPros) to capture all the information necessary for policy learning while remaining highly intuitive, cost-effective, portable and reliable. The second is a policy interface (i.e., API) that defines a standard way to learn from the data that enables cross-hardware transfer (i.e., deploying to multiple real-world robots)."

The framework developed by Chi and his collaborators has numerous advantages over other methods to collect data and train robotic manipulators. First, the UMI grippers they developed were much more intuitive than previously introduced teleoperation approaches.

"A data collector can demonstrate much harder tasks much faster compared to teleportation," Chi said, "As a result, the learned policy becomes more effective."

Credit: Chi et al

The second advantage of UMI is that it enables the collection of large and diverse datasets that allow robots to generalize well across unseen environments and object manipulation tasks. Collecting this data using UMI is also far cheaper and more feasible than compiling annotated training datasets using conventional methods.

"UMI also enables cross-hardware generalization," Chi said. "Any research lab can retrofit their industrial robot arms with UMI-compatible grippers and cameras, and directly deploy the policies we trained, or take advantage of the data we collected for pre-training. In comparison, most of the dataset that currently exists are specific to a robot embodiment and often to a specific lab environment. As a result, UMI could enable large-scale robotic data sharing across academia, similarly to datasets used in NLP and CV community."

In initial experiments, the UMI approach yielded very promising results. It was found to enable highly intuitive end-to-end imitation learning, training robots on various complex manipulation tasks with limited engineering efforts on the part of researchers, including dishwashing and folding clothes.

"Our experiments also showed that, with diverse data, end-to-end imitation learning can generalize to in-the-wild, unseen environments and unseen objects," Chi said. "In contrast, the standard for evaluating these end-to-end imitation learning methods previously has been using the same environment for both training and testing. Collectively, the evidence we collected suggests that with sufficiently large and diverse robotics dataset, general-purpose robots such as home robots might become feasible, even without a paradigm change on learning algorithms."

The new framework introduced by Chi and his collaborators could soon be used to collect other training datasets and tested on a wider range of complex manipulation tasks. The design of the UMI gripper and its underlying software are open-source and can be accessed by other teams on GitHub.

"We now wish to further expand the capabilities and observation modalities of UMI, by improving the hardware and adapting them to a broader range of robots," Chi added. "We also plan to collect even more data and use those data to further improve learning algorithms."

More information: Cheng Chi et al, Universal Manipulation Interface: In-The-Wild Robot Teaching Without In-The-Wild Robots, arXiv (2024). DOI: 10.48550/arxiv.2402.10329

Journal information: arXiv

Citation: A new framework to collect training data and teach robots new manipulation policies (2024, March 18) retrieved 29 June 2024 from https://techxplore.com/news/2024-03-framework-robots-policies.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

A deep reinforcement learning approach to enhance autonomous robotic grasping and assembly

92 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

22 hours ago

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

A new framework to collect training data and teach robots new manipulation policies

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

A deep reinforcement learning approach to enhance autonomous robotic grasping and assembly

Testing an unsupervised deep learning model for robot imitation of human motions

A computer vision–based teleoperation system that can be applied to different robots

A new framework that could simplify imitation learning in robotics

Human-like real-time sketching by a humanoid robot

An imitation learning approach to train robots without the need for real human demonstrations

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

Phys.org

Medical Xpress

Science X

A new framework to collect training data and teach robots new manipulation policies

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

A deep reinforcement learning approach to enhance autonomous robotic grasping and assembly

Testing an unsupervised deep learning model for robot imitation of human motions

A computer vision–based teleoperation system that can be applied to different robots

A new framework that could simplify imitation learning in robotics

Human-like real-time sketching by a humanoid robot

An imitation learning approach to train robots without the need for real human demonstrations

Recommended for you

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

Your Privacy