November 21, 2018 feature

RoboTurk: A crowdsourcing platform for imitation learning in robotics

by Ingrid Fadelli, Tech Xplore

Imitation learning is a branch of machine learning that trains machines to mimic human behavior while completing particular tasks. These techniques show great promise in the field of robotics, as they tackle some of the shortcomings of reinforcement learning, such as exploration and reward specification.

Despite encouraging results, imitation learning studies have so far been limited to modest-sized datasets due to difficulties in collecting large quantities of task demonstrations using existing methods. To address these limitations, a team of researchers supervised by Dr. Silvio Savarese and Dr. Fei-Fei Li at Stanford University has developed RoboTurk, a crowdsourcing platform for high-quality 6-DoF trajectory-based teleoperation using widely available smartphone devices.

"We wanted to create something like ImageNet for Robotics," Ajay Mandlekar, one of the researchers who carried out the study, told TechXplore. "We believe that data is a key limitation in the field of robot learning. While there are plenty of methods that learn from data, such as data-driven control and reinforcement learning, most methods collect their own data. As a result, the data is often of a low quality, for instance resulting in the robot moving its arm randomly. This type of exploration can be difficult and unsafe, but we believe that humans can help."

ImageNet is a renowned image database created by Dr. Li, commonly used in computer vision and object recognition research. The crowdsourcing platform developed by the Stanford Vision and Learning Lab was designed to serve as a similar resource for robotics and imitation learning studies.

"Unlike ImageNet, such a data collection system needed to be dynamic, allowing us to collect data repeatedly, often on-demand, and perhaps even using collaborative learning," Yuke Zhu, who was also involved in the development of RoboTurk, told TechXplore. "This is because the data that is collected depends on what types of actions the robot takes in the environment."

The researchers' ultimate goal is to train robots on advanced manipulation skills, allowing them to complete tasks within industrial settings such as packaging or assembly. They found that while imitation learning showed great potential in this context, existing datasets were very limited due to difficulties in collecting large quantities of task demonstrations.

"In other domains such as computer vision and natural language processing, large-scale supervision for datasets is often collected with the assistance of crowdsourcing," Mandlekar said. "This enables a scalable mechanism for diverse human supervision on an extensive set of problem instances. However, collecting large amounts of data has been a challenge for robotics tasks, as they demand real-time interaction and feedback from annotators, placing difficult constraints on remote teleoperation platforms."

The group at the Stanford Vision and Learning Lab therefore developed RoboTurk, a crowdsourcing platform that allows researchers to scale up the skills and tasks that robots can perform autonomously, through the use of scalable human supervision. Via RoboTurk, remote workers can log onto a website and collect task demonstrations, using their smartphone as a motion controller.
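The article does not describe how phone motion is turned into robot commands. As a rough illustration only, one common way to use a tracked device as a 6-DoF controller is to send the change in its pose between two readings: a translation delta plus a rotation delta. The sketch below assumes each pose is a position in meters and a unit quaternion in (w, x, y, z) order; the function names and the `position_scale` parameter are hypothetical and are not taken from RoboTurk.

```python
import math

def quat_multiply(a, b):
    """Hamilton product of two quaternions given as (w, x, y, z)."""
    aw, ax, ay, az = a
    bw, bx, by, bz = b
    return (
        aw * bw - ax * bx - ay * by - az * bz,
        aw * bx + ax * bw + ay * bz - az * by,
        aw * by - ax * bz + ay * bw + az * bx,
        aw * bz + ax * by - ay * bx + az * bw,
    )

def quat_conjugate(q):
    """Conjugate of a quaternion; for unit quaternions this is the inverse."""
    w, x, y, z = q
    return (w, -x, -y, -z)

def pose_delta(prev_pose, curr_pose, position_scale=1.0):
    """Return (translation delta, rotation delta) between two device poses.

    Each pose is (position, orientation). The rotation delta r satisfies
    curr_orientation = r * prev_orientation for unit quaternions.
    position_scale is a hypothetical gain from hand motion to robot motion.
    """
    (p0, q0), (p1, q1) = prev_pose, curr_pose
    translation = tuple(position_scale * (b - a) for a, b in zip(p0, p1))
    rotation = quat_multiply(q1, quat_conjugate(q0))
    return translation, rotation

# Example: the phone moves 2 cm along x and turns 90 degrees about the z axis.
half_angle = math.radians(90) / 2
previous = ((0.0, 0.0, 0.0), (1.0, 0.0, 0.0, 0.0))
current = ((0.02, 0.0, 0.0), (math.cos(half_angle), 0.0, 0.0, math.sin(half_angle)))
translation, rotation = pose_delta(previous, current)
print("translation delta:", translation)
print("rotation delta:", rotation)
```

Sending deltas rather than absolute poses means the operator does not need to align the phone with the robot before starting, which is one reason this kind of mapping is often chosen for teleoperation.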

"RoboTurk is supported by a cloud-based simulation backend that streams video to a client's web browser using low-latency communication protocols," Mandlekar explained. "This ensures homogenous quality of service regardless of a client's computer resources, resulting in a platform that is intuitive to use and has a low barrier to entry, which are the core requirements of a crowdsourced task. RoboTurk supports multiple robots, tasks, and simulators, and can easily be extended to support others."

The researchers evaluated their platform on three manipulation tasks of varying durations, ranging from 15 to 120 seconds. They found that the smartphone-based interface was statistically comparable to special-purpose hardware, such as virtual reality controllers. They also observed that poor network conditions did not substantially affect users' ability to perform tasks successfully on the platform. Using RoboTurk, they collected 137.5 hours of manipulation data from remote workers, including over 2,200 successful task demonstrations, in 22 hours of total system usage.
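To put these figures in perspective, the short calculation below derives two rates from them. It assumes that the 137.5 hours of data and the 2,200 demonstrations were both gathered during the same 22 hours of system usage, which is how the numbers read but is not spelled out in the article; the variable names are illustrative.

```python
hours_of_data = 137.5      # total manipulation data collected
system_hours = 22          # total system usage
successful_demos = 2200    # reported as "over 2200", so treat as a lower bound

# Average hours of data recorded per hour the system was running,
# i.e. roughly how many workers were contributing at once.
avg_parallel_streams = hours_of_data / system_hours

# Successful demonstrations gathered per hour of system usage (lower bound).
demos_per_system_hour = successful_demos / system_hours

print(f"Average parallel data streams: {avg_parallel_streams:.2f}")
print(f"Successful demonstrations per system hour: at least {demos_per_system_hour:.0f}")
```

Under that assumption, the platform recorded about six hours of data for every hour it ran, which is the practical benefit of having many remote workers connected at the same time.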

"I think that the most meaningful part of the platform is how it will enable humans and robots to interact," Animesh Garg, the postdoctoral researcher leading the project, told TechXplore. "Robots are the smart tools of the future. We should not think of them as a replacement for humans but rather as a way to extend our capabilities. This empowers humans to be more productive and focus on higher-level intelligence problems, in the same way in which the advent of computers made it easier for people to use math as a tool to solve problems of interest."

RoboTurk effectively enables policy learning on multi-step manipulation tasks with sparse rewards. In addition, Mandlekar and his colleagues found that using larger quantities of demonstrations during policy learning had notable benefits, leading to better performance and greater learning consistency.
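The article does not detail the learning algorithm used in the study. As a minimal illustration of what learning a policy from demonstrations means, the sketch below uses behavioral cloning, the simplest imitation learning technique, which is not necessarily the method the researchers used. A simulated expert maps a one-dimensional state to an action with some noise, and the learner fits a one-parameter policy to the recorded state-action pairs by least squares. The expert, its gain, and all function names are hypothetical.

```python
import random

def collect_demonstrations(expert_gain, n, noise_std, rng):
    """Simulate n (state, action) pairs from a noisy linear 'expert'."""
    demos = []
    for _ in range(n):
        state = rng.gauss(0.0, 1.0)
        action = expert_gain * state + rng.gauss(0.0, noise_std)
        demos.append((state, action))
    return demos

def fit_policy(demos):
    """Behavioral cloning with a one-parameter policy: action = gain * state.

    The gain is the least-squares fit through the origin.
    """
    numerator = sum(s * a for s, a in demos)
    denominator = sum(s * s for s, _ in demos)
    return numerator / denominator

rng = random.Random(0)
expert_gain = 1.5  # hypothetical expert behavior the learner should recover

for n in (10, 100, 1000):
    demos = collect_demonstrations(expert_gain, n, noise_std=0.1, rng=rng)
    learned_gain = fit_policy(demos)
    print(f"{n:5d} demonstrations: learned gain {learned_gain:.4f}")
```

In this toy setting, the learned gain tends to move closer to the expert's as the number of noisy demonstrations grows, which gives a simple intuition for why larger demonstration datasets can help. Real manipulation policies are far higher-dimensional and are typically trained with neural networks.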

In the future, RoboTurk could become a key resource in the field of robotics, aiding the development of more advanced and better performing robots. The researchers are now applying RoboTurk to real robots, while also developing algorithms that can use the data they collected to teach robots low-level skills.

"Robots are a very exciting technology that will enable people to be more productive and independent in all spheres of human activity, for instance providing a helping hand in the kitchen, caretakers for the senior population, and better care for patients," Garg said. "One of the things that excites us is the democratization of manufacturing. This technology could enable people to make and sell custom products without the need of special purpose equipment, just as YouTube has democratized content creation and distribution, allowing anyone to create and share videos."

More information: RoboTurk: A crowdsourcing platform for robotic skill learning through imitation. arXiv:1811.02790 [cs.RO]. arxiv.org/abs/1811.02790

crowdncloud.ai/

Citation: RoboTurk: A crowdsourcing platform for imitation learning in robotics (2018, November 21) retrieved 16 April 2024 from https://techxplore.com/news/2018-11-roboturk-crowdsourcing-platform-imitation-robotics.html

