November 19, 2020

Showing robots how to drive a car... in just a few easy lessons

by Caitlin Dawson, University of Southern California

Imagine if robots could learn from watching demonstrations: you could show a domestic robot how to do routine chores or set a dinner table. In the workplace, you could train robots like new employees, showing them how to perform many duties. On the road, your self-driving car could learn how to drive safely by watching you drive around your neighborhood.

Making progress on that vision, USC researchers have designed a system that lets robots autonomously learn complicated tasks from a very small number of demonstrations—even imperfect ones. The paper, titled Learning from Demonstrations Using Signal Temporal Logic, was presented at the Conference on Robot Learning (CoRL), Nov. 18.

The researchers' system works by evaluating the quality of each demonstration, so it learns from the mistakes it sees, as well as the successes. While current state-of-art methods need at least 100 demonstrations to nail a specific task, this new method allows robots to learn from only a handful of demonstrations. It also allows robots to learn more intuitively, the way humans learn from each other—you watch someone execute a task, even imperfectly, then try yourself. It doesn't have to be a "perfect" demonstration for humans to glean knowledge from watching each other.

"Many machine learning and reinforcement learning systems require large amounts of data data and hundreds of demonstrations—you need a human to demonstrate over and over again, which is not feasible," said lead author Aniruddh Puranic, a Ph.D. student in computer science at the USC Viterbi School of Engineering.

"Also, most people don't have programming knowledge to explicitly state what the robot needs to do, and a human cannot possibly demonstrate everything that a robot needs to know. What if the robot encounters something it hasn't seen before? This is a key challenge."

Credit: University of Southern California

Learning from demonstrations

Learning from demonstrations is becoming increasingly popular in obtaining effective robot control policies—which control the robot's movements—for complex tasks. But it is susceptible to imperfections in demonstrations and also raises safety concerns as robots may learn unsafe or undesirable actions.

Also, not all demonstrations are equal: some demonstrations are a better indicator of desired behavior than others and the quality of the demonstrations often depends on the expertise of the user providing the demonstrations.

To address these issues, the researchers integrated "signal temporal logic" or STL to evaluate the quality of demonstrations and automatically rank them to create inherent rewards.

In other words, even if some parts of the demonstrations do not make any sense based on the logic requirements, using this method, the robot can still learn from the imperfect parts. In a way, the system is coming to its own conclusion about the accuracy or success of a demonstration.

"Let's say robots learn from different types of demonstrations—it could be a hands-on demonstration, videos, or simulations—if I do something that is very unsafe, standard approaches will do one of two things: either, they will completely disregard it, or even worse, the robot will learn the wrong thing," said co-author Stefanos Nikolaidis, a USC Viterbi assistant professor of computer science.

"In contrast, in a very intelligent way, this work uses some common sense reasoning in the form of logic to understand which parts of the demonstration are good and which parts are not. In essence, this is exactly what also humans do."

Take, for example, a driving demonstration where someone skips a stop sign. This would be ranked lower by the system than a demonstration of a good driver. But, if during this demonstration, the driver does something intelligent—for instance, applies their brakes to avoid a crash—the robot will still learn from this smart action.

Adapting to human preferences

Signal temporal logic is an expressive mathematical symbolic language that enables robotic reasoning about current and future outcomes. While previous research in this area has used "linear temporal logic", STL is preferable in this case, said Jyo Deshmukh, a former Toyota engineer and USC Viterbi assistant professor of computer science .

"When we go into the world of cyber physical systems, like robots and self-driving cars, where time is crucial, linear temporal logic becomes a bit cumbersome, because it reasons about sequences of true/false values for variables, while STL allows reasoning about physical signals."

Puranic, who is advised by Deshmukh, came up with the idea after taking a hands-on robotics class with Nikolaidis, who has been working on developing robots to learn from YouTube videos. The trio decided to test it out. All three said they were surprised by the extent of the system's success and the professors both credit Puranic for his hard work.

"Compared to a state-of-the-art algorithm, being used extensively in many robotics applications, you see an order of magnitude difference in how many demonstrations are required," said Nikolaidis.

The system was tested using a Minecraft-style game simulator, but the researchers said the system could also learn from driving simulators and eventually even videos. Next, the researchers hope to try it out on real robots. They said this approach is well suited for applications where maps are known beforehand but there are dynamic obstacles in the map: robots in household environments, warehouses or even space exploration rovers.

"If we want robots to be good teammates and help people, first they need to learn and adapt to human preference very efficiently," said Nikolaidis. "Our method provides that."

"I'm excited to integrate this approach into robotic systems to help them efficiently learn from demonstrations, but also effectively help human teammates in a collaborative task."

More information: Learning from Demonstrations using Signal Temporal Logic: drive.google.com/file/d/1MH8KV … tLV0iUP163NIxV1/view

Provided by University of Southern California

Citation: Showing robots how to drive a car... in just a few easy lessons (2020, November 19) retrieved 19 April 2024 from https://techxplore.com/news/2020-11-robots-car-easy-lessons.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

An imitation learning approach to train robots without the need for real human demonstrations

56 shares

Feedback to editors

Team develops a way to teach a computer to type like a human

9 hours ago

Universal 'cocktail electrolyte' developed for 4.6 V ultra-stable fast charging of commercial lithium-ion batteries

10 hours ago

Garbage could replace a quarter of petroleum-based jet fuel every year

11 hours ago

For more open and equitable public discussions on social media, try 'meronymity'

12 hours ago

Mess is best: Disordered structure of battery-like devices improves performance

12 hours ago

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

13 hours ago

An ink for 3D-printing flexible devices without mechanical joints

13 hours ago

Floating solar's potential to support sustainable development

14 hours ago

Harvesting vibrational energy from 'colored noise'

15 hours ago

New understanding of energy losses in emerging light source

15 hours ago

Load comments (0)

Showing robots how to drive a car... in just a few easy lessons

Adapting to human preferences

Team develops a way to teach a computer to type like a human

Universal 'cocktail electrolyte' developed for 4.6 V ultra-stable fast charging of commercial lithium-ion batteries

Garbage could replace a quarter of petroleum-based jet fuel every year

For more open and equitable public discussions on social media, try 'meronymity'

Mess is best: Disordered structure of battery-like devices improves performance

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

An ink for 3D-printing flexible devices without mechanical joints

Floating solar's potential to support sustainable development

Harvesting vibrational energy from 'colored noise'

New understanding of energy losses in emerging light source

An imitation learning approach to train robots without the need for real human demonstrations

AVID: a framework to enhance imitation learning in robots

By observing humans, robots learn to perform complex tasks, such as setting a table

Dog training methods help teach robots to learn new tricks

Army robots get driver education for difficult tasks

Teaching humanoid robots different locomotion behaviors using human demonstrations

Team develops a way to teach a computer to type like a human

Garbage could replace a quarter of petroleum-based jet fuel every year

Using sim-to-real reinforcement learning to train robots to do simple tasks in broad environments

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

Researchers use machine learning to create a fabric-based touch sensor

Student engineering team successfully builds and runs hydrogen-powered engine

Phys.org

Medical Xpress

Science X

Showing robots how to drive a car... in just a few easy lessons

Adapting to human preferences

Team develops a way to teach a computer to type like a human

Universal 'cocktail electrolyte' developed for 4.6 V ultra-stable fast charging of commercial lithium-ion batteries

Garbage could replace a quarter of petroleum-based jet fuel every year

For more open and equitable public discussions on social media, try 'meronymity'

Mess is best: Disordered structure of battery-like devices improves performance

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

An ink for 3D-printing flexible devices without mechanical joints

Floating solar's potential to support sustainable development

Harvesting vibrational energy from 'colored noise'

New understanding of energy losses in emerging light source

Related Stories

An imitation learning approach to train robots without the need for real human demonstrations

AVID: a framework to enhance imitation learning in robots

By observing humans, robots learn to perform complex tasks, such as setting a table

Dog training methods help teach robots to learn new tricks

Army robots get driver education for difficult tasks

Teaching humanoid robots different locomotion behaviors using human demonstrations

Recommended for you

Team develops a way to teach a computer to type like a human

Garbage could replace a quarter of petroleum-based jet fuel every year

Using sim-to-real reinforcement learning to train robots to do simple tasks in broad environments

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

Researchers use machine learning to create a fabric-based touch sensor

Student engineering team successfully builds and runs hydrogen-powered engine

Your Privacy