December 10, 2020 report

Deep reinforcement-learning architecture combines pre-learned skills to create new sets of skills on the fly

by Bob Yirka, Tech Xplore

A team of researchers from the University of Edinburgh and Zhejiang University has developed a way to combine deep neural networks (DNNs) to create a new type of system with a new kind of learning ability. The group describes their new architecture and its performance in the journal Science Robotics.

Deep neural networks learn functions by training repeatedly on many examples. To date, they have been used in a wide variety of applications, such as recognizing faces in a crowd or deciding whether a loan applicant is creditworthy. In this new effort, the researchers combined several DNNs developed for different applications into a single system with the benefits of all of its constituent DNNs. They report that the resulting system was more than just the sum of its parts: it was able to learn new functions that none of the DNNs could perform alone. The researchers call it a multi-expert learning architecture (MELA).

More specifically, the work involved training several DNNs for different functions. One learned to make a robot trot, for example; another could navigate around obstacles. All of the DNNs were then connected to a gating neural network that, while controlling a robot moving around its environment, learned over time to call on whichever DNN had the skill a given situation required. The resulting system was then able to carry out the skills of all of the combined DNNs.
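The gating idea described above can be illustrated with a small sketch. It is not the researchers' code: the two "experts" are hand-written stand-ins for trained networks, the gate is a fixed rule rather than a learned network, and blending the experts' outputs with softmax weights is an assumption made here for illustration, not necessarily how MELA combines its experts. The names `trot_expert`, `recover_expert`, `gate_scores` and the "tilt" input are all hypothetical.

```python
import math
from typing import Callable, List, Sequence

Vector = List[float]
Expert = Callable[[Sequence[float]], Vector]


def softmax(scores: Sequence[float]) -> Vector:
    """Turn raw gate scores into positive weights that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical stand-ins for pre-trained experts. Each maps a robot state
# to a two-value motor command; real experts would be trained networks.
def trot_expert(state: Sequence[float]) -> Vector:
    return [1.0, 0.0]


def recover_expert(state: Sequence[float]) -> Vector:
    return [0.0, 1.0]


def gate_scores(state: Sequence[float]) -> Vector:
    """Toy gate: state[0] is an assumed 'tilt' reading in [0, 1].

    Low tilt favours trotting, high tilt favours fall recovery.
    A learned gating network would replace this hand-written rule.
    """
    tilt = state[0]
    return [4.0 * (1.0 - tilt), 4.0 * tilt]


def blended_action(state: Sequence[float], experts: Sequence[Expert]) -> Vector:
    """Weight each expert's command by the gate and sum the results."""
    weights = softmax(gate_scores(state))
    actions = [expert(state) for expert in experts]
    return [
        sum(w * a[i] for w, a in zip(weights, actions))
        for i in range(len(actions[0]))
    ]


experts = [trot_expert, recover_expert]
upright = blended_action([0.0], experts)  # dominated by the trot command
fallen = blended_action([1.0], experts)   # dominated by the recovery command
```

In this toy setup, an upright robot gets a command that is almost entirely the trotting expert's, a fallen robot gets one that is almost entirely the recovery expert's, and states in between receive a mixture of the two.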

Narrated video discussing how MELA works and its novelty. Credit: Yang et al., Sci Robot. 5, eabb2174 (2020)

But that was not the end of the exercise. As the MELA learned more about its constituent parts and their abilities, it learned through trial and error to use them together in ways that it had not been taught. It learned, for example, how to combine getting up after falling with dealing with a slippery floor, or what to do if one of its motors failed. The researchers suggest their work marks a new milestone in robotics research, providing a new paradigm in which humans do not have to intervene when a robot encounters problems it has not experienced before.
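Learning by trial and error can be shown in miniature. The sketch below is a deliberately simple stand-in: it uses plain hill climbing on a single number, not the deep reinforcement learning reported in the paper. The `reward` function, its 70/30 optimum and the `learn_mix` helper are all invented for illustration; the point is only that a mix of two skills can be found by trying small changes and keeping the ones that score better.

```python
import random


def reward(mix: float) -> float:
    """Hypothetical task score, highest when the two skills are mixed 70/30."""
    return -(mix - 0.7) ** 2


def learn_mix(trials: int = 2000, step: float = 0.05, seed: int = 0) -> float:
    """Hill climbing: try a random nudge, keep it only if the reward improves."""
    rng = random.Random(seed)  # fixed seed so repeated runs match
    mix = 0.0  # start by relying entirely on the first skill
    best = reward(mix)
    for _ in range(trials):
        # Propose a nearby mix, clamped to the valid range [0, 1].
        candidate = min(1.0, max(0.0, mix + rng.uniform(-step, step)))
        score = reward(candidate)
        if score > best:
            mix, best = candidate, score
    return mix


learned = learn_mix()
```

Nothing in the loop is told that 0.7 is the target; it only sees the score of each attempt, which is the sense in which the combination is discovered rather than taught.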

Video of outdoor experiments where a MELA-programmed robot recovers from being kicked to the ground. Credit: Yang et al., Sci Robot. 5, eabb2174 (2020)

Video of the four-legged robot trotting on different kinds of surfaces, including slippery ones. Credit: Yang et al., Sci Robot. 5, eabb2174 (2020)

Video of the MELA-programmed robot trotting on pebbles or grass outside, when out of nowhere, a human pushes it to the ground. The robot recovers quickly, thanks to MELA. Credit: Yang et al., Sci Robot. 5, eabb2174 (2020)

Video of MELA encountering various challenges in simulations. Credit: Yang et al., Sci Robot. 5, eabb2174 (2020)

Video of less efficient fall recovery using default controllers of the four-legged robot. Credit: Yang et al., Sci Robot. 5, eabb2174 (2020)

More information: Chuanyu Yang et al. Multi-expert learning of adaptive legged locomotion, Science Robotics (2020). DOI: 10.1126/scirobotics.abb2174

Journal information: Science Robotics

Citation: Deep reinforcement-learning architecture combines pre-learned skills to create new sets of skills on the fly (2020, December 10) retrieved 17 July 2024 from https://techxplore.com/news/2020-12-deep-reinforcement-learning-architecture-combines-pre-learned.html

