share this!
6
8
Share
Email

March 9, 2020

Researchers introduce new algorithm to reduce machine learning time

A research team led by Prof. LI Huiyun from the Shenzhen Institutes of Advanced Technology (SIAT) of the Chinese Academy of Sciences introduced a simple deep reinforcement learning (DRL) algorithm with m-out-of-n bootstrap technique and aggregated multiple deep deterministic policy gradient (DDPG) algorithm structures.

Named "bootstrapped aggregated multi-DDPG" (BAMDDPG), the new algorithm accelerated the training process and increased the performance in the area of intelligent artificial research.

The researchers tested their algorithm on 2-D robot and open racing car simulator (TORCS). The experiment results on the 2-D robot arm game showed that the reward gained by the aggregated policy was 10%-50% better than those gained by subpolicies, and experiment results on the TORCS demonstrated that the new algorithm could learn successful control policies with less training time by 56.7%.

DDPG algorithm operating over continuous space of actions has attracted great attention for reinforcement learning. However, the exploration strategy through dynamic programming within the Bayesian belief state space is rather inefficient even for simple systems. This usually results in failure of the standard bootstrap when learning an optimal policy.

The proposed algorithm uses the centralized experience replay buffer to improve the exploration efficiency. M-out-of-n bootstrap with random initialization produces reasonable uncertainty estimates at low computational cost, helping in the convergence of the training. The proposed bootstrapped and aggregated DDPG can reduce the learning time.

BAMDDPG enables each agent to use experiences encountered by other agents. This makes the training of subpolicies of BAMDDPG more efficient since each agent owns a wider vision and more environment information.

This method is effective to the sequential and iterative training data, where the data exhibit long-tailed distribution, rather than the norm distribution implicated by the independent identically distributed data assumption. It can learn the optimal policies with much less training time for tasks with continuous space of actions and states.

The study, titled "Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm," was published in Hindawi.

More information: Junta Wu et al. Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm, Mathematical Problems in Engineering (2020). DOI: 10.1155/2020/4275623

Provided by Chinese Academy of Sciences

Citation: Researchers introduce new algorithm to reduce machine learning time (2020, March 9) retrieved 27 April 2024 from https://techxplore.com/news/2020-03-algorithm-machine.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Google's robot learns to walk in real world

14 shares

Feedback to editors

Proof of concept study shows path to easier recycling of solar modules

17 hours ago

New circuit boards can be repeatedly recycled

19 hours ago

Researchers develop an automated benchmark for language-based task planners

19 hours ago

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

19 hours ago

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

19 hours ago

Researchers outline path forward for tandem solar cells

21 hours ago

Researcher develop high-performance amorphous p-type oxide semiconductor

21 hours ago

Scientists create new atomic clock that is both ultra-precise and sturdy

21 hours ago

A framework to compare lithium battery testing data and results during operation

Apr 26, 2024

New approach could make reusing captured carbon far cheaper, less energy-intensive

Apr 26, 2024

Load comments (0)

Researchers introduce new algorithm to reduce machine learning time

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

Google's robot learns to walk in real world

Using imitation and reinforcement learning to tackle long-horizon robotic tasks

New method proposed to achieve better robot self-learning

A method for self-supervised robotic learning that entails setting feasible goals

A new developmental reinforcement learning approach for sensorimotor space enlargement

Researchers develop efficient distributed deep learning

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Phys.org

Medical Xpress

Science X

Researchers introduce new algorithm to reduce machine learning time

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

Related Stories

Google's robot learns to walk in real world

Using imitation and reinforcement learning to tackle long-horizon robotic tasks

New method proposed to achieve better robot self-learning

A method for self-supervised robotic learning that entails setting feasible goals

A new developmental reinforcement learning approach for sensorimotor space enlargement

Researchers develop efficient distributed deep learning

Recommended for you

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Your Privacy