October 21, 2020

Robots deciding their next move need help prioritizing

by University of Illinois at Urbana-Champaign

As robots replace humans in dangerous situations such as search and rescue missions, they need to be able to quickly assess and make decisions—to react and adapt like a human being would. Researchers at the University of Illinois at Urbana-Champaign used a model based on the game Capture the Flag to develop a new take on deep reinforcement learning that helps robots evaluate their next move.

The team of researchers chose Capture the Flag because it's played with two teams, each with multiple teammates, where the opposing team is also making decisions.

"Robots can learn how to react in an environment like a competitive game by using a kind of trial and error process, called reinforcement learning. They learn what actions to take in a given situation by playing the game," said Huy Tran, a researcher in the Department of Aerospace Engineering at UIUC. "The challenge is to figure out how to create agents that can also adapt to unexpected situations."

Tran said his team realized that the robots needed help in prioritizing tasks.

"Given the overall task of capturing the flag, there are actually sub tasks to accomplish along the way, which we model in a hierarchical structure. What we wanted to explore was whether or not this type of hierarchy would help with the ability to adapt."

Tran said that hierarchical deep reinforcement learning splits the overall task into sub tasks, such as capturing the flag or tagging a member of the opposing team to eliminate them, so the model can handle more complex problems.

"By breaking the task into sub tasks, we can improve adaptation. We trained a high-level decision maker who assigns a sub task for each agent to focus on," Tran said. The hierarchical structure also makes updating the model simpler, he said: only the hierarchical controller needs to be updated, rather than each of the agents.
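The division of labor Tran describes can be illustrated with a small sketch. It is not the study's implementation: the published work uses deep neural networks, while this example swaps in a simple table of running value estimates so the structure is easy to see. The sub-task names, action strings, `HighLevelController` class, and `act` helper are all hypothetical names chosen for illustration. The point it shows is that the low-level behaviors stay fixed and only the high-level controller is updated.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List

# Hypothetical sub-task names, based on the two sub tasks the article mentions.
SUBTASKS = ("capture_flag", "tag_opponent")


# Stand-ins for trained low-level policies; they are never updated here.
def capture_flag_policy(obs: dict) -> str:
    return "move_toward_flag"


def tag_opponent_policy(obs: dict) -> str:
    return "move_toward_opponent"


LOW_LEVEL: Dict[str, Callable[[dict], str]] = {
    "capture_flag": capture_flag_policy,
    "tag_opponent": tag_opponent_policy,
}


@dataclass
class HighLevelController:
    """Keeps one running value estimate per (agent, sub-task) pair.

    Assumption: a tabular estimate replaces the deep network used in the study.
    """
    n_agents: int
    step_size: float = 0.5
    values: List[Dict[str, float]] = field(default_factory=list)

    def __post_init__(self) -> None:
        self.values = [{s: 0.0 for s in SUBTASKS} for _ in range(self.n_agents)]

    def assign(self) -> List[str]:
        # Greedy choice per agent; ties go to the first entry in SUBTASKS.
        return [max(SUBTASKS, key=lambda s: v[s]) for v in self.values]

    def update(self, assignment: List[str], reward: float) -> None:
        # Only the controller's estimates change; low-level policies are untouched.
        for v, subtask in zip(self.values, assignment):
            v[subtask] += self.step_size * (reward - v[subtask])


def act(controller: HighLevelController, observations: List[dict]) -> List[str]:
    """Ask the controller for sub tasks, then let each low-level policy act."""
    assignment = controller.assign()
    return [LOW_LEVEL[s](obs) for s, obs in zip(assignment, observations)]


controller = HighLevelController(n_agents=2)
first_assignment = controller.assign()
first_actions = act(controller, [{}, {}])

# Suppose the environment changes and the current plan now earns a penalty.
controller.update(first_assignment, reward=-1.0)
second_assignment = controller.assign()
```

In this toy run both agents start on the flag sub task, and a single penalty is enough for the controller to reassign them to tagging, without retraining either low-level behavior.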

"This approach has the potential to solve interesting and challenging problems, but there are a lot of issues that we still need to address before we can deploy these systems in real-world situations. For example, we learned that this framework can help with adaptation," Tran said, "but we recognize that in this study we decided what the sub tasks should be based on our own intuition of how the game works. That is not ideal because it has our own biases. What we're doing now is looking at new techniques to allow agents to figure out what those sub goals should be on their own."

The study, "Evaluating Adaptation Performance of Hierarchical Deep Reinforcement Learning," was written by Neale Van Stralen, Seung Hyun Kim, Huy T. Tran, and Girish Chowdhary. The research was funded by the Defense Advanced Research Projects Agency, presented at the 2020 IEEE International Conference on Robotics and Automation (ICRA), and published in the conference proceedings. A short video of the work shows the hierarchical controller in action.

More information: Neale Van Stralen et al. Evaluating Adaptation Performance of Hierarchical Deep Reinforcement Learning, 2020 IEEE International Conference on Robotics and Automation (ICRA) (2020). DOI: 10.1109/ICRA40945.2020.9197052

Provided by University of Illinois at Urbana-Champaign

Citation: Robots deciding their next move need help prioritizing (2020, October 21) retrieved 29 June 2024 from https://techxplore.com/news/2020-10-robots-prioritizing.html

