August 2, 2021 report

Using generalization techniques to make AI systems more versatile

by Bob Yirka, Tech Xplore

A group at DeepMind called the Open-Ended Learning Team has developed a new way to train AI systems to play games. Instead of exposing a system to millions of prior games, as is done with other game-playing AI systems, the team gives its AI agents a minimal set of skills that they use to achieve a simple goal (such as spotting another player in a virtual world) and then build on. The researchers created XLand, a colorful virtual world with the general appearance of a video game. In it, AI players, which the researchers call agents, set off to achieve a general goal, and as they do, they acquire skills that they can use to achieve other goals. The researchers then switch the game around, giving the agents a new goal but allowing them to retain the skills they have learned in prior games. The group has written a paper describing its efforts and has posted it on the arXiv preprint server.

One example of the technique involves an agent trying to reach a part of its world that is too high to climb onto directly and that has no access points such as stairs or ramps. While exploring by trial and error, the agent discovers that it can move a flat object into position to serve as a ramp and thus make its way up to where it needs to go. To allow their agents to learn more skills, the researchers created 700,000 scenarios, or games, in which the agents faced approximately 3.4 million unique tasks. With this approach, the agents were able to teach themselves how to play multiple games, such as tag, capture the flag and hide and seek. The researchers describe their approach as endlessly challenging. Another interesting aspect of XLand is a sort of overlord, an entity that keeps tabs on the agents, notes which skills they are learning and then generates new games to strengthen those skills. With this approach, the agents will keep learning as long as they are given new tasks.
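To make the overlord idea concrete, here is a minimal sketch in Python of one way a task generator could track what agents have learned and decide what to offer next. It is an illustration only, not DeepMind's method: the `TaskTracker` class, its success-rate thresholds and the rule of offering only tasks that are neither too easy nor too hard are all assumptions made for this example.

```python
import random
from collections import defaultdict


class TaskTracker:
    """Hypothetical 'overlord': tracks per-task results and picks the next task.

    Assumption for illustration: a task is worth offering if it is untested,
    or if the agent's success rate on it lies between `low` and `high`.
    """

    def __init__(self, tasks, low=0.2, high=0.8, min_attempts=5):
        self.tasks = list(tasks)
        self.low = low
        self.high = high
        self.min_attempts = min_attempts
        self.attempts = defaultdict(int)
        self.successes = defaultdict(int)

    def record(self, task, solved):
        """Note the outcome of one attempt at a task."""
        self.attempts[task] += 1
        self.successes[task] += int(solved)

    def success_rate(self, task):
        """Return the fraction of attempts solved, or None if never tried."""
        n = self.attempts[task]
        return self.successes[task] / n if n else None

    def is_useful(self, task):
        """A task is useful if it is barely tested or of middling difficulty."""
        rate = self.success_rate(task)
        if rate is None or self.attempts[task] < self.min_attempts:
            return True
        return self.low <= rate <= self.high

    def next_task(self, rng):
        """Pick a useful task at random; fall back to any task if none qualify."""
        candidates = [t for t in self.tasks if self.is_useful(t)]
        return rng.choice(candidates or self.tasks)


tracker = TaskTracker(["tag", "hide_and_seek", "capture_the_flag"])
for _ in range(10):
    tracker.record("tag", solved=True)  # always solved: now too easy
for i in range(10):
    tracker.record("hide_and_seek", solved=(i % 2 == 0))  # solved half the time

print([t for t in tracker.tasks if tracker.is_useful(t)])
# prints ['hide_and_seek', 'capture_the_flag']
print(tracker.next_task(random.Random(0)))
```

In this toy version, the mastered game drops out of rotation while the half-solved and untried games remain, which mirrors the article's point that agents keep learning as long as they are handed suitable new tasks.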

In running their virtual world, the researchers found that the agents picked up useful new skills, generally by accident, and then built on them. This led to more advanced behaviors, such as resorting to experimentation when running out of options, cooperating with other agents and learning how to use objects as tools. The researchers suggest their approach is a step toward creating generally capable algorithms that learn how to play new games on their own, skills that might one day be used by autonomous robots.

More information: Adam Stooke et al, Open-Ended Learning Leads to Generally Capable Agents, arXiv:2107.12808v1 [cs.LG] arxiv.org/abs/2107.12808

deepmind.com/blog/article/gene … from-open-ended-play

Citation: Using generalization techniques to make AI systems more versatile (2021, August 2) retrieved 16 August 2024 from https://techxplore.com/news/2021-08-techniques-ai-versatile.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
