December 8, 2017 weblog

AlphaZero algorithm can pick up victory moves in chess

by Nancy Owano, Tech Xplore

(Tech Xplore)—DeepMind AI has claimed a chess crown. Google's DeepMind division played 100 games against Stockfish 8 and won or drew all of them, said the BBC.

According to the BBC, details published on arXiv show that the AlphaZero algorithm was able to outperform Stockfish only four hours after being given the rules of chess and being told to learn by playing simulations against itself.

The team's paper describing the work is on arXiv. The authors reported that the software had been generalized and was able to learn other games.

The authors wrote about the AlphaZero algorithm achieving, "tabula rasa, superhuman performance in many challenging domains," not just in chess. With no knowledge other than the game rules, the algorithm achieved within 24 hours what the authors called "a superhuman level of play" in chess, shogi (Japanese chess) and Go, "and convincingly defeated a world-champion program in each case."

"Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm" is the title of the paper, which was submitted December 5.

What's remarkable, though, goes beyond the chess win alone. James Vincent in The Verge found the truly remarkable feat was that "in less than 24 hours, the same computer program was able to teach itself how to play three complex board games at superhuman levels. That's a new feat for the world of AI." The three games were Go, chess and shogi.

University of Oxford's Prof. Michael Wooldridge was quoted by the BBC. "The general trajectory in DeepMind seems to be to solve a problem and then demonstrate it can really ramp up performance, and that's very impressive." At the same time, Wooldridge observed that the three games were fairly "closed" in the sense they had limited sets of rules to contend with. "In the real world we don't know what is round the corner," he explained. "Coping when you don't know what is coming is much more complicated, and things will get even more exciting when DeepMind moves on to more open problems."

AlphaZero wasn't specifically designed to play chess. James Vincent in The Verge: "In each case, it was given some basic rules (like how knights move in chess, and so on) but was programmed with no other strategies or tactics. It simply got better by playing itself over and over again at an accelerated pace, a method of training AI known as 'reinforcement learning.'"
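To make the idea of learning from rules alone more concrete, here is a minimal sketch of self-play reinforcement learning on a toy game. It is not the AlphaZero method: AlphaZero combines deep neural networks with search, whereas this sketch uses a simple lookup table. The game (players alternately take one to three stones from a pile, and whoever takes the last stone wins), the function names and the parameter values are all assumptions chosen for illustration.

```python
import random

MAX_TAKE = 3  # assumed toy rule: a player may remove 1, 2 or 3 stones per turn


def legal_moves(pile):
    """Return the moves the rules allow from a pile of the given size."""
    return list(range(1, min(MAX_TAKE, pile) + 1))


def best_move(q, pile):
    """Pick the legal move with the highest learned value."""
    return max(legal_moves(pile), key=lambda m: q.get((pile, m), 0.0))


def train(episodes=20000, max_pile=12, alpha=0.5, epsilon=0.2, seed=0):
    """Learn move values purely by playing the game against itself."""
    rng = random.Random(seed)
    q = {}  # (pile, move) -> estimated value for the player about to move
    for _ in range(episodes):
        pile = rng.randint(1, max_pile)  # random start so every position is seen
        while pile > 0:
            moves = legal_moves(pile)
            if rng.random() < epsilon:
                move = rng.choice(moves)      # occasionally explore
            else:
                move = best_move(q, pile)     # otherwise play the best known move
            remaining = pile - move
            if remaining == 0:
                target = 1.0  # taking the last stone wins
            else:
                # The opponent moves next, so their best outcome is our worst.
                target = -max(q.get((remaining, m), 0.0)
                              for m in legal_moves(remaining))
            old = q.get((pile, move), 0.0)
            q[(pile, move)] = old + alpha * (target - old)
            pile = remaining  # the same table now plays the other side
    return q


if __name__ == "__main__":
    q = train()
    for pile in (5, 6, 7, 9, 10, 11):
        print(pile, "->", best_move(q, pile))
```

No strategy is coded in, only the legal moves and the winning condition. After enough self-play games the table should favor moves that leave the opponent a pile that is a multiple of four, which is the known winning strategy for this toy game.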

The authors said that the AlphaZero algorithm was "a more generic version" of the AlphaGo Zero algorithm that was introduced in the context of Go. "It replaces the handcrafted knowledge and domain-specific augmentations used in traditional game-playing programs with deep neural networks and a tabula rasa reinforcement learning algorithm."

More information: Mastering Chess and Shogi by Self-Play with a General Reinforcement Learning Algorithm, arXiv:1712.01815 [cs.AI] arxiv.org/abs/1712.01815

Abstract
The game of chess is the most widely-studied domain in the history of artificial intelligence. The strongest programs are based on a combination of sophisticated search techniques, domain-specific adaptations, and handcrafted evaluation functions that have been refined by human experts over several decades. In contrast, the AlphaGo Zero program recently achieved superhuman performance in the game of Go, by tabula rasa reinforcement learning from games of self-play. In this paper, we generalise this approach into a single AlphaZero algorithm that can achieve, tabula rasa, superhuman performance in many challenging domains. Starting from random play, and given no domain knowledge except the game rules, AlphaZero achieved within 24 hours a superhuman level of play in the games of chess and shogi (Japanese chess) as well as Go, and convincingly defeated a world-champion program in each case.

Citation: AlphaZero algorithm can pick up victory moves in chess (2017, December 8) retrieved 17 July 2024 from https://techxplore.com/news/2017-12-alphazero-algorithm-victory-chess.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
