June 6, 2016 weblog

Google team taking upper hand if misbehaving AI agent attempts anything terribly smart

by Nancy Owano , Tech Xplore

artificial intelligence — Credit: CC0 Public Domain

(Tech Xplore)—The twin effects of popular fiction (machines turning into killer squadrons) and scientific progress (machine learning) have made AI both exciting and scary in its future potential.

Google knows. TechRadar and other tech sites are reporting that Google is thinking up a kill switch for dangerous AI.

As TechRadar put it, humans can keep the upper hand, for now. David Nield reported: "Google is working on an AI 'kill switch' that allows human operators to turn off super intelligent systems no matter how big their egos get. It's called "safe interruptibility" and it's being developed as part of the DeepMind system."

An open letter went out last year that caused as much of a stir as the worrisome statements preceding it over building superintelligent machines. Notable names in that letter included Elon Musk and Stephen Hawking, and the letter focused on the need to ensure AI research benefits humanity.

"An Open Letter: Research Priorities for Robust and Beneficial Artificial Intelligence" said that "There is now a broad consensus that AI research is progressing steadily, and that its impact on society is likely to increase. The potential benefits are huge, since everything that civilization has to offer is a product of human intelligence; we cannot predict what we might achieve when this intelligence is magnified by the tools AI may provide, but the eradication of disease and poverty are not unfathomable. Because of the great potential of AI, it is important to research how to reap its benefits while avoiding potential pitfalls."

That open letter resonates with the step taken by authors who have released a paper exploring the best ways to prevent self-learning machines from turning off the "big red button" humans might use to shut them down.

Laurent Orseau, Google DeepMind, London, and Stuart Armstrong, The Future of Humanity Institute, University of Oxford, have written "Safely Interruptible Agents."

(DeepMind is the AI research lab owned by Google. DeepMind was founded by Demis Hassabis, Shane Legg and Mustafa Suleyman in London, 2010 and was acquired by Google in early 2014.)

The authors ask, "Given that the human operator has designed a correct reward function for the task, how to make sure that human interventions during the learning process will not induce a bias toward undesirable behaviors?"

They said that "We have proposed a framework to allow a human operator to repeatedly safely interrupt a reinforcement learning agent while making sure the agent will not learn to prevent or induce these interruptions. Safe interruptibility can be useful to take control of a robot that is misbehaving..."

The authors in their abstract pointed out that "now and then it may be necessary for a human operator to press the big red button to prevent the agent from continuing a harmful sequence of actions—harmful either for the agent or for the environment—and lead the agent into a safer situation...This paper explores a way to make sure a learning agent will not learn to prevent (or seek!) being interrupted by the environment or a human operator."

Nield explained what is happening here: "At the heart of the issue is the idea of rewards, or goals that the software code has been programmed to aim for - as AI gets cleverer, it may develop its own rewards that we haven't anticipated (it starts making its own judgements, in other words)."

Timothy Seppala, Associate Editor, Engadget, said, "Essentially, DeepMind has developed a framework that'll keep AI from learning how to prevent—or induce—human interruption of whatever it's doing."

If you check out the story postings on this you see numerous references to the red button as the kill switch. The button obviously is not a real hardware piece the authors are constructing; Darren Orf in Gizmodo described the nature of the contents: "The paper is filled with math 99 percent of us will never understand, which basically describes methods for building what the paper cheekily calls a 'big red button' into AI."

Tyler Lee in Ubergizmo described "a panic button of sorts" to instantly interrupt AI and stop it from doing any harm.

More information: Safely Interruptible Agents (PDF): intelligence.org/files/Interruptibility.pdf

Citation: Google team taking upper hand if misbehaving AI agent attempts anything terribly smart (2016, June 6) retrieved 29 June 2024 from https://techxplore.com/news/2016-06-google-team-upper-misbehaving-ai.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Scientists urge artificial intelligence safety focus

30 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (1)

Google team taking upper hand if misbehaving AI agent attempts anything terribly smart

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Scientists urge artificial intelligence safety focus

DeepMind's AI team explores navigation powers with 3-D maze

Google buys artificial intelligence firm DeepMind

Google teams with Oxford to teach machines to think

How your brain learns to ride the subway—and why AI developers care

Google DeepMind acquisition researchers working on a Neural Turing Machine

Security experts find millions of users running malware infected extensions from Google Chrome Web Store

New security loophole allows spying on internet users visiting websites and watching videos

AI browser plug-ins to help consumers improve digital privacy literacy, combat manipulative design

Can we rid artificial intelligence of bias?

Orphan articles: The 'dark matter' of Wikipedia

The tentacles of retracted science reach deep into social media: A simple button could change that

Phys.org

Medical Xpress

Science X

Google team taking upper hand if misbehaving AI agent attempts anything terribly smart

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Scientists urge artificial intelligence safety focus

DeepMind's AI team explores navigation powers with 3-D maze

Google buys artificial intelligence firm DeepMind

Google teams with Oxford to teach machines to think

How your brain learns to ride the subway—and why AI developers care

Google DeepMind acquisition researchers working on a Neural Turing Machine

Recommended for you

Security experts find millions of users running malware infected extensions from Google Chrome Web Store

New security loophole allows spying on internet users visiting websites and watching videos

AI browser plug-ins to help consumers improve digital privacy literacy, combat manipulative design

Can we rid artificial intelligence of bias?

Orphan articles: The 'dark matter' of Wikipedia

The tentacles of retracted science reach deep into social media: A simple button could change that

Your Privacy