August 19, 2020

Researchers ask AI to explain itself

by Chad Boutin, National Institute of Standards and Technology

NIST asks AI to explain itself — NIST scientists have proposed four principles for judging how explainable an artificial intelligence's decisions are. Credit: B. Hayes/NIST

It's a question that many of us encounter in childhood: "Why did you do that?" As artificial intelligence (AI) begins making more consequential decisions that affect our lives, we also want these machines to be capable of answering that simple yet profound question. After all, why else would we trust AI's decisions?

This desire for satisfactory explanations has spurred scientists at the National Institute of Standards and Technology (NIST) to propose a set of principles by which we can judge how explainable AI's decisions are. Their draft publication, Four Principles of Explainable Artificial Intelligence (Draft NISTIR 8312), is intended to stimulate a conversation about what we should expect of our decision-making devices.

The report is part of a broader NIST effort to help develop trustworthy AI systems. NIST's foundational research aims to build trust in these systems by understanding their theoretical capabilities and limitations and by improving their accuracy, reliability, security, robustness and explainability, which is the focus of this latest publication.

The authors are requesting feedback on the draft from the public—and because the subject is a broad one, touching upon fields ranging from engineering and computer science to psychology and legal studies, they are hoping for a wide-ranging discussion.

"AI is becoming involved in high-stakes decisions, and no one wants machines to make them without an understanding of why," said NIST electronic engineer Jonathon Phillips, one of the report's authors. "But an explanation that would satisfy an engineer might not work for someone with a different background. So, we want to refine the draft with a diversity of perspective and opinions."

An understanding of the reasons behind the output of an AI system can benefit everyone the output touches. If an AI contributes to a loan approval decision, for example, this understanding might help a software designer improve the system. But the applicant might want insight into the AI's reasoning as well, either to understand why she was turned down, or, if she was approved, to help her continue acting in ways that maintain her good credit rating.

According to the authors, the four principles for explainable AI are:

AI systems should deliver accompanying evidence or reasons for all their outputs.
Systems should provide explanations that are meaningful or understandable to individual users.
The explanation correctly reflects the system's process for generating the output.
The system only operates under conditions for which it was designed or when the system reaches a sufficient confidence in its output. (The idea is that if a system has insufficient confidence in its decision, it should not supply a decision to the user.)

While these principles are straightforward enough on the surface, Phillips said that individual users often have varied criteria for judging an AI's success at meeting them. For instance, the second principle—how meaningful the explanation is—can imply different things to different people, depending on their role and connection to the job the AI is doing.

"Think about Kirk and Spock and how each one talks," Phillips said, referencing the Star Trek characters. "A doctor using an AI to help diagnose disease may only need Spock's explanation of why the machine recommends a particular treatment, while the patient might be OK with less technical detail but want Kirk's background on how it relates to his life."

Phillips and his co-authors align their concepts of explainable AI to relevant previous work in artificial intelligence, but they also compare the demands for explainability we place on our machines to those we place on our fellow humans. Do we measure up to the standards we are asking of AI? After exploring how human decisions hold up in light of the report's four principles, the authors conclude that—spoiler alert—we don't.

"Human-produced explanations for our own choices and conclusions are largely unreliable," they write, citing several examples. "Without conscious awareness, people incorporate irrelevant information into a variety of decisions from personality trait judgments to jury decisions."

However, our awareness of this apparent double standard could eventually help us better understand our own decisions and create a safer, more transparent world.

"As we make advances in explainable AI, we may find that certain parts of AI systems are better able to meet societal expectations and goals than humans are," said Phillips, whose past research indicates that collaborations between humans and AI can produce greater accuracy than either one working alone. "Understanding the explainability of both the AI system and the human opens the door to pursue implementations that incorporate the strengths of each."

For the moment, Phillips said, the authors hope the comments they receive advance the conversation.

"I don't think we know yet what the right benchmarks are for explainability," he said. "At the end of the day we're not trying to answer all these questions. We're trying to flesh out the field so that discussions can be fruitful."

More information: Phillips et al., Four Principles of Explainable Artificial Intelligence. (2020). nvlpubs.nist.gov/nistpubs/ir/2 … ST.IR.8312-draft.pdf

Provided by National Institute of Standards and Technology

This story is republished courtesy of NIST. Read the original story here.

Citation: Researchers ask AI to explain itself (2020, August 19) retrieved 26 July 2024 from https://techxplore.com/news/2020-08-ai.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

People prefer robots to explain themselves – and a brief summary doesn't cut it

68 shares

Feedback to editors

Generative AI creates personalized storybooks for the future of child language learning

6 hours ago

Study explores win–win potential of grass-powered energy production

6 hours ago

Novel algorithm for discovering anomalies in data outperforms current software

7 hours ago

Deep learning models can be trained with limited data: New method could reduce errors in computational imaging

8 hours ago

Experts warn against hype for deriving green hydrogen from direct seawater electrolysis

9 hours ago

New microgrids model takes into account a fair design of decentralized energy systems

9 hours ago

Engineers develop magnetic tunnel junction–based device to make AI more energy efficient

9 hours ago

Robot Spot configured to find and stun weeds using a blowtorch

10 hours ago

Magnetic fields help understand light particle splitting for boosting solar cell efficiency

11 hours ago

OpenAI to challenge Google with new search functionality

Jul 25, 2024

Load comments (0)

Researchers ask AI to explain itself

Generative AI creates personalized storybooks for the future of child language learning

Study explores win–win potential of grass-powered energy production

Novel algorithm for discovering anomalies in data outperforms current software

Deep learning models can be trained with limited data: New method could reduce errors in computational imaging

Experts warn against hype for deriving green hydrogen from direct seawater electrolysis

New microgrids model takes into account a fair design of decentralized energy systems

Engineers develop magnetic tunnel junction–based device to make AI more energy efficient

Robot Spot configured to find and stun weeds using a blowtorch

Magnetic fields help understand light particle splitting for boosting solar cell efficiency

OpenAI to challenge Google with new search functionality

People prefer robots to explain themselves – and a brief summary doesn't cut it

An approach to enhance machine learning explanations

ExAG: An image-guessing game to evaluate the helpfulness of machine explanations

New NIST guidelines aim to help IT system developers build security in from the ground up

Stop gambling with black box and explainable models on high-stakes decisions

Humans and machines can improve accuracy when they work together

Engineers develop magnetic tunnel junction–based device to make AI more energy efficient

Robot Spot configured to find and stun weeds using a blowtorch

Generative AI creates personalized storybooks for the future of child language learning

Novel algorithm for discovering anomalies in data outperforms current software

Deep learning models can be trained with limited data: New method could reduce errors in computational imaging

OpenAI to challenge Google with new search functionality

Phys.org

Medical Xpress

Science X

Researchers ask AI to explain itself

Generative AI creates personalized storybooks for the future of child language learning

Study explores win–win potential of grass-powered energy production

Novel algorithm for discovering anomalies in data outperforms current software

Deep learning models can be trained with limited data: New method could reduce errors in computational imaging

Experts warn against hype for deriving green hydrogen from direct seawater electrolysis

New microgrids model takes into account a fair design of decentralized energy systems

Engineers develop magnetic tunnel junction–based device to make AI more energy efficient

Robot Spot configured to find and stun weeds using a blowtorch

Magnetic fields help understand light particle splitting for boosting solar cell efficiency

OpenAI to challenge Google with new search functionality

Related Stories

People prefer robots to explain themselves – and a brief summary doesn't cut it

An approach to enhance machine learning explanations

ExAG: An image-guessing game to evaluate the helpfulness of machine explanations

New NIST guidelines aim to help IT system developers build security in from the ground up

Stop gambling with black box and explainable models on high-stakes decisions

Humans and machines can improve accuracy when they work together

Recommended for you

Engineers develop magnetic tunnel junction–based device to make AI more energy efficient

Robot Spot configured to find and stun weeds using a blowtorch

Generative AI creates personalized storybooks for the future of child language learning

Novel algorithm for discovering anomalies in data outperforms current software

Deep learning models can be trained with limited data: New method could reduce errors in computational imaging

OpenAI to challenge Google with new search functionality

Your Privacy