December 23, 2020 feature

Exploring the notion of shortcut learning in deep neural networks

by Ingrid Fadelli , Tech Xplore

Over the past few years, artificial intelligence (AI) tools, particularly deep neural networks, have achieved remarkable results on a number of tasks. However, recent studies have found that these computational techniques have a number of limitations. In a recent paper published in Nature Machine Intelligence, researchers at Tübingen and Toronto universities explored and discussed a problem known as 'shortcut learning' that appears to underpin many of the shortcomings of deep neural networks identified in recent years.

"I decided to start working on this project during a science-related travel in the U.S., together with Claudio Michaelis, a dear colleague and friend of mine," Robert Geirhos, one of the researchers who carried out the study, told TechXplore. "We first attended a deep learning conference, then visited an animal research laboratory, and finally, a human vision conference. Somewhat surprisingly, we noticed the very same pattern in very different settings: 'shortcut learning,' or 'cheating,' appeared to be a common characteristic across both artificial and biological intelligence."

Geirhos and Michaelis believed that shortcut learning, the phenomenon they observed, could explain the discrepancy between the excellent performance and iconic failures of many deep neural networks. To investigate this idea further, they teamed up with other colleagues, including Jörn-Henrik Jacobsen, Richard Zemel, Wieland Brendel, Matthias Bethge and Felix Wichmann.

The researchers each contributed to the study in unique ways, aligned with their fields of expertise, which ranged from neuroscience to machine learning and psychophysics. Their paper includes examples of shortcut learning and cheating both in machines and animals—for instance, specific failures of deep neural networks, as well as instances where rats 'cheated' in experiments and students cheated in exams.

"We hope that our perspective provides a good introduction to the problem and encourages the adoption of stronger and more appropriate testing methods to prevent cheating before attributing high-level abilities to machines," Geirhos said. "Given that the article is a perspective, we build upon many fantastic articles from a broad range of authors, each contributing their piece to the puzzle. For me personally, an important precursor was the project that I presented at the ICLR and VSS conferences, discovering a texture bias in neural networks—an instance of shortcut learning."

The term shortcut learning describes the process through which machines attempt to identify the simplest solution or a 'shortcut' to solve a given problem. For example, a deep neural network may realize that a particular texture patch or part of an object (e.g., a car tire) is typically enough for them to predict the presence of a car in an image, and might thus start predicting the presence of a car in images even when they only include car tires.

"Shortcut learning essentially means that neural networks love to cheat," Geirhos said. "At first glance, AI often seems to work excellently—for example, it can recognize whether a picture contains animals, e.g., sheep. Only upon closer inspection, it is discovered that the neural network has cheated and just looked at the background."

An example of a neural network cheating is a situation in which it categorizes an empty green landscape as 'sheep' simply because it previously processed images in which sheep were standing in front of a natural landscape, while failing to recognize an actual sheep when it is in an unusual setting (e.g., on the beach). This is one of the many examples that Geirhos and his colleagues mention in their paper.

While this is a straightforward example of shortcut learning, often these patterns of cheating are far more subtle. They can be so subtle that researchers sometimes struggle to identify the cheating strategy that an artificial neural network is adopting and may simply be aware that it is not solving a task in the way they hoped it would.

"This pattern of cheating has parallels in everyday life, for example, when pupils prepare for class tests and only learn facts by heart without developing a true understanding of the problem," Geirhos said. "Unfortunately, in the field of AI, shortcut learning not only leads to deceptively good performance, but under certain circumstances, also to discrimination, for example, when an AI prefers to propose men for jobs because previous positions have already been filled mainly by men."

The paper defines, describes and explores the concept of shortcut learning, while also explaining how it can affect the performance of deep neural networks and drawing analogies with behaviors observed in humans and other animals. Their work could inspire other research teams to examine the shortcomings of deep neural networks in more detail, perhaps aiding the development of solutions that prevent them from cheating. Geirhos and some of his colleagues are now developing stronger test methods to scrutinize the limitations of both existing and emerging deep neural network-based models.

"We encourage our colleagues to jointly develop and apply stronger test procedures: As long as one has not examined whether an algorithm can cope with unexpected images, such as a cow on the beach, cheating must at least be considered a serious possibility," Geirhos said. "All that glitters is not gold: Just because AI is reported to achieve high scores on a benchmark doesn't mean that AI has also solved the problem we actually care about; sometimes, AI just finds a shortcut. Fortunately, however, current methods of artificial intelligence are by no means stupid, just too lazy: If challenged sufficiently, they can learn highly complex relationships—but if they have discovered a simple shortcut, they would be the last to complain about it."

More information: Shortcut learning in deep neural networks. Nature Machine Intelligence(2020). DOI: 10.1038/s42256-020-00257-z

ImageNet-trained CNNs are biased towards texture; increasing shape bias improves accuracy and robustness. openreview.net/forum?id=Bygh9j09KX

Recognition in terra incognita. openaccess.thecvf.com/content_ … _ECCV_2018_paper.pdf

Journal information: Nature Machine Intelligence

Citation: Exploring the notion of shortcut learning in deep neural networks (2020, December 23) retrieved 29 June 2024 from https://techxplore.com/news/2020-12-exploring-notion-shortcut-deep-neural.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Misinformation or artifact: A new way to think about machine learning

181 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (1)

Exploring the notion of shortcut learning in deep neural networks

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Misinformation or artifact: A new way to think about machine learning

New data processing module makes deep neural networks smarter

Neuroscience opens the black box of artificial intelligence

Fooling deep neural networks for object detection with adversarial 3-D logos

The brain's memory abilities inspire AI experts in making neural networks less 'forgetful'

DeepMind uses neural network to help explain meta-learning in people

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Phys.org

Medical Xpress

Science X

Exploring the notion of shortcut learning in deep neural networks

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Misinformation or artifact: A new way to think about machine learning

New data processing module makes deep neural networks smarter

Neuroscience opens the black box of artificial intelligence

Fooling deep neural networks for object detection with adversarial 3-D logos

The brain's memory abilities inspire AI experts in making neural networks less 'forgetful'

DeepMind uses neural network to help explain meta-learning in people

Recommended for you

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Your Privacy