Study highlights how AI models take potentially dangerous 'shortcuts' in solving complex recognition tasks

Deep convolutional neural networks (DCNNs) don't see objects the way humans do—using configural shape perception—and that could be dangerous in real-world AI applications, says Professor James Elder, co-author of a York University study published today.

Published in the Cell Press journal iScience, Deep learning models fail to capture the configural nature of human shape perception is a collaborative study by Elder, who holds the York Research Chair in Human and Computer Vision and is Co-Director of York's Centre for AI & Society, and Assistant Psychology Professor Nicholas Baker at Loyola College in Chicago, a former VISTA postdoctoral fellow at York.

The study employed novel visual stimuli called "Frankensteins" to explore how the human brain and DCNNs process holistic, configural object properties.

"Frankensteins are simply objects that have been taken apart and put back together the wrong way around," says Elder. "As a result, they have all the right local features, but in the wrong places."

The investigators found that while the human visual system is confused by Frankensteins, DCNNs are not—revealing an insensitivity to configural object properties.

"Our results explain why deep AI models fail under certain conditions and point to the need to consider tasks beyond object recognition in order to understand visual processing in the brain," Elder says. "These deep models tend to take 'shortcuts' when solving complex recognition tasks. While these shortcuts may work in many cases, they can be dangerous in some of the real-world AI applications we are currently working on with our industry and government partners," Elder points out.

One such application is traffic video safety systems: "The objects in a busy traffic scene—the vehicles, bicycles and pedestrians—obstruct each other and arrive at the eye of a driver as a jumble of disconnected fragments," explains Elder. "The brain needs to correctly group those fragments to identify the correct categories and locations of the objects. An AI system for traffic safety monitoring that is only able to perceive the fragments individually will fail at this task, potentially misunderstanding risks to vulnerable road users."

According to the researchers, modifications to training and architecture aimed at making networks more brain-like did not lead to configural processing, and none of the networks were able to accurately predict trial-by-trial human object judgements. "We speculate that to match human configural sensitivity, networks must be trained to solve broader range of object tasks beyond category recognition," notes Elder.

More information: Nicholas Baker et al, Deep learning models fail to capture the configural nature of human shape perception, iScience (2022). DOI: 10.1016/j.isci.2022.104913

Journal information: iScience

Provided by York University

Study highlights how AI models take potentially dangerous 'shortcuts' in solving complex recognition tasks

Advancing human-like perception in self-driving vehicles

New approach uses generative AI to imitate human motion

AI and holography bring 3D augmented reality to regular glasses

Lab's AI work results in increased revenue, decreased land requirements for wind power industry

Teaching robots to move by sketching trajectories

New large learning model shows how AI might shape LGBTQIA+ advocacy

A framework to detect hallucinations in the text generated by LLMs

Why getting in touch with our 'gerbil brain' could help machines listen better

A new, low-cost, high-efficiency photonic integrated circuit

Scientists determine disorder improves lithium-ion battery life

Chemists present roadmap to a carbon-neutral refinery by 2050

Flexible pseudocapacitor defies climate extremes, packs energy punch

A low-energy process for high-performance solar cells could simplify the manufacturing process

Researchers identify cause of electron-hole separation in thin-film solar cells to increase solar cell efficiency

Video shows how swarms of miniature robots simultaneously clean up microplastics and microbes

Computer scientists discover vulnerability in cloud server hardware used by AMD and Intel chips

New process brings commercialization of CO₂ utilization technology to produce formic acid one step closer

Researchers engineer sound-suppressing silk to reduce noise transmission in a large room

Scientists create robot snails that can move independently using tracks or work together to climb

Study highlights how AI models take potentially dangerous 'shortcuts' in solving complex recognition tasks

Let us know if there is a problem with our content

Thank you for taking time to provide your feedback to the editors

Share article

E-MAIL THE STORY