September 26, 2022

AI worse at recognizing images than humans

by National Research University Higher School of Economics

Researchers from HSE University and Moscow Polytechnic University have discovered that AI models are unable to represent features of human vision due to a lack of tight coupling with the respective physiology, so they are worse at recognizing images. The results of the study were published in the Proceedings of the Seventh International Congress on Information and Communication Technology.

To understand how machine perception of images differs from human perception, scientists uploaded images of classical visual illusions to the IBM Watson Visual Recognition online service. Most of them were geometric silhouettes, partially hidden by geometric shapes of the background color. The system tried to determine the nature of the image and indicated the degree of certainty in its response.

It turned out that artificial intelligence is not able to recognize any imaginary figure, with the exception of a colored imaginary triangle. Due to the high contrast with the background, it was recognized correctly.

"Objects similar to those that we used during the experiment can be found in real life," says Vladimir Vinnikov, an analyst at the Laboratory of Methods for Big Data Analysis of HSE Faculty of Computer Science and author of the study. "For example, autopilot of a car or airplane perceives a trailer or a radio tower, which at night are indicated only by marker lights, the same way as we perceive imaginary geometric shapes."

The human eye is constantly moving involuntarily, and the photosensitive surface of its retina has the shape of a hemisphere. A person can see an illusion if the image is a vector, i.e., if it includes reference points and curves connecting them. The human imagination will complete the picture due to constant eye movement, a physiological feature of our vision.

In optoelectronic systems everything is arranged differently. Their light-sensitive matrix has a flat, usually rectangular shape, and the lens system itself is not nearly as free in movement as the human eye. Therefore, artificial intelligence cannot complete imaginary lines that connect fragments of a geometric illusion. Machine vision sees only what is actually depicted, whereas people complete the image in their imagination based on its outlines.

Today, neural network image recognition systems are actively spreading in the commercial sector. However, the question of how accurately machines recognize images is still open. Human lives may depend on the accuracy of recognition. For example, an accident may occur if the autopilot of a car or airplane does not recognize an object with low contrast relative to the background and is not able to dodge an obstacle in time.

Scientists believe that inaccuracy of machine image recognition can be corrected. For example, they can complement the recognition of raster images, which represent a grid of pixels, by simulating physiological features of eye movement that allow the eye to see two-dimensional and three-dimensional scenes. An alternative way is to add vector description of the images, which will help to program the machine to bypass the image along the trajectories specified by the vectors.

"Imaginary objects should definitely be used as tests in systems that depend on the recognition of photo and video streams, for example, in autopilots of cars or drones. This will help to avoid the risks associated with the use of machine intelligence systems in industry and transport systems," says Vinnikov.

More information: Vladimir Vinnikov et al, Deficiencies of Computational Image Recognition in Comparison to Human Counterpart, Proceedings of Seventh International Congress on Information and Communication Technology (2022). DOI: 10.1007/978-981-19-1607-6_43

Provided by National Research University Higher School of Economics

Citation: AI worse at recognizing images than humans (2022, September 26) retrieved 19 April 2024 from https://techxplore.com/news/2022-09-ai-worse-images-humans.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Extra 'eye' movements are the key to better self-driving cars

36 shares

Feedback to editors

Researchers develop sodium battery capable of rapid charging in just a few seconds

3 hours ago

Greater access to clean water, thanks to a better membrane

4 hours ago

Silent flight edges closer to take off, according to new research

5 hours ago

A flexible and efficient DC power converter for sustainable-energy microgrids

5 hours ago

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

6 hours ago

To build a better AI helper, start by modeling the irrational behavior of humans

6 hours ago

Versatile fibers offer improved energy storage capacity for wearable devices

7 hours ago

Harnessing solar energy for high-efficiency NH₃ production

7 hours ago

A dexterous four-legged robot that can walk and handle objects simultaneously

9 hours ago

Climate change will increase value of residential rooftop solar panels across US, study finds

11 hours ago

Load comments (1)

AI worse at recognizing images than humans

Researchers develop sodium battery capable of rapid charging in just a few seconds

Greater access to clean water, thanks to a better membrane

Silent flight edges closer to take off, according to new research

A flexible and efficient DC power converter for sustainable-energy microgrids

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Versatile fibers offer improved energy storage capacity for wearable devices

Harnessing solar energy for high-efficiency NH₃ production

A dexterous four-legged robot that can walk and handle objects simultaneously

Climate change will increase value of residential rooftop solar panels across US, study finds

Extra 'eye' movements are the key to better self-driving cars

Skeletal shapes key to rapid recognition of objects

Breaking AIs to make them better

Facebook enhances AI computer vision with SEER

Convolutional neural networks can be tricked by the same visual illusions as people

Convolution neural network used to identify dog breeds from photographs

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Team develops a way to teach a computer to type like a human

For more open and equitable public discussions on social media, try 'meronymity'

Using sim-to-real reinforcement learning to train robots to do simple tasks in broad environments

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

Phys.org

Medical Xpress

Science X

AI worse at recognizing images than humans

Researchers develop sodium battery capable of rapid charging in just a few seconds

Greater access to clean water, thanks to a better membrane

Silent flight edges closer to take off, according to new research

A flexible and efficient DC power converter for sustainable-energy microgrids

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Versatile fibers offer improved energy storage capacity for wearable devices

Harnessing solar energy for high-efficiency NH₃ production

A dexterous four-legged robot that can walk and handle objects simultaneously

Climate change will increase value of residential rooftop solar panels across US, study finds

Related Stories

Extra 'eye' movements are the key to better self-driving cars

Skeletal shapes key to rapid recognition of objects

Breaking AIs to make them better

Facebook enhances AI computer vision with SEER

Convolutional neural networks can be tricked by the same visual illusions as people

Convolution neural network used to identify dog breeds from photographs

Recommended for you

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Team develops a way to teach a computer to type like a human

For more open and equitable public discussions on social media, try 'meronymity'

Using sim-to-real reinforcement learning to train robots to do simple tasks in broad environments

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

Your Privacy