December 18, 2018

New AI system mimics how humans visualize and identify objects

by Matthew Chin, University of California, Los Angeles

New AI system developed at UCLA, Stanford mimics how humans visualize and identify objects — A “computer vision” system developed at UCLA can identify objects based on only partial glimpses, like by using these photo snippets of a motorcycle. Credit: University of California, Los Angeles

UCLA and Stanford University engineers have demonstrated a computer system that can discover and identify the real-world objects it "sees" based on the same method of visual learning that humans use.

The system is an advance in a type of technology called "computer vision," which enables computers to read and identify visual images. It could be an important step toward general artificial intelligence systems—computers that learn on their own, are intuitive, make decisions based on reasoning and interact with humans in a much more human-like way. Although current AI computer vision systems are increasingly powerful and capable, they are task-specific, meaning that their ability to identify what they see is limited by how much they've been trained and programmed by humans.

Even today's best computer vision systems cannot create a full picture of an object after seeing only certain parts of it—and the systems can be fooled by viewing the object in an unfamiliar setting. Engineers are aiming to make computer systems with those abilities—just like humans can understand that they are looking at a dog, even if the animal is hiding behind a chair and only the paws and tail are visible. Humans, of course, can also easily intuit where the dog's head and the rest of its body are, but that ability still eludes most artificial intelligence systems.

Current computer vision systems are not designed to learn on their own. They must be trained on exactly what to learn, usually by reviewing thousands of images in which the objects they're trying to identify are labeled for them. Computers, of course, also can't explain their rationale for determining what the object in a photo represents: AI-based systems don't build an internal picture or a common-sense model of learned objects the way humans do.

The engineers' new method, described in the Proceedings of the National Academy of Sciences, shows a way around those shortcomings.

New AI system mimics how humans visualize and identify objects — The system understands what a human body is by looking at thousands of images with people in them, and then ignoring nonessential background objects. Credit: University of California, Los Angeles

The approach is made up of three broad steps. First, the system breaks up an image into small chunks, which the researchers call "viewlets." Second, the computer learns how those viewlets fit together to form the object in question. And finally, it looks at what other objects are in the surrounding area, and whether or not information about those objects is relevant to describing and identifying the primary object.

To help the new system "learn" more like humans, the engineers decided to immerse it in an internet replica of the environment humans live in.

"Fortunately, the internet provides two things that help a brain-inspired computer vision system learn in the same way that humans do," said Vwani Roychowdhury, a UCLA professor of electrical and computer engineering and the study's principal investigator. "One is a wealth of images and videos that depict the same types of objects. The second is that those objects are shown from many perspectives—obscured, bird's eye, up-close—and they are placed in all different kinds of environments."

To develop the framework, the researchers drew insights from cognitive psychology and neuroscience.

"Starting as infants, we learn what something is because we see many examples of it, in many contexts," Roychowdhury said. "That contextual learning is a key feature of our brains, and it helps us build robust models of objects that are part of an integrated worldview where everything is functionally connected."

The researchers tested the system with about 9,000 images, each showing people and other objects. The platform was able to build a detailed model of the human body without external guidance and without the images being labeled.

The engineers ran similar tests using images of motorcycles, cars and airplanes. In all cases, their system performed better or at least as well as traditional computer vision systems that have been developed with many years of training.

The study's co-senior author is Thomas Kailath, a professor emeritus of electrical engineering at Stanford who was Roychowdhury's doctoral advisor in the 1980s. Other authors are former UCLA doctoral students Lichao Chen (now a research engineer at Google) and Sudhir Singh (who founded a company that builds robotic teaching companions for children).

Singh, Roychowdhury and Kailath previously worked together to develop one of the first automated visual search engines for fashion, the now-shuttered StileEye, which gave rise to some of the basic ideas behind the new research.

More information: Lichao Chen et al. Brain-inspired automated visual object discovery and detection, Proceedings of the National Academy of Sciences (2018). DOI: 10.1073/pnas.1802103115

Journal information: Proceedings of the National Academy of Sciences

Provided by University of California, Los Angeles

Citation: New AI system mimics how humans visualize and identify objects (2018, December 18) retrieved 29 June 2024 from https://techxplore.com/news/2018-12-ai-mimics-humans-visualize.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Research identifies key weakness in modern computer vision systems

606 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

New AI system mimics how humans visualize and identify objects

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Research identifies key weakness in modern computer vision systems

Want computers to see better in the real world? Train them in virtual reality

Computer vision system studies word use to recognize objects it has never seen before

Team develops vision system that improves object recognition

Recognizing the partially seen

Study suggests humans and computers use different processes to identify objects visually

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New tool detects AI-generated videos with 93.7% accuracy

Researchers propose the next platform for brain-inspired computing

Phys.org

Medical Xpress

Science X

New AI system mimics how humans visualize and identify objects

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Research identifies key weakness in modern computer vision systems

Want computers to see better in the real world? Train them in virtual reality

Computer vision system studies word use to recognize objects it has never seen before

Team develops vision system that improves object recognition

Recognizing the partially seen

Study suggests humans and computers use different processes to identify objects visually

Recommended for you

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New tool detects AI-generated videos with 93.7% accuracy

Researchers propose the next platform for brain-inspired computing

Your Privacy