July 14, 2021

Enabling the 'imagination' of artificial intelligence

A team of researchers at USC is helping AI imagine the unseen, a technique that could also lead to fairer AI, new medicines and increased autonomous vehicle safety.

Imagine an orange cat. Now, imagine the same cat, but with coal-black fur. Now, imagine the cat strutting along the Great Wall of China. As you do this, a quick series of neuron activations in your brain comes up with variations of the picture presented, based on your previous knowledge of the world.

In other words, as humans, it's easy to envision an object with different attributes. But, despite advances in deep neural networks that match or surpass human performance in certain tasks, computers still struggle with the very human skill of "imagination."

Now, a USC research team comprising computer science Professor Laurent Itti and Ph.D. students Yunhao Ge, Sami Abu-El-Haija and Gan Xin has developed an AI that uses human-like capabilities to imagine a never-before-seen object with different attributes. The paper, titled "Zero-Shot Synthesis with Group-Supervised Learning," was published in the 2021 International Conference on Learning Representations on May 7.

"We were inspired by human visual generalization capabilities to try to simulate human imagination in machines," said Ge, the study's lead author.

"Humans can separate their learned knowledge by attributes—for instance, shape, pose, position, color—and then recombine them to imagine a new object. Our paper attempts to simulate this process using neural networks."

AI's generalization problem

Say you want to create an AI system that generates images of cars. Ideally, you would provide the algorithm with a few images of a car, and it would be able to generate many types of cars, from Porsches to Pontiacs to pick-up trucks, in any color and from multiple angles.

This is one of the long-sought goals of AI: creating models that can extrapolate. This means that, given a few examples, the model should be able to extract the underlying rules and apply them to a vast range of novel examples it hasn't seen before. But machines are most commonly trained on sample features, pixels for instance, without taking into account the object's attributes.

The science of imagination

In this new study, the researchers attempt to overcome this limitation using a concept called disentanglement. Disentanglement can be used to generate deepfakes, for instance, by disentangling human face movements and identity. By doing this, said Ge, "people can synthesize new images and videos that substitute the original person's identity with another person, but keep the original movement."

Similarly, the new approach takes a group of sample images—rather than one sample at a time as traditional algorithms have done—and mines the similarity between them to achieve something called "controllable disentangled representation learning."
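To make the idea of mining similarity within a group more concrete, here is a minimal sketch in Python. It is not the authors' implementation: the sample names, the attribute labels and the `shared_attributes` helper are all hypothetical, and the sketch only does the bookkeeping of which samples in a group agree on which attribute. The learning step that would use those links is left out.

```python
from itertools import combinations

# Hypothetical group of labeled samples (names and attribute values are made up).
samples = {
    "img_a": {"shape": "sedan", "color": "red", "pose": "front"},
    "img_b": {"shape": "sedan", "color": "blue", "pose": "side"},
    "img_c": {"shape": "truck", "color": "red", "pose": "side"},
}


def shared_attributes(group):
    """Map each pair of sample ids to the sorted attributes on which they agree."""
    pairs = {}
    for (id1, attrs1), (id2, attrs2) in combinations(group.items(), 2):
        common = sorted(k for k in attrs1 if attrs1[k] == attrs2.get(k))
        if common:
            pairs[(id1, id2)] = common
    return pairs


links = shared_attributes(samples)
for pair, attrs in links.items():
    print(pair, "share", attrs)
```

Looking at a group rather than a single sample is what makes these links visible: two images that share only their shape give a hint about which part of a learned representation should encode shape.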

Then, it recombines this knowledge to achieve "controllable novel image synthesis," or what you might call imagination. "For instance, take the Transformer movie as an example," said Ge. "It can take the shape of Megatron car, the color and pose of a yellow Bumblebee car, and the background of New York's Times Square. The result will be a Bumblebee-colored Megatron car driving in Times Square, even if this sample was not witnessed during the training session."
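The recombination step can be pictured with a small sketch, again in Python and again only an illustration under stated assumptions. It assumes each attribute occupies a fixed slice of an eight-number latent vector. That layout, the `ATTRIBUTE_SLICES` table, the `recombine` helper and the toy vectors are hypothetical and not taken from the paper. In a real system an encoder would produce the vectors and a decoder would turn the recombined vector into an image; both are omitted here.

```python
from typing import Dict, List

# Assumed layout: each attribute owns a fixed slice of the latent vector.
ATTRIBUTE_SLICES = {
    "shape": slice(0, 2),
    "color": slice(2, 4),
    "pose": slice(4, 6),
    "background": slice(6, 8),
}


def recombine(sources: Dict[str, List[float]]) -> List[float]:
    """Build a new latent by copying each attribute's slice from its chosen source."""
    latent = [0.0] * 8
    for name, chunk in ATTRIBUTE_SLICES.items():
        latent[chunk] = sources[name][chunk]
    return latent


# Toy latents standing in for encoded images (values chosen so the origin is visible).
megatron = [1.0] * 8
bumblebee = [2.0] * 8
times_square = [3.0] * 8

imagined = recombine({
    "shape": megatron,
    "color": bumblebee,
    "pose": bumblebee,
    "background": times_square,
})
print(imagined)  # [1.0, 1.0, 2.0, 2.0, 2.0, 2.0, 3.0, 3.0]
```

The swap is only meaningful if the representation really is disentangled, that is, if changing the "color" slice changes the color and nothing else.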

This is similar to how we as humans extrapolate: when we see a color on one object, we can easily apply it to any other object by substituting the new color for the original one. Using their technique, the group generated a new dataset containing 1.56 million images that could help future research in the field.

Understanding the world

While disentanglement is not a new idea, the researchers say their framework can be compatible with nearly any type of data or knowledge. This widens the opportunity for applications. One example is disentangling race- and gender-related knowledge to make AI fairer by removing sensitive attributes from the equation altogether.
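As a rough illustration of what removing a sensitive attribute could look like once a representation is disentangled, the Python sketch below drops one chunk of a latent vector before it is passed on. The layout, the attribute names and the `drop_sensitive` helper are hypothetical and not part of the published framework. The approach also assumes the sensitive information lives only in its own chunk, which is exactly what disentanglement is meant to provide.

```python
# Assumed layout: attribute name -> (start, stop) positions in the latent vector.
LAYOUT = {"task_a": (0, 3), "sensitive": (3, 5), "task_b": (5, 8)}


def drop_sensitive(latent, layout=LAYOUT, sensitive=("sensitive",)):
    """Return a new list holding only the non-sensitive chunks, in layout order."""
    kept = []
    for name, (start, stop) in layout.items():
        if name not in sensitive:
            kept.extend(latent[start:stop])
    return kept


latent = [0.1, 0.2, 0.3, 9.0, 9.0, 0.4, 0.5, 0.6]
cleaned = drop_sensitive(latent)
print(cleaned)  # [0.1, 0.2, 0.3, 0.4, 0.5, 0.6]
```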

In the field of medicine, it could help doctors and biologists discover more useful drugs by disentangling the medicine function from other properties, and then recombining them to synthesize new medicine. Imbuing machines with imagination could also help create safer AI by, for instance, allowing autonomous vehicles to imagine and avoid dangerous scenarios previously unseen during training.

"Deep learning has already demonstrated unsurpassed performance and promise in many domains, but all too often this has happened through shallow mimicry, and without a deeper understanding of the separate attributes that make each object unique," said Itti. "This new disentanglement approach, for the first time, truly unleashes a new sense of imagination in A.I. systems, bringing them closer to humans' understanding of the world."

More information: Yunhao Ge et al, Zero-shot Synthesis with Group-Supervised Learning. openreview.net/forum?id=8wqCDnBmnrT

Provided by University of Southern California

Citation: Enabling the 'imagination' of artificial intelligence (2021, July 14) retrieved 4 July 2024 from https://techxplore.com/news/2021-07-enabling-artificial-intelligence.html

