June 1, 2021

Researchers fine-tune control over AI image generation

by Matt Shipman, North Carolina State University

Researchers from North Carolina State University have developed a new state-of-the-art method for controlling how artificial intelligence (AI) systems create images. The work has applications for fields from autonomous robotics to AI training.

At issue is a type of AI task called conditional image generation, in which AI systems create images that meet a specific set of conditions. For example, a system could be trained to create original images of cats or dogs, depending on which animal the user requested. More recent techniques have built on this to incorporate conditions regarding an image layout. This allows users to specify which types of objects they want to appear in particular places on the screen. For example, the sky might go in one box, a tree might be in another box, a stream might be in a separate box, and so on.

The new work builds on those techniques to give users more control over the resulting images, and to retain certain characteristics across a series of images.

"Our approach is highly reconfigurable," says Tianfu Wu, co-author of a paper on the work and an assistant professor of computer engineering at NC State. "Like previous approaches, ours allows users to have the system generate an image based on a specific set of conditions. But ours also allows you to retain that image and add to it. For example, users could have the AI create a mountain scene. The users could then have the system add skiers to that scene."

In addition, the new approach allows users to have the AI manipulate specific elements so that they are identifiably the same, but have moved or changed in some way. For example, the AI might create a series of images showing skiers turn toward the viewer as they move across the landscape.

"One application for this would be to help autonomous robots 'imagine' what the end result might look like before they begin a given task," Wu says. "You could also use the system to generate images for AI training. So, instead of compiling images from external sources, you could use this system to create images for training other AI systems."

The researchers tested their new approach using the COCO-Stuff dataset and the Visual Genome dataset. Based on standard measures of image quality, the new approach outperformed the previous state-of-the-art image creation techniques.

"Our next step is to see if we can extend this work to video and three-dimensional images," Wu says.

Training for the new approach requires a fair amount of computational power; the researchers used a 4-GPU workstation. However, deploying the system is less computationally expensive.

"We found that one GPU gives you almost real-time speed," Wu says.

"In addition to our paper, we've made our source code for this approach available on GitHub. That said, we're always open to collaborating with industry partners."

More information: Wei Sun et al, Learning Layout and Style Reconfigurable GANs for Controllable Image Synthesis, IEEE Transactions on Pattern Analysis and Machine Intelligence (2021). DOI: 10.1109/TPAMI.2021.3078577

Journal information: IEEE Transactions on Pattern Analysis and Machine Intelligence

Provided by North Carolina State University

Citation: Researchers fine-tune control over AI image generation (2021, June 1) retrieved 24 April 2024 from https://techxplore.com/news/2021-06-fine-tune-ai-image.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New machine-learning approach brings digital photos back to life

10 shares

Feedback to editors

With a game show as his guide, researcher uses AI to predict deception

7 hours ago

Super Mario hackers' tricks could protect software from bugs, study finds

8 hours ago

The world's largest 3D printer is at a university in Maine. It just unveiled an even bigger one

10 hours ago

Researchers develop tiny chip that can safeguard user data while enabling efficient computing on a smartphone

12 hours ago

Personalization has the potential to democratize who decides how LLMs behave

12 hours ago

Aerogel-based phase change materials improve thermal management, reduce microwave emissions in electronic devices

12 hours ago

Holographic displays offer a glimpse into an immersive future

12 hours ago

Researchers develop high-energy-density aqueous battery based on halogen multi-electron transfer

12 hours ago

Extracting high-purity gold from electrical and electronic waste

14 hours ago

How potatoes, corn and beans led to breakthrough in smart windows technology

14 hours ago

Load comments (0)

Researchers fine-tune control over AI image generation

With a game show as his guide, researcher uses AI to predict deception

Super Mario hackers' tricks could protect software from bugs, study finds

The world's largest 3D printer is at a university in Maine. It just unveiled an even bigger one

Researchers develop tiny chip that can safeguard user data while enabling efficient computing on a smartphone

Personalization has the potential to democratize who decides how LLMs behave

Aerogel-based phase change materials improve thermal management, reduce microwave emissions in electronic devices

Holographic displays offer a glimpse into an immersive future

Researchers develop high-energy-density aqueous battery based on halogen multi-electron transfer

Extracting high-purity gold from electrical and electronic waste

How potatoes, corn and beans led to breakthrough in smart windows technology

New machine-learning approach brings digital photos back to life

New medical image fusion method draws on deep learning to improve patient outcomes

Discerning deep fakes digitally

New module for OpenAI GPT-3 creates unique images from text

Medical imaging: Displaying 3D images in 2D formats makes it easy to miss large targets in clinical settings

Training artificial intelligence to track greenhouses in Antarctica and Mars

A new framework to generate human motions from language prompts

Personalization has the potential to democratize who decides how LLMs behave

Holographic displays offer a glimpse into an immersive future

With a game show as his guide, researcher uses AI to predict deception

Neural networks can mediate between download size and quality, according to researcher

A coffee roastery in Finland has launched an AI-generated blend. The results were surprising

Phys.org

Medical Xpress

Science X

Researchers fine-tune control over AI image generation

With a game show as his guide, researcher uses AI to predict deception

Super Mario hackers' tricks could protect software from bugs, study finds

The world's largest 3D printer is at a university in Maine. It just unveiled an even bigger one

Researchers develop tiny chip that can safeguard user data while enabling efficient computing on a smartphone

Personalization has the potential to democratize who decides how LLMs behave

Aerogel-based phase change materials improve thermal management, reduce microwave emissions in electronic devices

Holographic displays offer a glimpse into an immersive future

Researchers develop high-energy-density aqueous battery based on halogen multi-electron transfer

Extracting high-purity gold from electrical and electronic waste

How potatoes, corn and beans led to breakthrough in smart windows technology

Related Stories

New machine-learning approach brings digital photos back to life

New medical image fusion method draws on deep learning to improve patient outcomes

Discerning deep fakes digitally

New module for OpenAI GPT-3 creates unique images from text

Medical imaging: Displaying 3D images in 2D formats makes it easy to miss large targets in clinical settings

Training artificial intelligence to track greenhouses in Antarctica and Mars

Recommended for you

A new framework to generate human motions from language prompts

Personalization has the potential to democratize who decides how LLMs behave

Holographic displays offer a glimpse into an immersive future

With a game show as his guide, researcher uses AI to predict deception

Neural networks can mediate between download size and quality, according to researcher

A coffee roastery in Finland has launched an AI-generated blend. The results were surprising

Your Privacy