August 19, 2017 weblog

Researchers explore photographic images synthesized from semantic layouts

by Nancy Owano, Tech Xplore

How far can we go in achieving fictional scenes just by using real photos? More precisely, what can we do with deep learning in rendering video games? Those questions are the focus of research work by Qifeng Chen and Vladlen Koltun.

Their work attracted interest this month from New Scientist and other sites, which explored their approach. "It's paint by numbers for creating dreamy worlds," said Engadget.

Indeed, a video's notes said this was a paint-by-numbers approach to creating a new image, and that it starts with a labeled layout. Sections are labeled as trees or cars, for example. The center might be labeled road.
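To make the idea of a labeled layout concrete, here is a minimal sketch in Python. The class names, the tiny 4x6 grid and the `one_hot` helper are all assumptions made for illustration; they are not taken from the researchers' code. The sketch shows a layout as a grid of labels and turns it into one binary mask per class, a common way to present such a layout to a neural network.

```python
# Hypothetical class names and a tiny 4x6 layout.
# A real layout would carry one label per pixel of the target image.
CLASSES = ["sky", "tree", "car", "road"]

layout = [
    ["sky",  "sky",  "sky",  "sky",  "sky",  "sky"],
    ["tree", "tree", "sky",  "sky",  "tree", "tree"],
    ["tree", "car",  "road", "road", "car",  "tree"],
    ["road", "road", "road", "road", "road", "road"],
]

def one_hot(layout, classes):
    """Return one binary mask per class: masks[c][y][x] is 1 where the layout has class c."""
    return {
        name: [[1 if cell == name else 0 for cell in row] for row in layout]
        for name in classes
    }

masks = one_hot(layout, CLASSES)
```

Each position in the grid belongs to exactly one class, so the masks sum to 1 at every position.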

Luke Dormehl in Digital Trends described their work as having "artificial intelligence that can create photorealistic Google Street View-style images of fake street scenes."

The key ingredient is artificial intelligence. Matt Reynolds in New Scientist said the AI from Qifeng Chen of Stanford and Intel "works from rough layouts that tell it what should be in each part of the image." The AI uses this layout as a guide to generate a completely new image.

The AI was trained on 3000 images of German streets, Reynolds said.

Digital Trends discussed their use of a "cascaded refinement network, a type of neural network designed to synthesize HD images with a consistent structure. Like a regular neural network, a cascaded refinement network features multiple layers, which it uses to generate features one layer at a time."
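One way to picture a cascade that refines an image stage by stage is a coarse-to-fine schedule, sketched below in Python. This is only an illustration and not the authors' implementation: the starting size, the doubling between stages, and the helper names `resolution_schedule` and `upsample2x` are assumptions, and the learned layers that a real network would apply at each stage are left out.

```python
def resolution_schedule(base, target):
    """List the (height, width) of each stage, doubling from base until target is reached."""
    h, w = base
    stages = [(h, w)]
    while (h, w) != target:
        h, w = h * 2, w * 2
        if h > target[0] or w > target[1]:
            raise ValueError("target must be base times a power of two")
        stages.append((h, w))
    return stages

def upsample2x(grid):
    """Nearest-neighbor upsampling: each cell becomes a 2x2 block."""
    out = []
    for row in grid:
        wide = [v for v in row for _ in range(2)]
        out.append(wide)
        out.append(list(wide))
    return out

# Assumed sizes for the sketch: start at 4x8 and stop at 32x64.
stages = resolution_schedule(base=(4, 8), target=(32, 64))

# Placeholder for the coarsest stage's output.
features = [[0.0] * 8 for _ in range(4)]
for _ in stages[1:]:
    # A real stage would also read the layout at this resolution
    # and apply learned layers; here we only grow the grid.
    features = upsample2x(features)
```

With these assumed sizes the schedule has four stages, and the feature grid ends at 32 rows by 64 columns.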

With some human help, it can build slightly blurry made-up scenes, said Roberto Baldwin, senior editor, Engadget. "To create an image a human needs to tell the AI system what goes where. Put a car here, put a building there, place a tree right there. It's paint by numbers and the system generates a wholly unique scene based on that input."

So fundamentally, Reynolds said, you are getting a fictional street that "was generated by an imaginative neural network, stitching together its memories of real streets it was trained on."

"Chen's AI isn't quite good enough to create photorealistic scenes just yet," said Baldwin. However, it could be used to create video game and VR worlds "where not everything needs to look perfect in the near future."

Its creators think it could eventually be used for creating photorealistic video game worlds.

What's next? The researchers detailed their work in "Photographic Image Synthesis with Cascaded Refinement Networks," by Chen and Koltun, which is on arXiv.

They described their approach as synthesizing photographic images conditioned on semantic layouts. Given an "input layout," the approach functions as a rendering engine that produces a corresponding photographic image.

The authors pointed to what was special about their work. "We show that photographic images can be synthesized from semantic layouts by a single feedforward network with appropriate structure, trained end-to-end with a direct regression objective."
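A "direct regression objective" means the network's output is compared directly with a reference photo and the difference is minimized. The Python sketch below illustrates the idea with a pixel-wise mean absolute error. That particular distance, the function name `l1_regression_loss` and the toy 2x2 "images" are assumptions for illustration; the article does not say which distance the authors use.

```python
def l1_regression_loss(predicted, reference):
    """Mean absolute difference between two images given as nested lists of floats."""
    total, count = 0.0, 0
    for pred_row, ref_row in zip(predicted, reference):
        for p, r in zip(pred_row, ref_row):
            total += abs(p - r)
            count += 1
    return total / count

# Toy 2x2 grayscale "images" with values in [0, 1] (made-up numbers).
predicted = [[0.2, 0.4], [0.6, 0.8]]
reference = [[0.0, 0.5], [0.5, 1.0]]

loss = l1_regression_loss(predicted, reference)
```

Training would adjust the network so that this number shrinks across many layout and photo pairs.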

They stated in their paper that "Exciting work remains to be done to achieve perfect photorealism. If such level of realism is ever achieved, which we believe to be possible, alternative routes for image synthesis in computer graphics will open up."

More information: Photographic Image Synthesis with Cascaded Refinement Networks, arxiv.org/abs/1707.09405

Citation: Researchers explore photographic images synthesized from semantic layouts (2017, August 19) retrieved 19 April 2024 from https://techxplore.com/news/2017-08-explore-images-semantic-layouts.html
