August 12, 2021

Teaching AI to see depth in photographs and paintings

Researchers in SFU's Computational Photography Lab hope to give computers a visual advantage that we humans take for granted—the ability to see depth in photographs. While humans naturally can determine how close or far objects are from a single point of view, like a photograph or a painting, it's a challenge for computers—but one they may soon overcome.

Researchers recently published their work improving a process called monocular depth estimation, a technique that teaches computers how to see depth using machine learning.

"When we look at a picture, we can tell the relative distance of objects by looking at their size, position, and relation to each other," says Mahdi Miangoleh, an MSc student working in the lab. "This requires recognizing the objects in a scene and knowing what size the objects are in real life. This task alone is an active research topic for neural networks."

Despite progress in recent years, existing efforts to provide high resolution results that can transform an image into a 3-dimensional (3D) space have failed.

To counter this, the lab recognized the untapped potential of existing neural network models in the literature. The proposed research explains the lack of high-resolution results in current methods through the limitations of convolutional neural networks. Despite major advancements in recent years, the neural networks still have a relatively small capacity to generate many details at once.

Another limitation is how much of the scene these networks can 'look at' at once, which determines how much information the neural network can make use of to understand complex scenes. Bu working to increase the resolution of their visual estimations, the researchers are now making it possible to create detailed 3D renderings that look realistic to a human eye. These so-called "depth maps" are used to create 3D renderings of scenes and simulate camera motion in computer graphics.

"Our method analyzes an image and optimizes the process by looking at the image content according to the limitations of current architectures," explains Ph.D. student Sebastian Dille. "We give our input image to our neural network in many different forms, to create as many details as the model allows while preserving a realistic geometry."

The team also published a friendly explainer for the theory behind the method, which is available on YouTube.

"With the high-resolution depth maps that the team is able to develop for real-world photographs, artists and content creators can now immediately transfer their photograph or artwork into a rich 3D world," says computing science professor and lab director, Yağız Aksoy, whose team collaborated with researchers Sylvain Paris and Long Mai, from Adobe Research.

Tools enable artists to turn 2D art into 3D worlds

Global artists are already utilizing the applications enabled by Aksoy's lab's research. Akira Saito, a visual artist based in Japan, is creating videos that take viewers into fantastic 3D worlds dreamed up in 2D artwork. To do this he combines tools such as Houdini, a computer animation software, with the depth map generated by Aksoy and his team.

Creative content creators on TikTok are using the research to express themselves in new ways.

"It's a great pleasure to see independent artists make use of our technology in their own way," says Aksoy, whose lab has plans to extend this work to videos and develop new tools that will make depth maps more useful for artists.

"We have made great leaps in computer vision and computer graphics in recent years, but the adoption of these new AI technologies by the artist community needs to be an organic process, and that takes time."

More information: S. Mahdi et al, Boosting Monocular Depth Estimation Models to High-Resolution via Content-Adaptive Multi-Resolution Merging, Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2021): openaccess.thecvf.com/content/ … CVPR_2021_paper.html

Project Github: yaksoy.github.io/highresdepth/

Provided by Simon Fraser University

Citation: Teaching AI to see depth in photographs and paintings (2021, August 12) retrieved 16 August 2024 from https://techxplore.com/news/2021-08-ai-depth.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Virtual reality becomes more real

1236 shares

Feedback to editors

Engineers design tiny batteries for powering cell-sized robots

11 hours ago

Leaf-like solar concentrators promise major boost in solar efficiency

12 hours ago

Why does AI beat humans at the strategy game Diplomacy?

13 hours ago

New technique prints metal oxide thin film circuits at room temperature

14 hours ago

Studies highlight challenges and solutions in making large language models trustworthy

15 hours ago

Finding security flaws in Android ahead of malicious hackers

15 hours ago

Robot planning tool accounts for human carelessness

16 hours ago

From shrimp to steel: Introducing nature-inspired metalworking

16 hours ago

'AI Scientist' model designed to conduct scientific research autonomously

17 hours ago

Global AI adoption is outpacing risk understanding, researchers warn

17 hours ago

Load comments (0)

Teaching AI to see depth in photographs and paintings

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Virtual reality becomes more real

Researchers step back to mannequin viral wave to explore depth

New machine-learning approach brings digital photos back to life

Neural algorithm gives photo masterpiece-style treatments

Do deep networks 'see' as well as humans?

Generation query network lets computer create multi-view 3-D model from 2-D photographs

A two-stage framework to improve LLM-based anomaly detection and reactive planning

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Why does AI beat humans at the strategy game Diplomacy?

Studies highlight challenges and solutions in making large language models trustworthy

Phys.org

Medical Xpress

Science X

Teaching AI to see depth in photographs and paintings

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Related Stories

Virtual reality becomes more real

Researchers step back to mannequin viral wave to explore depth

New machine-learning approach brings digital photos back to life

Neural algorithm gives photo masterpiece-style treatments

Do deep networks 'see' as well as humans?

Generation query network lets computer create multi-view 3-D model from 2-D photographs

Recommended for you

A two-stage framework to improve LLM-based anomaly detection and reactive planning

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Why does AI beat humans at the strategy game Diplomacy?

Studies highlight challenges and solutions in making large language models trustworthy

Your Privacy