August 6, 2021

Doubling the performance of visual recognition AI

by DGIST (Daegu Gyeongbuk Institute of Science and Technology)

Prof. Sunghoon Im, from the Department of Information & Communication Engineering, DGIST, developed an artificial intelligence(AI) neural network module that can separate and convert environmental information in the form of complex images using deep learning. The developed network is expected to significantly contribute to the future advancement in the field of AI, including image conversion and domain adaptation.

Recently, deep learning, the basis of AI technology, has been increasingly advanced, and accordingly, deep learning research on image creation and conversion has been actively conducted. Conventional studies have focused on finding image information that is common in a domain, which is a set of images with multiple similar features. Thus, image information could not be properly used, limiting the performance of applicable data and models. Another limitation is that, because the image information used has a linearly simple structure, only one converted image can be obtained.

Professor Im's research team hypothesized that the structure of image information may vary depending on the domain, and the structure may not always be simple, such as a linear structure. The research team designed a separator that could clearly divide image information into overall form information and style information. Based on this, they used a different weight for each domain to reflect the difference between the domains. Furthermore, they successfully developed a neural network structure to determine appropriate style information for each image composition using the correlation between the separated pieces of image information.

The developed neural network exhibits the advantage that image conversions can be easily performed for many domains, even with just one model. When the developed domain adaptation algorithm was applied to a visual recognition problem, the accuracy increased by more than double.

Prof. Im says that "In this study, a neural network that incorporates a new analysis for image information was developed, and we expect that if the relevant technology is improved a little more in the future, it can be applied to several fields, positively impacting the development of AI."

Seunghoon Lee, a degree-linked course student majoring in Information and Communication Engineering, participated in this research as the first author. Furthermore, the paper was published in the IEEE Conference on Computer Vision and Pattern Recognition, a leading international journal in the AI field, and released online on Friday, June 25.

More information: Seunghun Lee et al, DRANet: Disentangling Representation and Adaptation Networks for Unsupervised Cross-Domain Adaptation, IEEE Conference on Computer Vision and Pattern Recognition (2021). arXiv:2103.13447 [cs.CV], arxiv.org/abs/2103.13447

Provided by DGIST (Daegu Gyeongbuk Institute of Science and Technology)

Citation: Doubling the performance of visual recognition AI (2021, August 6) retrieved 26 April 2024 from https://techxplore.com/news/2021-08-visual-recognition-ai.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Deep learning improves image reconstruction in optical coherence tomography using less data

206 shares

Feedback to editors

Proof of concept study shows path to easier recycling of solar modules

3 hours ago

New circuit boards can be repeatedly recycled

4 hours ago

Researchers develop an automated benchmark for language-based task planners

4 hours ago

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

5 hours ago

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

5 hours ago

Researchers outline path forward for tandem solar cells

6 hours ago

Researcher develop high-performance amorphous p-type oxide semiconductor

7 hours ago

Scientists create new atomic clock that is both ultra-precise and sturdy

7 hours ago

A framework to compare lithium battery testing data and results during operation

10 hours ago

New approach could make reusing captured carbon far cheaper, less energy-intensive

14 hours ago

Load comments (0)

Doubling the performance of visual recognition AI

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

Deep learning improves image reconstruction in optical coherence tomography using less data

New medical image fusion method draws on deep learning to improve patient outcomes

Filling the gaps using image inpainting

Discerning deep fakes digitally

Scientist develops an image recognition algorithm that works 40% faster than analogs

Clearer and better focused SEM images

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Phys.org

Medical Xpress

Science X

Doubling the performance of visual recognition AI

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

Related Stories

Deep learning improves image reconstruction in optical coherence tomography using less data

New medical image fusion method draws on deep learning to improve patient outcomes

Filling the gaps using image inpainting

Discerning deep fakes digitally

Scientist develops an image recognition algorithm that works 40% faster than analogs

Clearer and better focused SEM images

Recommended for you

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Your Privacy