June 26, 2023

New AI method for graphing scenes from images

by K. W. Wesselink, University of Twente

New AI-model with better understanding of images — Credit: University of Twente

Generative AI programs can generate images from textual prompts. These models work best when they generate images of single objects. Creating complete scenes is still difficult. Michael Ying Yang, a UT-researcher from the faculty of ITC recently developed a novel method that can graph scenes from images that can serve as a blueprint for generating realistic and coherent images. He and his team recently published their findings in the journal IEEE Transactions on Pattern Analysis and Machine Intelligence.

Humans are excellent at defining relationships between objects. "We can see that a chair is standing on the floor and a dog is walking on the street. AI models find this difficult," explains Yang, assistant professor at the Scene Understanding Group of the Faculty of Geo-Information Science and Earth Observation (ITC). Improving a computer's ability to detect and understand visual relationships is needed for image generation, but could also assist the perception of autonomous vehicles and robots.

From two-stage to single-stage

Currently, methods exist to graph a semantic understanding of an image, but they are slow. These methods use a two-stage approach. At first, they map all objects in a scene. In the second step, some specific neural network goes through all different possible connections and then labels them with the correct relationship.

The number of connections this method has to go through increases exponentially with the number of objects. "Our model takes just a single step. It automatically predicts subjects, objects and their relationships at the same time," says Yang.

Detecting relationships

For this one-stage method, the model looks at the visual features of the objects in the scene and focuses on the most relevant details for determining the relationships. It highlights important areas where objects interact or relate to each other. These techniques and relatively little training data are enough to identify the most important relationships between different objects. The only thing left to do is to generate a description of how they are connected.

"The model detects that in an example picture, the man is very likely to interact with the baseball bat. It's then trained to describe the most likely relationship: 'man-swings-baseball bat,'" says Yang.

More information: Yuren Cong et al, RelTR: Relation Transformer for Scene Graph Generation, IEEE Transactions on Pattern Analysis and Machine Intelligence (2023). DOI: 10.1109/TPAMI.2023.3268066

Journal information: IEEE Transactions on Pattern Analysis and Machine Intelligence

Provided by University of Twente

Citation: New AI method for graphing scenes from images (2023, June 26) retrieved 17 July 2024 from https://techxplore.com/news/2023-06-ai-method-graphing-scenes-images.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Machine-learning model could enable robots to understand interactions in the way humans do

53 shares

Feedback to editors

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

14 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

16 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

18 hours ago

Large language models make human-like reasoning mistakes, researchers find

18 hours ago

Unveiling a new class of synthetic fuels

19 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

19 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

20 hours ago

New system enables intuitive teleoperation of a robotic manipulator in real-time

22 hours ago

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

Jul 16, 2024

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Jul 15, 2024

Load comments (0)

New AI method for graphing scenes from images

From two-stage to single-stage

Detecting relationships

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Machine-learning model could enable robots to understand interactions in the way humans do

Artificial intelligence that understands object relationships

Researchers use AI to identify similar materials in images

New method improves efficiency of vision transformer AI systems

Researchers detect and classify multiple objects without images

Gaining a deeper understanding of how we connect

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Phys.org

Medical Xpress

Science X

New AI method for graphing scenes from images

From two-stage to single-stage

Detecting relationships

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Related Stories

Machine-learning model could enable robots to understand interactions in the way humans do

Artificial intelligence that understands object relationships

Researchers use AI to identify similar materials in images

New method improves efficiency of vision transformer AI systems

Researchers detect and classify multiple objects without images

Gaining a deeper understanding of how we connect

Recommended for you

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Your Privacy