May 8, 2019 feature

A multi-scale body-part mask guided attention network for person re-identification

by Ingrid Fadelli, Tech Xplore

Person re-identification entails the automated identification of the same person in multiple images from different cameras and with different backgrounds, angles or positions. Despite recent advances in the field of artificial intelligence (AI), person re-identification remains a highly challenging task, particularly due to the many variations in a person's pose, as well as other differences associated with lighting, occlusion, misalignment and background clutter.

Researchers at the Suning R&D Center in the U.S. have recently developed a new technique for person re-identification based on a multi-scale body-part mask guided attention network (MMGA). Their paper, pre-published on arXiv, will be presented as a spotlight at a CVPR 2019 workshop in June.

"Person re-identification is becoming a more and more important task due to its wide range of potential applications, such as criminal investigation, public security and image retrieval," Honglong Cai, one of the researchers who carried out the study, told TechXplore. "However, it remains a challenging task, due to occlusion, misalignment, variation of poses and background clutter. In our recent study, our team tried to develop a method to overcome these challenges."

Instead of focusing on entire images, Cai and his colleagues developed a model for person re-identification that only pays attention to the person of interest, ignoring the background. Taking this idea one step further, their model analyses different body parts of the person in a given image.

"To implement our idea, we creatively proposed a multi-scale body-part mask guided attention network," Cai said. "We apply body masks to guide the training of our model so that it can pay more attention to the human body in the image. Our model contains two parts: a feature extractor and an attention module."

The feature extractor component of the model devised by Cai and his colleagues extracts discriminative features of people's bodies from images. The model's attention module, on the other hand, guides the MMGA network by highlighting the areas of the image (i.e., pixels) that it should pay closer attention to.
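To make the idea of an attention module concrete, here is a minimal sketch in plain Python. It is not the researchers' implementation: it assumes the attention module outputs one raw score per pixel, squashes each score into the range 0 to 1 with a sigmoid, and multiplies every feature channel by the resulting map. The function name `apply_spatial_attention` and the toy values are hypothetical.

```python
import math


def sigmoid(x):
    """Squash a raw score into the range (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def apply_spatial_attention(features, attention_logits):
    """Weight each spatial location of a feature map by an attention score.

    features: list of channels, each an H x W grid (list of rows).
    attention_logits: one H x W grid of raw scores (assumed module output).
    Returns the re-weighted features and the attention map itself.
    """
    attention = [[sigmoid(v) for v in row] for row in attention_logits]
    weighted = [
        [
            [f * a for f, a in zip(f_row, a_row)]
            for f_row, a_row in zip(channel, attention)
        ]
        for channel in features
    ]
    return weighted, attention


# Toy 2-channel, 2 x 2 feature map and a hypothetical set of attention scores.
features = [
    [[1.0, 2.0], [3.0, 4.0]],
    [[0.5, 0.5], [0.5, 0.5]],
]
logits = [[-10.0, 10.0], [0.0, 10.0]]

weighted, attention = apply_spatial_attention(features, logits)
```

In this toy example, the top-left pixel receives a score close to zero, so its features are almost entirely suppressed, while pixels with high scores pass through nearly unchanged.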

The researchers used body masks to guide the training of their model's attention module, as this allows it to distinguish human bodies from background information. In addition, they split the body masks into upper-body and lower-body masks, so that the attention module can learn to distinguish between the upper and lower parts of a person's body.

"Differently from most current person re-identification methods, which split images into fixed slides, our model can tell exactly where the upper body and lower body are," Cai explained. "Moreover, body masks are only used in the training phase, and we don't require body masks in the inference phase, which makes our model very efficient in practical applications."
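The training scheme described above can be sketched roughly as follows. This is an illustration under stated assumptions, not the loss used in the paper: it assumes the guidance signal is a mean squared error between each predicted attention map and the matching binary body mask, added to an ordinary re-identification loss with a hypothetical weight. The names `mask_guidance_loss`, `training_loss` and `weight` are invented for this example.

```python
def mask_guidance_loss(attention, mask):
    """Mean squared error between an attention map and a binary body mask.

    MSE is an assumption made for this sketch; the article does not say
    which loss the researchers used.
    """
    total, count = 0.0, 0
    for a_row, m_row in zip(attention, mask):
        for a, m in zip(a_row, m_row):
            total += (a - m) ** 2
            count += 1
    return total / count


def training_loss(reid_loss, attentions, masks, weight=1.0):
    """Add mask guidance for each body part to the re-identification loss.

    Masks appear only here, in the training objective. At inference time the
    network would produce its attention maps from the image alone.
    """
    guidance = sum(
        mask_guidance_loss(attentions[part], masks[part]) for part in masks
    )
    return reid_loss + weight * guidance


# Toy 2 x 2 masks: the top row is upper body, the bottom row is lower body.
masks = {
    "upper": [[1, 1], [0, 0]],
    "lower": [[0, 0], [1, 1]],
}
# Hypothetical predictions: a perfect upper-body map, an uninformative lower one.
attentions = {
    "upper": [[1.0, 1.0], [0.0, 0.0]],
    "lower": [[0.5, 0.5], [0.5, 0.5]],
}

loss = training_loss(reid_loss=2.0, attentions=attentions, masks=masks)
```

Because the masks enter only through the loss, nothing in the forward pass depends on them, which is consistent with Cai's point that masks are not needed once the model is deployed.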

To evaluate their model, Cai and his colleagues carried out a series of experiments testing its performance on two datasets, namely the Market-1501 and DukeMTMC-reID datasets. They found that their model can reduce the negative effects of variations in a person's pose, misalignment and background clutter, outperforming state-of-the-art re-identification methods.

The researchers' findings suggest that attention mechanisms can significantly improve the accuracy of person re-identification networks. Moreover, their study introduced a mask-guided attention training method that can further improve this accuracy.

"In our recent work, upper body masks and lower body masks are used to guide the training of the attention module," Cai said. "In the future, we would like to try dividing body masks into finer details such as head, hand, arm, leg, etc., as this could further improve the accuracy of person re-identification."

More information: Honglong Cai, Zhiguan Wang, Jinxing Cheng. Multi-scale body-part mask guided attention for person re-identification. arXiv:1904.11041 [cs.CV]. arxiv.org/abs/1904.11041

Citation: A multi-scale body-part mask guided attention network for person re-identification (2019, May 8) retrieved 26 April 2024 from https://techxplore.com/news/2019-05-multi-scale-body-part-mask-attention-network.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
