July 19, 2021

Scientists adopt deep learning for multi-object tracking

by GIST (Gwangju Institute of Science and Technology)

Computer vision has progressed much over the past decade and made its way into all sorts of relevant applications, both in academia and in our daily lives. There are, however, some tasks in this field that are still extremely difficult for computers to perform with acceptable accuracy and speed. One example is object tracking, which involves recognizing persistent objects in video footage and tracking their movements. While computers can simultaneously track more objects than humans, they usually fail to discriminate the appearance of different objects. This, in turn, can lead to the algorithm to mix up objects in a scene and ultimately produce incorrect tracking results.

At the Gwangju Institute of Science and Technology in Korea, a team of researchers led by Professor Moongu Jeon seeks to solve these issues by incorporating deep learning techniques into a multi-object tracking framework. In a recent study published in Information Sciences, they present a new tracking model based on a technique they call 'deep temporal appearance matching association (Deep-TAMA)' which promises innovative solutions to some of the most prevalent problems in multi-object tracking.

Conventional tracking approaches determine object trajectories by associating a bounding box to each detected object and establishing geometric constraints. The inherent difficulty in this approach is in accurately matching previously tracked objects with objects detected in the current frame. Differentiating detected objects based on hand-crafted features like color usually fails because of changes in lighting conditions and occlusions. Thus, the researchers focused on enabling the tracking model with the ability to accurately extract the known features of detected objects and compare them not only with those of other objects in the frame but also with a recorded history of known features. To this end, they combined joint-inference neural networks (JI-Nets) with long-short-term-memory networks (LSTMs).

LSTMs help to associate stored appearances with those in the current frame whereas JI-Nets allow for comparing the appearances of two detected objects simultaneously from scratch—one of the most unique aspects of this new approach. Using historical appearances in this way allowed the algorithm to overcome short-term occlusions of the tracked objects. "Compared to conventional methods that pre-extract features from each object independently, the proposed joint-inference method exhibited better accuracy in public surveillance tasks, namely pedestrian tracking," highlights Dr. Jeon. Moreover, the researchers also offset a main drawback of deep learning—low speed—by adopting indexing-based GPU parallelization to reduce computing times. Tests on public surveillance datasets confirmed that the proposed tracking framework offers state-of-the-art accuracy and is therefore ready for deployment.

Multi-object tracking unlocks a plethora of applications ranging from autonomous driving to public surveillance, which can help combat crime and reduce the frequency of accidents. "We believe our methods can inspire other researchers to develop novel deep-learning-based approaches to ultimately improve public safety," concludes Dr. Jeon.

More information: Young-Chul Yoon et al, Online multiple pedestrians tracking using deep temporal appearance matching association, Information Sciences (2020). DOI: 10.1016/j.ins.2020.10.002

Provided by GIST (Gwangju Institute of Science and Technology)

Citation: Scientists adopt deep learning for multi-object tracking (2021, July 19) retrieved 16 April 2024 from https://techxplore.com/news/2021-07-scientists-deep-multi-object-tracking.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Accurate and efficient 3-D motion tracking using deep learning

13 shares

Feedback to editors

Taichi: A large-scale diffractive hybrid photonic AI chiplet

1 hour ago

New insight about the working principles of bipolar membranes could guide future fuel cell design

2 hours ago

Using sound waves for photonic machine learning: Study lays foundation for reconfigurable neuromorphic building blocks

5 hours ago

Samsung returns to top of the smartphone market: Industry tracker

6 hours ago

Safeguarding the future of online security with AI and metasurfaces

18 hours ago

Security vulnerability in browser interface allows computer access via graphics card

21 hours ago

AI's new power of persuasion: Study shows LLMs can exploit personal information to change your mind

21 hours ago

Research team manufactures the first universal, programmable and multifunctional photonic chip

22 hours ago

Researchers develop stretchable quantum dot display

22 hours ago

Mimicking fish to create the ideal deep-sea submersible

22 hours ago

Load comments (0)

Scientists adopt deep learning for multi-object tracking

Taichi: A large-scale diffractive hybrid photonic AI chiplet

New insight about the working principles of bipolar membranes could guide future fuel cell design

Using sound waves for photonic machine learning: Study lays foundation for reconfigurable neuromorphic building blocks

Samsung returns to top of the smartphone market: Industry tracker

Safeguarding the future of online security with AI and metasurfaces

Security vulnerability in browser interface allows computer access via graphics card

AI's new power of persuasion: Study shows LLMs can exploit personal information to change your mind

Research team manufactures the first universal, programmable and multifunctional photonic chip

Researchers develop stretchable quantum dot display

Mimicking fish to create the ideal deep-sea submersible

Accurate and efficient 3-D motion tracking using deep learning

Researchers develop fabric-friendly sensors

PoseRBPF: A new particle filter for 6D object pose tracking

Object classification through a single-pixel detector

Salient object detection makes computer vision smarter

A software platform for 'smart' video tracking

Taichi: A large-scale diffractive hybrid photonic AI chiplet

Using sound waves for photonic machine learning: Study lays foundation for reconfigurable neuromorphic building blocks

AI's new power of persuasion: Study shows LLMs can exploit personal information to change your mind

Engineers recreate Star Trek's Holodeck using ChatGPT and video game assets

New computer vision tool can count damaged buildings in crisis zones and accurately estimate bird flock sizes

Tiny AI-trained robots demonstrate remarkable soccer skills

Phys.org

Medical Xpress

Science X

Scientists adopt deep learning for multi-object tracking

Taichi: A large-scale diffractive hybrid photonic AI chiplet

New insight about the working principles of bipolar membranes could guide future fuel cell design

Using sound waves for photonic machine learning: Study lays foundation for reconfigurable neuromorphic building blocks

Samsung returns to top of the smartphone market: Industry tracker

Safeguarding the future of online security with AI and metasurfaces

Security vulnerability in browser interface allows computer access via graphics card

AI's new power of persuasion: Study shows LLMs can exploit personal information to change your mind

Research team manufactures the first universal, programmable and multifunctional photonic chip

Researchers develop stretchable quantum dot display

Mimicking fish to create the ideal deep-sea submersible

Related Stories

Accurate and efficient 3-D motion tracking using deep learning

Researchers develop fabric-friendly sensors

PoseRBPF: A new particle filter for 6D object pose tracking

Object classification through a single-pixel detector

Salient object detection makes computer vision smarter

A software platform for 'smart' video tracking

Recommended for you

Taichi: A large-scale diffractive hybrid photonic AI chiplet

Using sound waves for photonic machine learning: Study lays foundation for reconfigurable neuromorphic building blocks

AI's new power of persuasion: Study shows LLMs can exploit personal information to change your mind

Engineers recreate Star Trek's Holodeck using ChatGPT and video game assets

New computer vision tool can count damaged buildings in crisis zones and accurately estimate bird flock sizes

Tiny AI-trained robots demonstrate remarkable soccer skills

Your Privacy