This article has been reviewed according to Science X's editorial process and policies. Editors have highlighted the following attributes while ensuring the content's credibility:



trusted source


Visual active search tool combines deep reinforcement learning, traditional active search methods

Visual active search tool combines deep reinforcement learning, traditional active search methods
A new framework for visual active search developed by McKelvey engineers leverages reinforcement learning to improve search performance. An adaptive search approach initially makes a mistake but then quickly adapts by learning from its mistakes. Credit: Anindya Sarkar

In geospatial exploration, the quest for efficient identification of regions of interest has recently taken a leap forward with visual active search (VAS). This modeling framework uses visual cues to guide exploration with potential applications that range from wildlife poaching detection to search-and-rescue missions to the identification of illegal trafficking activities.

A new approach to VAS developed at the McKelvey School of Engineering at Washington University in St. Louis combines deep reinforcement learning, where a computer can learn to make better decisions through trial and error, with traditional active search, where human searchers go out and verify what's in a selected region.

The team that developed the novel VAS includes Yevgeniy Vorobeychik and Nathan Jacobs, professors of computer science and engineering, and Anindya Sarkar, a doctoral student in Vorobeychik's lab. The team presented its work Dec. 13 at the Neural Information Processing Systems conference in New Orleans.

"VAS improves on traditional active search more or less depending on search task," Jacobs said. "If a task is relatively easy, then improvements are modest. But if an object is very rare—for example, an that we want to locate for purposes of wildlife conservation—then improvements offered by VAS are substantial. Notably, this isn't about finding things faster. It's about finding as many things as possible given , especially limited human resources."

The team's VAS framework improves on previous methods by breaking down the search into two distinct modules. The framework first uses a prediction to produce regions of interest based on geospatial image data and search history. Then a search module takes the resulting prediction map as an input and outputs a search plan. Each module can be updated as human explorers return results from physical searches in real time.

"Instead of an end-to-end search policy, decomposing into two modules allows us to be much more adaptable," Sarkar said. "During the actual search, we can update our prediction module with the search results. Then the search module can learn the dynamics of the prediction module—how it's changing across search steps—and adapt. In this meta-learning strategy, the search module is basically learning how to search. It's also human interpretable, so if the model isn't working properly, the user can check it and debug as needed."

The major strength of the framework comes from its ongoing incorporation of two levels of deployment: the predicts where to search, then humans go out into the world to conduct the search. The human component is vastly more expensive in terms of time and other resources required to explore large geospatial areas, so it makes sense to adapt and optimize the computer-generated search plan for a maximally efficient search.

Adaptability in the computer model is especially important when the object sought varies drastically from the objects the model is trained on. Experimental results showed marked improvement by Sarkar, Vorobeychik and Jacobs' proposed VAS framework over existing methods in various visual active tasks.

The findings are published on the arXiv preprint server.

More information: Anindya Sarkar et al, A Partially Supervised Reinforcement Learning Framework for Visual Active Search, arXiv (2023). DOI: 10.48550/arxiv.2310.09689

Journal information: arXiv
Citation: Visual active search tool combines deep reinforcement learning, traditional active search methods (2023, December 14) retrieved 20 April 2024 from
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Using large language models to enable open-world, interactive and personalized robot navigation


Feedback to editors