December 14, 2023

Visual active search tool combines deep reinforcement learning, traditional active search methods

by Shawn Ballard, Washington University in St. Louis

In geospatial exploration, the quest for efficient identification of regions of interest has recently taken a leap forward with visual active search (VAS). This modeling framework uses visual cues to guide exploration with potential applications that range from wildlife poaching detection to search-and-rescue missions to the identification of illegal trafficking activities.

A new approach to VAS developed at the McKelvey School of Engineering at Washington University in St. Louis combines deep reinforcement learning, where a computer can learn to make better decisions through trial and error, with traditional active search, where human searchers go out and verify what's in a selected region.

The team that developed the novel VAS framework includes Yevgeniy Vorobeychik and Nathan Jacobs, professors of computer science and engineering, and Anindya Sarkar, a doctoral student in Vorobeychik's lab. The team presented its work Dec. 13 at the Neural Information Processing Systems conference in New Orleans.

"VAS improves on traditional active search more or less depending on search task," Jacobs said. "If a task is relatively easy, then improvements are modest. But if an object is very rare—for example, an endangered species that we want to locate for purposes of wildlife conservation—then improvements offered by VAS are substantial. Notably, this isn't about finding things faster. It's about finding as many things as possible given limited resources, especially limited human resources."

The team's VAS framework improves on previous methods by breaking down the search into two distinct modules. The framework first uses a prediction module to produce regions of interest based on geospatial image data and search history. Then a search module takes the resulting prediction map as an input and outputs a search plan. Each module can be updated as human explorers return results from physical searches in real time.

"Instead of an end-to-end search policy, decomposing into two modules allows us to be much more adaptable," Sarkar said. "During the actual search, we can update our prediction module with the search results. Then the search module can learn the dynamics of the prediction module—how it's changing across search steps—and adapt. In this meta-learning strategy, the search module is basically learning how to search. It's also human interpretable, so if the model isn't working properly, the user can check it and debug as needed."

The major strength of the framework comes from its ongoing incorporation of two levels of deployment: the computational model predicts where to search, then humans go out into the world to conduct the search. The human component is vastly more expensive in terms of time and other resources required to explore large geospatial areas, so it makes sense to adapt and optimize the computer-generated search plan for a maximally efficient search.

Adaptability in the computer model is especially important when the object sought varies drastically from the objects the model is trained on. Experimental results showed marked improvement by Sarkar, Vorobeychik and Jacobs' proposed VAS framework over existing methods in various visual active search tasks.

The findings are published on the arXiv preprint server.

More information: Anindya Sarkar et al, A Partially Supervised Reinforcement Learning Framework for Visual Active Search, arXiv (2023). DOI: 10.48550/arxiv.2310.09689

Journal information: arXiv

Provided by Washington University in St. Louis

Citation: Visual active search tool combines deep reinforcement learning, traditional active search methods (2023, December 14) retrieved 17 July 2024 from https://techxplore.com/news/2023-12-visual-tool-combines-deep-traditional.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Using large language models to enable open-world, interactive and personalized robot navigation

36 shares

Feedback to editors

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

15 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

17 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

19 hours ago

Large language models make human-like reasoning mistakes, researchers find

20 hours ago

Unveiling a new class of synthetic fuels

20 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

20 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

21 hours ago

New system enables intuitive teleoperation of a robotic manipulator in real-time

Jul 16, 2024

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

Jul 16, 2024

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Jul 15, 2024

Load comments (0)

Visual active search tool combines deep reinforcement learning, traditional active search methods

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Using large language models to enable open-world, interactive and personalized robot navigation

Need a doctor? Google can now help you find places that accept your health insurance

New machine-learning approach facilitates molecular conformer search in complex molecules

Distractor objects can help the visual system develop more effective search strategies

Computer scientists create new search systems to limit COVID-19 misinformation

Sketch-based object detection tool using machine learning could boost tumor detection

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Phys.org

Medical Xpress

Science X

Visual active search tool combines deep reinforcement learning, traditional active search methods

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Related Stories

Using large language models to enable open-world, interactive and personalized robot navigation

Need a doctor? Google can now help you find places that accept your health insurance

New machine-learning approach facilitates molecular conformer search in complex molecules

Distractor objects can help the visual system develop more effective search strategies

Computer scientists create new search systems to limit COVID-19 misinformation

Sketch-based object detection tool using machine learning could boost tumor detection

Recommended for you

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Your Privacy