November 22, 2023

Engineers develop framework to predict types of sounds likely to be heard at certain locations

by Shawn Ballard, Washington University in St. Louis

Mapping soundscapes everywhere — A soundscape map (left) for the text prompt, "This is a sound of sea waves," and the region's corresponding overhead image. Green indicates areas where the sound is more probable; white indicates less probable. Credit: Jacobs lab

Imagine yourself on a beautiful beach. You're likely visualizing sand and sea but also hearing a symphony of wind gusting, waves crashing and gulls cawing. In this scene—as well as in urban settings with neighbors talking, dogs barking and traffic whooshing—sounds are critical components of the overall feel of a place.

Indeed, sound is one of the fundamental senses that helps humans understand their environments, and environmental sound conditions have been shown to have a strong correlation with a person's mental and physical health. Reliable methods for understanding the soundscape of a given geographic area are therefore valuable for applications ranging from collective policymaking around urban planning and noise management to individual decisions about where to buy a home or establish a business.

Nathan Jacobs, a professor of computer science and engineering, along with graduate students Subash Khanal, Srikumar Sastry and Aayush Dhakal, all studying computer science and engineering, at the McKelvey School of Engineering at Washington University in St. Louis, developed Geography-Aware Contrastive Language Audio Pre-training (GeoCLAP), a novel framework for soundscape mapping that can be applied anywhere in the world.

They presented their work on Nov. 22 at the British Machine Vision Conference in Aberdeen, United Kingdom. The paper is also posted to the arXiv preprint server.

The team's key innovation comes from their use of three different modalities, or types of data, in their framework, which incorporates geotagged audio, textual description and overhead images. Unlike previous methods for soundscape mapping that focused on only two modalities, GeoCLAP's richer understanding allows users to create probable soundscapes from either textual or audio queries for any geographic location.

"We've developed a simple and scalable way of creating a soundscape map for any geographic area," Jacobs said. "Our approach overcomes the limitations of previous soundscape mapping methods that were rule-based, often missing important sound sources, or relied on direct human observations, which are difficult to obtain in sufficient quantities away from popular tourist destinations.

"By leveraging the intrinsic relationship between sound and localized visual cues, our multimodal tool and freely available overhead imagery makes it possible for us to create soundscape maps for any area in the world."

More information: Subash Khanal et al, Learning Tri-modal Embeddings for Zero-Shot Soundscape Mapping, arXiv (2023). DOI: 10.48550/arxiv.2309.10667

Journal information: arXiv

Provided by Washington University in St. Louis

Citation: Engineers develop framework to predict types of sounds likely to be heard at certain locations (2023, November 22) retrieved 30 June 2024 from https://techxplore.com/news/2023-11-framework-heard.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Measuring the changing soundscape in Glacier National Park

46 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

Engineers develop framework to predict types of sounds likely to be heard at certain locations

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Measuring the changing soundscape in Glacier National Park

Designing workplaces with sound disturbances in mind

To find out how wildlife is doing, scientists try listening

Improving child development by monitoring noisy daycares

Comparing Western and Chinese classical music using deep learning algorithms

Personalized soundscape could help people with dementia with time, place recognition

Researchers develop the fastest possible flow algorithm

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Self-assembling, highly conductive sensors could improve wearable devices

Light-controlled artificial maple seeds could monitor the environment even in hard-to-reach locations

Sony introduces AI for single-instrument accompaniment generation in music production

Phys.org

Medical Xpress

Science X

Engineers develop framework to predict types of sounds likely to be heard at certain locations

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Measuring the changing soundscape in Glacier National Park

Designing workplaces with sound disturbances in mind

To find out how wildlife is doing, scientists try listening

Improving child development by monitoring noisy daycares

Comparing Western and Chinese classical music using deep learning algorithms

Personalized soundscape could help people with dementia with time, place recognition

Recommended for you

Researchers develop the fastest possible flow algorithm

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Self-assembling, highly conductive sensors could improve wearable devices

Light-controlled artificial maple seeds could monitor the environment even in hard-to-reach locations

Sony introduces AI for single-instrument accompaniment generation in music production

Your Privacy