Supervised speech enhancement approach improves quality of voice communication

For voice communication, it is important to suppress background noise without introducing unnatural distortion. Deep learning-based speech enhancement approaches can effectively suppress background noise components.

However, in the noise-mismatched condition, unnatural residual noise is generated and it heavily influences speech comfortableness.

Recently, researchers from the Institute of Acoustics of the Chinese Academy of Sciences (IACAS) proposed a type of supervised speech enhancement approach with residual noise control for voice communication.

Based on artificially maintaining low-level residual noise, researchers dedicated to maximizing noise reduction and minimizing speech distortion jointly, leading to better perceptual comfortableness of enhanced speech.

Facing the widely-existing disadvantages of loss functions, researchers introduced multiple adjustable hyper-parameters and derived a generalized loss function.

They selected suitable parameter configurations, making the enhanced speech weigh flexibly and effectively between the two objectives. Meanwhile, by introducing low-level background noise, they improved the subjective perceptual quality.

Experimental results showed that choosing suitable parameter configurations could make the enhanced speech outperform previous works in terms of both objective metrics and subjective evaluation results.

This work could be utilized for noise suppression and speech information extraction in the speech communication devices.

The study, published in Applied Sciences, was supported by the National Natural Science Foundation of China.

More information: Andong Li et al., A Supervised Speech Enhancement Approach with Residual Noise Control for Voice Communication, Applied Sciences (2020). DOI: 10.3390/app10082894

Provided by Chinese Academy of Sciences

Supervised speech enhancement approach improves quality of voice communication

Noise is an increasing problem in learning environments

For more open and equitable public discussions on social media, try 'meronymity'

Researchers develop energy-efficient probabilistic computer by combining CMOS with stochastic nanomagnet

Research team manufactures the first universal, programmable and multifunctional photonic chip

Metasurface antenna could enable future 6G communications networks

New computer vision tool can count damaged buildings in crisis zones and accurately estimate bird flock sizes

Could new technique for 'curving' light be the secret to improved wireless communication?

Game theory research shows AI can evolve into more selfish or cooperative personalities

Team develops a way to teach a computer to type like a human

Universal 'cocktail electrolyte' developed for 4.6 V ultra-stable fast charging of commercial lithium-ion batteries

Garbage could replace a quarter of petroleum-based jet fuel every year

Mess is best: Disordered structure of battery-like devices improves performance

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

An ink for 3D-printing flexible devices without mechanical joints

Floating solar's potential to support sustainable development

Harvesting vibrational energy from 'colored noise'

New understanding of energy losses in emerging light source

Octopus inspires new suction mechanism for robots

Proof-of-concept nanogenerator turns CO₂ into sustainable power

Supervised speech enhancement approach improves quality of voice communication

Let us know if there is a problem with our content

Thank you for taking time to provide your feedback to the editors

Share article

E-MAIL THE STORY