share this!
5
9
Share
Email

April 28, 2021

Cognitive neuroscience could pave the way for emotionally intelligent robots

by Japan Advanced Institute of Science and Technology

Human beings have the ability to recognize emotions in others. Although perfectly capable of communicating with humans through speech, robots and virtual agents are only good at processing logical instructions, which greatly restricts human-robot interaction (HRI). Consequently, a great deal of research in HRI is about emotion recognition from speech. But first, how do we describe emotions?

Categorical emotions such as happiness, sadness and anger are well understood by us but can be hard for robots to register. Researchers have focused on "dimensional emotions," which constitute a gradual emotional transition in natural speech. "Continuous dimensional emotion can help a robot capture the time dynamics of a speaker's emotional state and accordingly adjust its manner of interaction and content in real time," explains Prof. Masashi Unoki from Japan Advanced Institute of Science and Technology (JAIST), who works on speech recognition and processing.

Studies have shown that an auditory perception model simulating the working of a human ear can generate what are called "temporal modulation cues" that faithfully capture the time dynamics of dimensional emotions. Neural networks can then be employed to extract features from these cues that reflect these time dynamics. However, due to the complexity and variety of auditory perception models, feature extraction turns out to be pretty challenging.

In a new study published in Neural Networks, Prof. Unoki and his colleagues, including Zhichao Peng, from Tianjin University, China (who led the study), Jianwu Dang from Pengcheng Laboratory, China, and Prof. Masato Akagi from JAIST, have now taken inspiration from a recent finding in cognitive neuroscience suggesting that our brain forms multiple representations of natural sounds with different degrees of spectral (i.e., frequency) and temporal resolutions through a combined analysis of spectral-temporal modulations.

Accordingly, the researchers have proposed a novel feature called multi-resolution modulation-filtered cochleagram (MMCG), which combines four modulation-filtered cochleagrams (time-frequency representations of the input sound) at different resolutions to obtain the temporal and contextual modulation cues. To account for the diversity of the cochleagrams, researchers designed a parallel neural network architecture called "long short-term memory" (LSTM), which modeled the time variations of multi-resolution signals from the cochleagrams and carried out extensive experiments on two datasets of spontaneous speech.

The results were encouraging. The researchers found that MMCG showed a significantly better emotion recognition performance than traditional acoustic-based features and other auditory-based features for both the datasets. Furthermore, the parallel LSTM network demonstrated a superior prediction of dimensional emotions than that with a plain LSTM-based approach.

Prof. Unoki is thrilled and contemplates improving upon the MMCG feature in future research. "Our next goal is to analyze the robustness of environmental noise sources and investigate our feature for other tasks, such as categorical emotion recognition, speech separation, and voice activity detection,"he concludes.

More information: Zhichao Peng et al. Multi-resolution modulation-filtered cochleagram feature for LSTM-based dimensional emotion recognition from speech, Neural Networks (2021). DOI: 10.1016/j.neunet.2021.03.027

Provided by Japan Advanced Institute of Science and Technology

Citation: Cognitive neuroscience could pave the way for emotionally intelligent robots (2021, April 28) retrieved 16 August 2024 from https://techxplore.com/news/2021-04-cognitive-neuroscience-pave-emotionally-intelligent.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Sounds familiar: A speaker identity-controllable framework for machine speech translation

15 shares

Feedback to editors

Engineers design tiny batteries for powering cell-sized robots

11 hours ago

Leaf-like solar concentrators promise major boost in solar efficiency

12 hours ago

Why does AI beat humans at the strategy game Diplomacy?

13 hours ago

New technique prints metal oxide thin film circuits at room temperature

14 hours ago

Studies highlight challenges and solutions in making large language models trustworthy

15 hours ago

Finding security flaws in Android ahead of malicious hackers

15 hours ago

Robot planning tool accounts for human carelessness

16 hours ago

From shrimp to steel: Introducing nature-inspired metalworking

16 hours ago

'AI Scientist' model designed to conduct scientific research autonomously

17 hours ago

Global AI adoption is outpacing risk understanding, researchers warn

17 hours ago

Load comments (0)

Cognitive neuroscience could pave the way for emotionally intelligent robots

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Sounds familiar: A speaker identity-controllable framework for machine speech translation

Study explains role of bone-conducted speech transmission in speech production and hearing

A deep learning technique for context-aware emotion recognition

A convolutional network to align and predict emotion annotations

Emotion recognition based on paralinguistic information

Using a cappella to explain speech and music specialization

Engineers design tiny batteries for powering cell-sized robots

A two-stage framework to improve LLM-based anomaly detection and reactive planning

Watch how this shape-shifting wheel tackles uneven surfaces

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Phys.org

Medical Xpress

Science X

Cognitive neuroscience could pave the way for emotionally intelligent robots

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Related Stories

Sounds familiar: A speaker identity-controllable framework for machine speech translation

Study explains role of bone-conducted speech transmission in speech production and hearing

A deep learning technique for context-aware emotion recognition

A convolutional network to align and predict emotion annotations

Emotion recognition based on paralinguistic information

Using a cappella to explain speech and music specialization

Recommended for you

Engineers design tiny batteries for powering cell-sized robots

A two-stage framework to improve LLM-based anomaly detection and reactive planning

Watch how this shape-shifting wheel tackles uneven surfaces

Robot planning tool accounts for human carelessness

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Your Privacy