share this!
2
3
Share
Email

October 2, 2020

Hey Google, it's time you listened closely to what our kids are saying

by Lachlan Gilbert, University of New South Wales

Engineers from UNSW Sydney are leading a drive to sample the voices of Australian kids so that they can be better understood by devices that use voice recognition software.

And the researchers say the benefits could also flow into education and speech therapy where digital devices could provide immediate and ongoing feedback in speech training and other learning tasks.

Up until now, speech recognition software that powers virtual assistants like Google Assistant, Alexa and Siri has relied on a growing database of adult voices.

But all that is about to change with the launch of AusKidTalk, a joint project of five Australian universities that aims to build a world-first database of Australian children's voices.

Dr. Beena Ahmed, a senior lecturer with UNSW's School of Electrical Engineering and Telecommunications, says while speech recognition technology has made leaps and bounds in the last decade, the technology is still lagging when it comes to understanding and speaking with children.

"There's been a big improvement in speech recognition to work with different accents and languages," she says. "But so far that has just been for adults. There is a definite shortage of data for children—not just in Australia, but all over the world. This is despite children being such an important demographic. Companies like Amazon, Apple and Google are all starting to notice that this is a big market."

Dr. Ahmed and her fellow engineers, linguists, psychologists and speech pathologists are about to start recruiting 750 children between the ages of three and 12 to provide speech samples as part of the AusKidTalk program. In sound-proof studios located at each of the five campuses, the children will be recorded as they are prompted to repeat words, digits and sentences before engaging in unscripted storytelling exercises.

The new database of children's speech will be used by linguists and psychologists to better understand how children develop their speech and language. Engineers, meanwhile, will be able to use it to develop new speech recognition systems that will interact with younger users much more seamlessly.

Dr. Ahmed says the accuracy of speech recognition systems when interacting with children has so far been quite poor.

Credit: University of New South Wales

"The main reason for this is because children's speech is quite different from adults' speech.

"Children's language skills aren't as sophisticated as adults. They might mispronounce or leave sounds or words out, or change the expected order of words. Then there are physiological differences—their vocal tract isn't fully developed, and until they hit puberty, they speak in much higher pitches. All this makes their speech very different from adults and therefore harder for speech recognition systems to process."

Potential benefits for speech therapy and education

In addition to recording samples of typical speech, the researchers will also be recording samples of disordered speech spoken by children.

The idea behind this is if speech recognition systems could be taught to recognise when children are having problems forming words, they could not only be used to understand voice commands spoken by kids with impaired speech, but could also be used therapeutically to help with speech training using a mobile device.

"Speech therapy is a very costly business," Dr. Ahmed says.

"You've got parents spending up to $200 for a session with a clinician, and still having to do a lot of home practice that the clinician can't monitor.

"Another problem is that parents can also find it hard to provide feedback themselves, because they're not properly trained or because they're already tuned to understand their kids in cases where others might not."

But with an automated speech therapy tool, kids and parents could get instant feedback when they practice what they've learned with the clinician, Dr. Ahmed says.

"It would give children immediate and ongoing access. You can't expect this level of attention from limited appointments with limited numbers of available pathologists."

Speech recognition systems using a database of children's voices could also have benefits in education.

"A lot of schools rely on getting parent volunteers to listen to children doing their reading in early education. But in schools that may have trouble getting enough parent volunteers, a child could read to a tablet or computer which could listen and correct them as they went," Dr. Ahmed says.

The researchers say the COVID-19 pandemic has shown just how important remote communication and learning tools are.

"Unfortunately children have not been able to benefit from these tools as much as adults due to a lack of effective speech-based tools for remote speech therapy and learning—so they likely have not been able to get the same benefit from telehealth and tele-education tools," Dr. Ahmed says.

Dr. Ahmed says after the samples of 750 children have been recorded and integrated into a speech recognition system, an open source database will be available online for other researchers to work with. The project is expected to be complete by June 2021.

AusKidTalk is an ARC-funded program involving UNSW Sydney, The University of Sydney, Western Sydney University, Macquarie University and the University of Melbourne.

If you would like your child to take part in this research, visit the AusKidTalk website for details on how to apply.

Provided by University of New South Wales

Citation: Hey Google, it's time you listened closely to what our kids are saying (2020, October 2) retrieved 29 June 2024 from https://techxplore.com/news/2020-10-hey-google-kids.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

How does playing with other children affect toddlers' language learning?

5 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

23 hours ago

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

Hey Google, it's time you listened closely to what our kids are saying

Potential benefits for speech therapy and education

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

How does playing with other children affect toddlers' language learning?

New genes linked to severe childhood speech disorder

Some children find it harder to understand what strangers are saying

Automated speech recognition less accurate for blacks: study

Preschoolers correct speaking mistakes even when talking to themselves

Variability in natural speech is challenging for the dyslexic brain

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

New tool detects AI-generated videos with 93.7% accuracy

Phys.org

Medical Xpress

Science X

Hey Google, it's time you listened closely to what our kids are saying

Potential benefits for speech therapy and education

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

How does playing with other children affect toddlers' language learning?

New genes linked to severe childhood speech disorder

Some children find it harder to understand what strangers are saying

Automated speech recognition less accurate for blacks: study

Preschoolers correct speaking mistakes even when talking to themselves

Variability in natural speech is challenging for the dyslexic brain

Recommended for you

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

New tool detects AI-generated videos with 93.7% accuracy

Your Privacy