Machine Learning & AI

Emotion recognition based on paralinguistic information

Researchers at the University of Texas at Arlington have recently explored the use of machine learning for emotion recognition based solely on paralinguistic information. Paralinguistics are aspects of spoken communication ...

Robotics

Improbotics: Bringing machine intelligence into improvised theatre

An unconventional company called HumanMachine has recently devised a unique theatre production called Improbotics, which involves both human performers and machines. This is the latest of a series of projects and initiatives ...

Software

Mozilla releases transcription model and huge voice dataset

(Tech Xplore)—Mozilla (maker of the Firefox browser) has announced the release of an open source speech recognition model along with a large voice dataset. The release marks the advent of open source speech recognition ...

Consumer & Gadgets

How Google's Pixel Buds earphones translate languages

In the Hitchhiker's Guide to The Galaxy, Douglas Adams's seminal 1978 BBC broadcast (then book, feature film and now cultural icon), one of the many technology predictions was the Babel Fish. This tiny yellow life-form, inserted ...

Security

Researchers explore inaudible voice command attack

(Tech Xplore)—The novelty has worn off; we are now quite accustomed to the technology advances leveraged by leading vendors that allow the daily use of human-device interactions via voice.

Computer Sciences

Machine voice recognition reaches human parity

Last year, Microsoft's speech and dialog research group announced a milestone in reaching human parity on the Switchboard conversational speech recognition task, meaning we had created technology that recognized words in ...

Engineering

Robotic co-pilot is shown to land simulated Boeing 737

(Tech Xplore)—Would you want to take your next flight out knowing the pilot is a robot? The question will not be a problem for you to resolve any time soon. Instead, consider a robot behaving as co-pilot and that is not ...

Computer Sciences

Baidu Research is keen on addressing transcription pain points

(Tech Xplore)—Artificial intelligence powered transcription software? How, where? Professionals in many sectors who may have to cope with transcriptions of interviews and recorded statements know how tiring transcribing ...

page 1 from 3

Speech recognition

Speech recognition (also known as automatic speech recognition or computer speech recognition) converts spoken words to machine-readable input (for example, to key presses, using the binary code for a string of character codes). The term "voice recognition" is sometimes used to refer to speech recognition where the recognition system is trained to a particular speaker - as is the case for most desktop recognition software, hence there is an aspect of speaker recognition, which attempts to identify the person speaking, to better recognise what is being said. Speech recognition is a broad term which means it can recognise almost anybodys speech - such as a callcentre system designed to recognise many voices. Voice recognition is a system trained to a particular user, where it recognises their speech based on their unique vocal sound.

Speech recognition applications include voice dialing (e.g., "Call home"), call routing (e.g., "I would like to make a collect call"), domotic appliance control and content-based spoken audio search (e.g., find a podcast where particular words were spoken), simple data entry (e.g., entering a credit card number), preparation of structured documents (e.g., a radiology report), speech-to-text processing (e.g., word processors or emails), and in aircraft cockpits (usually termed Direct Voice Input).

This text uses material from Wikipedia, licensed under CC BY-SA