share this!
3
2
Share
Email

September 15, 2020

Giving computers a voice

by Martina Märki, ETH Zurich

From Alexa and Siri to translation programs and computer-generated news, anything seems possible these days.The Media Technology Center is searching for applications that could lend a hand with day-to-day editorial work.

Every time you talk to Siri on your phone and ask a question or give a command, you are communicating with artificial intelligence. The only problem is that this intelligence has its limits. In fact, compared to human intelligence, Siri could even be described as fairly stupid, says Ryan Cotterell, a professor who has worked at ETH Zurich since February 2020. Appointed through the ETH media technology initiative as a Professor of Computer Science, Cotterell brings together linguistics, automated language processing and artificial intelligence. "The only reason Siri works is because people typically use very simple questions and commands when they speak to their phone," he says.

Cotterell insists that we shouldn't expect the same from AI as we do from human intelligence. None of us have any trouble learning our native language, he says, and English speakers can intuitively spot grammatical mistakes in an English sentence. Yet computer programs still struggle to identify whether an English sentence is grammatically correct or not—and that's because a language processing program works very differently to the human brain. "No translator has ever had to learn the sheer number of words we need to train a translation program," he says.

The Swiss German challenge

Modern translation programs learn using big data, honing their abilities with millions of pairs of sentences. Yet coming up with multiple alternatives for translating an individual sentence is a lot harder. Human translators can do it easily, but translation programs typically offer just one solution. Cotterell hopes to change that: "We want users to have multiple options rather than just being presented with one result. That would allow users to choose the best-fit sentence for each specific context." Yet developing a viable algorithm for this purpose is no easy task, he cautions.

A further challenge is creating translation programs and voice assistants for languages that are only used by relatively small numbers of people. "It's very hard to develop a good system for languages that are low on data," says Cotterell. Hence his enthusiasm for a voice assistant program that speaks Swiss dialects, which was developed by the Media Technology Center (MTC) at ETH Zurich.

This is a truly remarkable achievement, not only because there are so many regional variants of Swiss dialect, but also because these languages lack a standardized form of spelling. The MTC's voice assistant has been fluent in a Bernese dialect called "Bärndütsch" since 2019, and further dialects are now in the pipeline. To develop their Swiss German assistant, researchers partnered with Swiss Radio and Television (SRF). The benefit of technologies that translate standard German into Swiss German or read local news and weather in specific dialects is their ability to provide regional authenticity—even when automatically converting text to speech.

A computer-generated media experience

More research is needed into linguistic diversity in Switzerland and Europe, especially since most language processing systems come from English-speaking areas, including those suitable for use in media. "That's why we can't just take what American and English media are doing with computerized language processing and simply apply it here," says Cotterell. With support from the media companies NZZ and TX Group, he is planning a translation system that will translate high-quality articles from German into French. Severin Klingler, Managing Director of the Media Technology Center, explains the thinking behind this move: "The idea is to identify existing technologies from English-speaking areas and make them accessible for other languages, too."

The realm of new media presents its own challenges. Filter bubbles and fake news are now part and parcel of our day-to-day media experience, but could AI offer a means of countering this? This is one of the questions currently being explored by the Media Technology Center. As part of the Anti-Recommendation Engine for News Articles project, researchers are seeking to combat filter bubbles by programming a system to search for relevant counterarguments. MTC is also running a project that aims to computerize comment sorting based on content-related criteria. "This could help make differences of opinion more visible," says Klingler.

The only caveat is that the same methods could also be used to generate filter bubbles and fake news. Earlier this summer, news headlines were dominated by cutting-edge language-processing AI from the Californian company OpenAI. Known as GPT-3, this massive language model overshadows everything that has come before. "The dimensions are so huge that it would be impossible for universities to build or even test it," says Cotterell. One of the reasons the system attracted so much attention was the potential risk of AI-generated fake news. Given just a few sample news items, GPT-3 can generate plausible news stories in English. It looks like Ryan Cotterell and his fellow researchers at the Media Technology Center still have plenty of work ahead of them.

Provided by ETH Zurich

Citation: Giving computers a voice (2020, September 15) retrieved 19 April 2024 from https://techxplore.com/news/2020-09-voice.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

The virtual made real: New technology for the media of the future

5 shares

Feedback to editors

Researchers develop sodium battery capable of rapid charging in just a few seconds

4 hours ago

Greater access to clean water, thanks to a better membrane

6 hours ago

Silent flight edges closer to take off, according to new research

7 hours ago

A flexible and efficient DC power converter for sustainable-energy microgrids

7 hours ago

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

8 hours ago

To build a better AI helper, start by modeling the irrational behavior of humans

8 hours ago

Versatile fibers offer improved energy storage capacity for wearable devices

8 hours ago

Harnessing solar energy for high-efficiency NH₃ production

9 hours ago

A dexterous four-legged robot that can walk and handle objects simultaneously

11 hours ago

Climate change will increase value of residential rooftop solar panels across US, study finds

12 hours ago

Load comments (0)

Giving computers a voice

The Swiss German challenge

A computer-generated media experience

Researchers develop sodium battery capable of rapid charging in just a few seconds

Greater access to clean water, thanks to a better membrane

Silent flight edges closer to take off, according to new research

A flexible and efficient DC power converter for sustainable-energy microgrids

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Versatile fibers offer improved energy storage capacity for wearable devices

Harnessing solar energy for high-efficiency NH₃ production

A dexterous four-legged robot that can walk and handle objects simultaneously

Climate change will increase value of residential rooftop solar panels across US, study finds

The virtual made real: New technology for the media of the future

How AI systems use Mad Libs to teach themselves grammar

Chinese to English translating: Not human, but exceptional

Google adds 20 languages to instant translation app

Study finds dialects and languages are processed differently

Comment: Here's how the English language will change after Brexit

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Team develops a way to teach a computer to type like a human

Using sim-to-real reinforcement learning to train robots to do simple tasks in broad environments

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

Researchers use machine learning to create a fabric-based touch sensor

Phys.org

Medical Xpress

Science X

Giving computers a voice

The Swiss German challenge

A computer-generated media experience

Researchers develop sodium battery capable of rapid charging in just a few seconds

Greater access to clean water, thanks to a better membrane

Silent flight edges closer to take off, according to new research

A flexible and efficient DC power converter for sustainable-energy microgrids

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Versatile fibers offer improved energy storage capacity for wearable devices

Harnessing solar energy for high-efficiency NH₃ production

A dexterous four-legged robot that can walk and handle objects simultaneously

Climate change will increase value of residential rooftop solar panels across US, study finds

Related Stories

The virtual made real: New technology for the media of the future

How AI systems use Mad Libs to teach themselves grammar

Chinese to English translating: Not human, but exceptional

Google adds 20 languages to instant translation app

Study finds dialects and languages are processed differently

Comment: Here's how the English language will change after Brexit

Recommended for you

Microsoft's AI app VASA-1 makes photographs talk and sing with believable facial expressions

To build a better AI helper, start by modeling the irrational behavior of humans

Team develops a way to teach a computer to type like a human

Using sim-to-real reinforcement learning to train robots to do simple tasks in broad environments

Meta's newest AI model beats some peers. But its amped-up AI agents are confusing Facebook users

Researchers use machine learning to create a fabric-based touch sensor

Your Privacy