April 20, 2024

Microsoft teases lifelike avatar AI tech but gives no release date

by Glenn CHAPMAN

Microsoft researchers say an AI model they have developed lets avatars engage in realistic-seeming conversations, complete with nuanced facial expressions.

Researchers at Microsoft have revealed a new artificial intelligence tool that can create highly realistic human avatars, but offered no timetable for making it available to the public, citing concerns about facilitating deep fake content.

The AI model known as VASA-1, for "visual affective skills," can create an animated video of a person talking, with synchronized lip movements, using just a single image and a speech audio clip.

Disinformation researchers fear rampant misuse of AI-powered applications to create "deep fake" pictures, video, and audio clips in a pivotal election year.

"We are opposed to any behavior to create misleading or harmful contents of real persons," wrote the authors of the VASA-1 report, released this week by Microsoft Research Asia.

"We are dedicated to developing AI responsibly, with the goal of advancing human well-being," they said.

"We have no plans to release an online demo, API, product, additional implementation details, or any related offerings until we are certain that the technology will be used responsibly and in accordance with proper regulations."

Microsoft researchers said the technology can capture a wide spectrum of facial nuances and natural head motions.

"It paves the way for real-time engagements with lifelike avatars that emulate human conversational behaviors," researchers said in the post.

VASA can work with artistic photos, songs, and non-English speech, according to Microsoft.

Researchers touted potential benefits of the technology such as providing virtual teachers to students or therapeutic support to people in need.

"It is not intended to create content that is used to mislead or deceive," they said.

VASA videos still have "artifacts" that reveal they are AI-generated, according to the post.

ProPublica technology lead Ben Werdmuller said he'd be "excited to hear about someone using it to represent them in a Zoom meeting for the first time."

"Like, how did it go? Did anyone notice?" he said on social network Threads.

ChatGPT-maker OpenAI in March revealed a voice-cloning tool called "Voice Engine" that can essentially duplicate someone's speech based on a 15-second audio sample.

But it said it was "taking a cautious and informed approach to a broader release due to the potential for synthetic voice misuse."

Earlier this year, a consultant working for a long-shot Democratic presidential candidate admitted he was behind a robocall impersonation of Joe Biden sent to voters in New Hampshire, saying he was trying to highlight the dangers of AI.

The call featured what sounded like Biden's voice urging people not to cast ballots in the state's January primary, sparking alarm among experts who fear a deluge of AI-powered deep fake disinformation in the 2024 White House race.

Citation: Microsoft teases lifelike avatar AI tech but gives no release date (2024, April 20) retrieved 17 July 2024 from https://techxplore.com/news/2024-04-microsoft-lifelike-avatar-ai-tech.html

