
Adding audio data when training robots helps them do a better job

Wiping evaluation. Top: different test scenarios. Bottom: typical failure cases and task success rates. The vision-only policy often fails to maintain proper contact (e.g., it either presses too hard into the board or floats above it). The MLP-fusion policy often fails to fully wipe off the drawing and terminates early. Credit: arXiv (2024). DOI: 10.48550/arxiv.2406.19464

A combined team of roboticists from Stanford University and the Toyota Research Institute has found that adding audio data to visual data when training robots improves how well they learn. The team has posted their research on the arXiv preprint server.

The researchers noted that virtually all training done with AI-based robots involves exposing them to large amounts of visual information while ignoring the associated audio. They wondered whether equipping robots with microphones, so they could also learn how a task should sound as it is performed, might help them learn it better.
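The article does not spell out how such sound data is represented, but a common approach in audio-visual learning pipelines is to convert the raw microphone waveform into a log-mel spectrogram, an image-like representation a neural network can consume alongside camera frames. The sketch below illustrates that idea; the sample rate, spectrogram parameters, and the `audio_to_features` helper are assumptions for illustration, not code from the paper.

```python
# Illustrative sketch: turn raw microphone audio into a log-mel
# spectrogram. All parameter values here are assumed, not the paper's.
import torch
import torchaudio

SAMPLE_RATE = 16_000  # assumed microphone sample rate

mel = torchaudio.transforms.MelSpectrogram(
    sample_rate=SAMPLE_RATE,
    n_fft=1024,        # window size for the short-time Fourier transform
    hop_length=256,    # stride between successive windows
    n_mels=64,         # number of mel frequency bins
)

def audio_to_features(waveform: torch.Tensor) -> torch.Tensor:
    """Map a mono waveform of shape (1, num_samples) to a log-mel
    spectrogram of shape (1, n_mels, num_frames)."""
    spec = mel(waveform)
    return torch.log(spec + 1e-6)  # log-compress the amplitudes

# Example: one second of audio yields a (1, 64, 63) feature map.
features = audio_to_features(torch.zeros(1, SAMPLE_RATE))
print(features.shape)
```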

For example, if a robot is supposed to learn how to open a box of cereal and fill a bowl with it, it may help to hear the sound of the box being opened and of the dry cereal cascading into the bowl. To find out, the team designed and carried out four robot-learning experiments.

The first experiment involved teaching a robot to turn over a bagel in a frying pan using a spatula. The second involved teaching a robot to use an eraser to erase an image on a white board. The third was pouring dice held in a cup into another cup, and the fourth was choosing the correct size of tape from three available samples and using it to tape a wire to a plastic strip.

All the experiments used the same robot, equipped with a grasping claw, and each was run in two ways: with video only, and with video and audio. The research team also varied teaching and performance factors such as the table height, the type of tape, and the kind of image on the white board.
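To make the two training conditions concrete, here is a minimal, hypothetical sketch in the spirit of the MLP-fusion baseline mentioned in the figure caption: each modality is assumed to have already been reduced to a fixed-size embedding, and the video-and-audio variant simply concatenates the two before a small action head. The `FusionPolicy` class and the embedding and action dimensions are illustrative assumptions, not the authors' architecture.

```python
# Illustrative sketch of "video only" vs. "video + audio" policies.
# Dimensions and architecture are assumed stand-ins, not the paper's.
import torch
import torch.nn as nn

VISION_DIM, AUDIO_DIM, ACTION_DIM = 512, 128, 7  # assumed sizes

class FusionPolicy(nn.Module):
    def __init__(self, use_audio: bool):
        super().__init__()
        self.use_audio = use_audio
        in_dim = VISION_DIM + (AUDIO_DIM if use_audio else 0)
        # Simple MLP mapping the (fused) embedding to a robot action.
        self.head = nn.Sequential(
            nn.Linear(in_dim, 256), nn.ReLU(),
            nn.Linear(256, ACTION_DIM),
        )

    def forward(self, vision_emb, audio_emb=None):
        x = vision_emb
        if self.use_audio:
            # Concatenate audio features with visual features.
            x = torch.cat([vision_emb, audio_emb], dim=-1)
        return self.head(x)

# The two training conditions differ only in whether audio is fused in.
vision_only = FusionPolicy(use_audio=False)
with_audio = FusionPolicy(use_audio=True)
action = with_audio(torch.randn(1, VISION_DIM), torch.randn(1, AUDIO_DIM))
print(action.shape)  # torch.Size([1, 7])
```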

After running all their experiments, the researchers compared the results, judging how quickly and easily the robots learned and carried out the tasks, as well as how accurately. They found that adding audio significantly improved speed and accuracy on some tasks, but not others.

Adding audio to the task of pouring dice, for example, dramatically improved the robot's ability to figure out whether any dice remained in the cup. It also helped the robot judge whether it was exerting the right amount of pressure on the eraser, because of the distinctive sound that was made. Adding sound did not help much, on the other hand, in determining whether the bagel had been turned successfully or whether all of an image had been removed from the white board.

The team concludes by suggesting that their work shows that adding audio to training material for AI-based robots could provide better results for some applications.

More information: Zeyi Liu et al, ManiWAV: Learning Robot Manipulation from In-the-Wild Audio-Visual Data, arXiv (2024). DOI: 10.48550/arxiv.2406.19464

Project page: mani-wav.github.io/

Journal information: arXiv

© 2024 Science X Network

Citation: Adding audio data when training robots helps them do a better job (2024, July 5) retrieved 5 July 2024 from https://techxplore.com/news/2024-07-adding-audio-robots-job.html
This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
