October 24, 2023 report

Robots learn faster with AI boost from Eureka

by Peter Grad, Tech Xplore

Intelligent robots are reshaping our universe. In New Jersey's Robert Wood Johnson University Hospital, AI-assisted robots are bringing a new level of security to doctors and patients by scanning every inch of the premises for harmful bacteria and viruses and disinfecting them with precise doses of germicidal ultraviolet light.

In agriculture, robotic arms driven by drones scan varying types of fruits and vegetables and determine when they are perfectly ripe for picking.

The Airspace Intelligence System AI Flyways takes over the challenging and often stressful tasks of flight dispatchers who must make last-minute flight pattern changes due to sudden extreme weather, depleted fuel supplies, mechanical problems or other emergencies. It optimizes solutions, is safer, saves time and is cost-efficient.

But forget about those accomplishments: Can a robot perform flawless pen-spinning tricks?

A team at NVIDIA Research developed one that can. The task itself is impressive: some experts say it can take humans months, or even a year or more, to master the fine art of pen spinning, including challenging manipulations with names such as Devil's Sonic, Backaround, Corkscrew and Bust X2. But what stands out about NVIDIA's project is that the robot learned the pen-spinning feat from AI-generated instructions.

In a paper titled "Eureka: Human-Level Reward Design via Coding Large Language Models" that appears on the preprint server arXiv, researchers describe an "evolutionary optimization over reward code" in which robots learn complex fine-manipulation movements through AI-generated instructions.

It holds the promise of ever-more efficient problem solving with LLMs, more advanced physical manipulation, and ever-smarter machines in our future.

The team developed Eureka, an algorithm that uses GPT-4 to write the reward functions that guide robots learning advanced motor skills. The tasks are performed in a physics simulation application called Isaac Gym, developed by NVIDIA. Researchers from UPenn, Caltech and the University of Texas at Austin also participated in the project.

Rewards generated by Eureka outperformed those designed by human experts on 83% of the tasks. The rapid pen-spinning task was one of 29 complex skills trained with the Eureka algorithm.

"The versatility and substantial performance gains of Eureka suggest that the simple principle of combining large language models with evolutionary algorithms is a general and scalable approach to reward design, an insight that may be generally applicable to difficult, open-ended search problems," said Anima Anandkumar, senior director of AI research at NVIDIA and an author of the Eureka paper.

Isaac Gym simulates physical activity in a three-dimensional environment. Its massively parallel training sessions generate candidate solutions for numerous manipulations far faster than humans or earlier computing systems can. The simulator, researchers say, can speed up training by a factor of 1,000.

Feedback from human operators can be incorporated into training algorithms. The researchers say that would act as a "powerful co-pilot" in especially challenging tasks.

Other tasks accomplished through Eureka training include opening cabinets and drawers, handling scissors and tossing and catching balls.

Eureka compiles statistics on each training session's progress and uses them to revise the reward code, continually improving results.
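For readers who want a feel for that loop, here is a toy sketch in Python. It is not the Eureka code: the language model is replaced by random tweaks to three made-up reward weights, and the simulated training run is replaced by a hypothetical scoring function, `task_score`. Every name and number in it is an assumption chosen for illustration. What it does show is the basic propose, score, keep-the-best cycle of an evolutionary search.

```python
import random


def task_score(weights):
    # Hypothetical stand-in for "train a policy with this reward in
    # simulation, then measure how well it does the task".
    # The target values are made up for this sketch.
    target = (1.0, 0.2, 0.5)
    return -sum((w - t) ** 2 for w, t in zip(weights, target))


def mutate(weights, rng, scale=0.1):
    # Stand-in for the language model proposing a revised reward:
    # here, just a small random nudge to each weight.
    return tuple(w + rng.gauss(0.0, scale) for w in weights)


def evolve(generations=20, samples=8, seed=0):
    rng = random.Random(seed)
    best = (0.0, 0.0, 0.0)
    best_score = task_score(best)
    history = [best_score]  # per-generation statistics
    for _ in range(generations):
        # Propose several candidate rewards based on the current best.
        candidates = [mutate(best, rng) for _ in range(samples)]
        top_score, top = max((task_score(c), c) for c in candidates)
        # Keep a candidate only if it beats the best seen so far.
        if top_score > best_score:
            best, best_score = top, top_score
        history.append(best_score)
    return best, best_score, history


if __name__ == "__main__":
    best, score, history = evolve()
    print("best weights:", best)
    print("score by generation:", [round(s, 3) for s in history])
```

Because a candidate is kept only when it improves on the current best, the recorded score never gets worse from one generation to the next. In the system the article describes, the proposal step would be a language model rewriting reward code and the scoring step would be a training run in simulation.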

According to Shital Shah, a principal research engineer at Microsoft Research, "The proverbial positive feedback loop of self-improvement might be just around the corner that allows us to go beyond human training data and capabilities."

More information: Yecheng Jason Ma et al, Eureka: Human-Level Reward Design via Coding Large Language Models, arXiv (2023). DOI: 10.48550/arxiv.2310.12931

Project website: eureka-research.github.io/

Journal information: arXiv

Citation: Robots learn faster with AI boost from Eureka (2023, October 24) retrieved 29 June 2024 from https://techxplore.com/news/2023-10-robots-faster-ai-boost-eureka.html
