December 7, 2021

A system for designing and training intelligent soft robots

by Rachel Gordon, Massachusetts Institute of Technology

Let's say you wanted to build the world's best stair-climbing robot. You'd need to optimize for both the brain and the body, perhaps by giving the bot some high-tech legs and feet, coupled with a powerful algorithm to enable the climb.

Although design of the physical body and its brain, the "control," are key ingredients to letting the robot move, existing benchmark environments favor only the latter. Co-optimizing for both elements is hard—it takes a lot of time to train various robot simulations to do different things, even without the design element.

Scientists from MIT's Computer Science and Artificial Intelligence Laboratory (CSAIL), aimed to fill the gap by designing "Evolution Gym," a large-scale testing system for co-optimizing the design and control of soft robots, taking inspiration from nature and evolutionary processes.

The robots in the simulator look a little bit like squishy, moveable Tetris pieces made up of soft, rigid, and actuator "cells" on a grid, put to the tasks of walking, climbing, manipulating objects, shape-shifting, and navigating dense terrain. To test the robot's aptitude, the team developed their own co-design algorithms by combining standard methods for design optimization and deep reinforcement learning (RL) techniques.

The co-design algorithm functions somewhat like a power couple, where the design optimization methods evolve the robot's bodies and the RL algorithms optimize a controller (a computer system that connects to the robot to control the movements) for a proposed design. The design optimization asks "How well does the design perform?" and the control optimization responds with a score, which could look like a five for "walking."

The result looks like a little robot Olympics. In addition to standard tasks like walking and jumping, the researchers also included some unique tasks, like climbing, flipping, balancing, and stair-climbing.

In over 30 different environments, the bots performed amply on simple tasks, like walking or carrying an item, but in more difficult environments, like catching and lifting, they fell short, showing the limitations of current co-design algorithms. For instance, sometimes the optimized robots exhibited what the team calls "frustratingly" obvious nonoptimal behavior on many tasks. For example, the "catcher" robot would often dive forward to catch a falling block that was falling behind it.

Even though the robot designs evolved autonomously from scratch and without prior knowledge by the co-design algorithms, in a step toward more evolutionary processes, they often grew to resemble existing natural creatures while outperforming hand-designed robots.

"With Evolution Gym we're aiming to push the boundaries of algorithms for machine learning and artificial intelligence," says MIT undergraduate Jagdeep Bhatia, a lead researcher on the project. "By creating a large-scale benchmark that focuses on speed and simplicity, we not only create a common language for exchanging ideas and results within the reinforcement learning and co-design space, but also enable researchers without state-of-the-art computing resources to contribute to algorithmic development in these areas. We hope that our work brings us one step closer to a future with robots as intelligent as you or I."

In certain cases, for robots to learn just like humans, trial and error can lead to the best performance of understanding a task, which is the thought behind reinforcement learning. Here, the robots learned how to complete a task like pushing a block by getting some information that will assist it, like "seeing" where the block is, and what the nearby terrain is like. Then, a robot gets some measurement of how well it's doing (the "reward"). The more the robot pushes the block, the higher the reward. The robot had to simultaneously balance exploration (maybe asking itself "Can I increase my reward by jumping?") and exploitation (further exploring behaviors that increase the reward).

The different combinations of "cells" the algorithms came up with for different designs were highly effective: One evolved to resemble a galloping horse with leg-like structures, mimicking what's found in nature. The climber robot evolved two arms and two leg-like structures (kind of like a monkey) to help it climb. The lifter robot resembled a two-fingered gripper.

One avenue for future research is so-called "morphological development," where a robot incrementally becomes more intelligent as it gains experience solving more complex tasks. For example, you'd start by optimizing a simple robot for walking, then take the same design, optimize it for carrying, and then climbing stairs. Over time, the robot's body and brain "morph" into something that can solve more challenging tasks compared to robots directly trained on the same tasks from the start.

"Evolution Gym is part of a growing awareness in the AI community that the body and brain are equal partners in supporting intelligent behavior," says University of Vermont robotics professor Josh Bongard. "There is so much to do in figuring out what forms this partnership can take. Gym is likely to be an important tool in working through these kinds of questions."

Evolution Gym is open source and free to use. This is by design as the researchers hope that their work inspires new and improved algorithms in codesign.

Bhatia wrote the paper alongside MIT undergraduate Holly Jackson, MIT CSAIL Ph.D. student Yunsheng Tian, and Jie Xu, as well as MIT Professor Wojciech Matusik. They are presenting the research at the 2021 Conference on Neural Information Processing Systems.

More information: Jagdeep Bhatia et al, Evolution Gym: A Large-Scale Benchmark for Evolving Soft Robots (2021) is available as a PDF at papers.nips.cc/paper/2021/file … b27861f0c2-Paper.pdf

Provided by Massachusetts Institute of Technology

This story is republished courtesy of MIT News (web.mit.edu/newsoffice/), a popular site that covers news about MIT research, innovation and teaching.

Citation: A system for designing and training intelligent soft robots (2021, December 7) retrieved 17 July 2024 from https://techxplore.com/news/2021-12-intelligent-soft-robots.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researchers' algorithm designs soft-bodied robots that sense their own positions in space

133 shares

Feedback to editors

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

37 minutes ago

Unlocking the potential of rust: High-efficiency green hydrogen production from hematite

57 minutes ago

Scientists bridge the 'valley of death' in carbon capture technologies

58 minutes ago

Flexible electronics researchers develop a completely stretchy lithium-ion battery

4 hours ago

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

5 hours ago

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

20 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

22 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Jul 16, 2024

Large language models make human-like reasoning mistakes, researchers find

Jul 16, 2024

Unveiling a new class of synthetic fuels

Jul 16, 2024

Load comments (0)

A system for designing and training intelligent soft robots

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

Unlocking the potential of rust: High-efficiency green hydrogen production from hematite

Scientists bridge the 'valley of death' in carbon capture technologies

Flexible electronics researchers develop a completely stretchy lithium-ion battery

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Researchers' algorithm designs soft-bodied robots that sense their own positions in space

A robot that teaches itself to walk using reinforcement learning

The MIT humanoid robot: A dynamic robotic that can perform acrobatic behaviors

A policy to enable the use of general-purpose manipulators in high-speed robot air hockey

Control of snake-like robots for high mobility and dexterity

Training robots to manipulate soft and deformable objects

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

New system enables intuitive teleoperation of a robotic manipulator in real-time

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Phys.org

Medical Xpress

Science X

A system for designing and training intelligent soft robots

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

Unlocking the potential of rust: High-efficiency green hydrogen production from hematite

Scientists bridge the 'valley of death' in carbon capture technologies

Flexible electronics researchers develop a completely stretchy lithium-ion battery

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Related Stories

Researchers' algorithm designs soft-bodied robots that sense their own positions in space

A robot that teaches itself to walk using reinforcement learning

The MIT humanoid robot: A dynamic robotic that can perform acrobatic behaviors

A policy to enable the use of general-purpose manipulators in high-speed robot air hockey

Control of snake-like robots for high mobility and dexterity

Training robots to manipulate soft and deformable objects

Recommended for you

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

New system enables intuitive teleoperation of a robotic manipulator in real-time

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Your Privacy