June 14, 2023

Engineering safer machine learning

by Maggie Lindenberg, University of Pittsburgh

Children first learning to walk may go a bit too fast and fall down, or run into a piece of furniture. However, that cause-and-effect element teaches them invaluable information about how their bodies move through space so that they can avoid falling in the future.

Machines learn in a lot of the same ways that humans do, including learning from their mistakes. However, for many machines—like self-driving cars and power systems—learning on the job with human safety at stake presents a problem. As machine learning matures and proliferates, there is a growing interest in applying it to highly complex, safety-critical autonomous systems. The promise of these technologies, however, is hindered by the safety risks inherent in the training process and beyond.

A new research paper challenges the idea that you need an unlimited number of trials to learn safe actions in unfamiliar environments. The paper, published recently in the journal IEEE Transactions on Automatic Control, presents a fresh approach that ensures learning safe actions with complete confidence, while managing the balance between being optimal, encountering dangerous situations, and quickly recognizing unsafe actions.

"Generally, machine learning looks for the most optimized solution, which can result in more errors along the way. That's problematic when the error could mean crashing into a wall," explained Juan Andres Bazerque, assistant professor of electrical and computer engineering at the Swanson School of Engineering, who led the research along with Associate Professor Enrique Mallada at Johns Hopkins University.

"In this study, we show that learning safe policies is fundamentally different from learning optimal policies, and that it can be done separately and efficiently."

The research team conducted studies in two different scenarios to illustrate their concept. By making reasonable assumptions about exploration, they created an algorithm that detects all unsafe actions within a limited number of rounds. The team also tackled the challenge of finding optimal policies for a Markov decision process (MDP) with almost sure constraints.

Their analysis emphasized a tradeoff between the time required to detect unsafe actions in the underlying MDP and the level of exposure to unsafe events. MDP is useful because it provides a mathematical framework for modeling decision-making in situations where outcomes are partly random and partly under the control of a decision maker.

To validate their theoretical findings, the researchers conducted simulations that confirmed the identified tradeoffs. These findings also suggested that incorporating safety constraints can expedite the learning process.

"This research challenges the prevailing belief that learning safe actions requires an unlimited number of trials," stated Bazerque. "Our results demonstrate that by effectively managing tradeoffs between optimality, exposure to unsafe events, and detection time, we can achieve guaranteed safety without an infinite number of explorations. This has significant implications for robotics, autonomous systems, and artificial intelligence, and more."

More information: Agustin Castellano et al, Learning to Act Safely With Limited Exposure and Almost Sure Certainty, IEEE Transactions on Automatic Control (2023). DOI: 10.1109/TAC.2023.3240925

Journal information: IEEE Transactions on Automatic Control

Provided by University of Pittsburgh

Citation: Engineering safer machine learning (2023, June 14) retrieved 17 July 2024 from https://techxplore.com/news/2023-06-safer-machine.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Machines can make better decisions than humans, but how do we know when they're actually accurate?

29 shares

Feedback to editors

Engineers develop technique to pinpoint nanoscale 'hot spots' in electronics to improve their longevity

52 minutes ago

Researchers create insect-inspired autonomous navigation strategy for tiny, lightweight robots

52 minutes ago

Soft, stretchy 'jelly batteries' inspired by electric eels

52 minutes ago

Astronomy methods applied to reflections in eyes could help with spotting deepfakes

53 minutes ago

The magnet trick: New invention makes vibrations disappear

2 hours ago

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

3 hours ago

Unlocking the potential of rust: High-efficiency green hydrogen production from hematite

3 hours ago

Scientists bridge the 'valley of death' in carbon capture technologies

3 hours ago

Flexible electronics researchers develop a completely stretchy lithium-ion battery

6 hours ago

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

8 hours ago

Load comments (0)

Engineering safer machine learning

Engineers develop technique to pinpoint nanoscale 'hot spots' in electronics to improve their longevity

Researchers create insect-inspired autonomous navigation strategy for tiny, lightweight robots

Soft, stretchy 'jelly batteries' inspired by electric eels

Astronomy methods applied to reflections in eyes could help with spotting deepfakes

The magnet trick: New invention makes vibrations disappear

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

Unlocking the potential of rust: High-efficiency green hydrogen production from hematite

Scientists bridge the 'valley of death' in carbon capture technologies

Flexible electronics researchers develop a completely stretchy lithium-ion battery

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

Machines can make better decisions than humans, but how do we know when they're actually accurate?

Machine-learning method used for self-driving cars could improve lives of type-1 diabetes patients

New mathematical model: Punishments and rewards teach AI agents to make the right decisions

Engineers help artificial intelligence to learn more safely in the real world

A reinforcement learning framework to enhance the ramp merging capabilities of autonomous vehicles

Scientists design learning-enabled safe control for systems in uncertain environments

Engineers develop technique to pinpoint nanoscale 'hot spots' in electronics to improve their longevity

Researchers create insect-inspired autonomous navigation strategy for tiny, lightweight robots

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

New system enables intuitive teleoperation of a robotic manipulator in real-time

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Phys.org

Medical Xpress

Science X

Engineering safer machine learning

Engineers develop technique to pinpoint nanoscale 'hot spots' in electronics to improve their longevity

Researchers create insect-inspired autonomous navigation strategy for tiny, lightweight robots

Soft, stretchy 'jelly batteries' inspired by electric eels

Astronomy methods applied to reflections in eyes could help with spotting deepfakes

The magnet trick: New invention makes vibrations disappear

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

Unlocking the potential of rust: High-efficiency green hydrogen production from hematite

Scientists bridge the 'valley of death' in carbon capture technologies

Flexible electronics researchers develop a completely stretchy lithium-ion battery

A strategy to enhance the stability of perovskite solar cells under reverse bias conditions

Related Stories

Machines can make better decisions than humans, but how do we know when they're actually accurate?

Machine-learning method used for self-driving cars could improve lives of type-1 diabetes patients

New mathematical model: Punishments and rewards teach AI agents to make the right decisions

Engineers help artificial intelligence to learn more safely in the real world

A reinforcement learning framework to enhance the ramp merging capabilities of autonomous vehicles

Scientists design learning-enabled safe control for systems in uncertain environments

Recommended for you

Engineers develop technique to pinpoint nanoscale 'hot spots' in electronics to improve their longevity

Researchers create insect-inspired autonomous navigation strategy for tiny, lightweight robots

Creating and verifying stable AI-controlled robotic systems in a rigorous and flexible way

New system enables intuitive teleoperation of a robotic manipulator in real-time

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Your Privacy