September 19, 2023

Machine learning models can produce reliable results even with limited training data

by Sarah Collins, University of Cambridge

Researchers have determined how to build reliable machine learning models that can understand complex equations in real-world situations while using far less training data than is normally expected.

The researchers, from the University of Cambridge and Cornell University, found that for partial differential equations—a class of physics equations that describe how things in the natural world evolve in space and time—machine learning models can produce reliable results even when they are provided with limited data.

Their results, reported in the Proceedings of the National Academy of Sciences, could be useful for constructing more time- and cost-efficient machine learning models for applications such as engineering and climate modeling.

Most machine learning models require large amounts of training data before they can begin returning accurate results. Traditionally, a human will annotate a large volume of data, such as a set of images, to train the model.

"Using humans to train machine learning models is effective, but it's also time-consuming and expensive," said first author Dr. Nicolas Boullé, from the Isaac Newton Institute for Mathematical Sciences. "We're interested to know exactly how little data we actually need to train these models and still get reliable results."

Other researchers have been able to train machine learning models with a small amount of data and get excellent results, but how this was achieved has not been well explained. For their study, Boullé and his co-authors, Diana Halikias and Alex Townsend from Cornell University, focused on partial differential equations (PDEs).

"PDEs are like the building blocks of physics: they can help explain the physical laws of nature, such as how the steady state is held in a melting block of ice," said Boullé, who is an INI-Simons Foundation Postdoctoral Fellow. "Since they are relatively simple models, we might be able to use them to make some generalizations about why these AI techniques have been so successful in physics."

The researchers found that PDEs that model diffusion have a structure that is useful for designing AI models. "Using a simple model, you might be able to enforce some of the physics that you already know into the training data set to get better accuracy and performance," said Boullé.

The researchers constructed an efficient algorithm for predicting the solutions of PDEs under different conditions by exploiting the short- and long-range interactions these equations describe. This allowed them to build mathematical guarantees into the model and determine exactly how much training data was required to end up with a robust model.
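As a rough illustration of why long-range interactions can be learned from very little data, the toy sketch below looks at a one-dimensional steady-state diffusion (Poisson) problem. It is not the researchers' algorithm: the grid size, the finite-difference discretization, and every name in the code (`poisson_matrix`, `G12_est`, and so on) are assumptions chosen for this example. In this simple setting, the part of the solution operator that links the left half of the domain to the right half has rank one, so two forcing-and-response pairs are enough to recover it.

```python
import numpy as np

def poisson_matrix(n):
    """Finite-difference matrix for -u'' = f on (0, 1) with u(0) = u(1) = 0."""
    h = 1.0 / (n + 1)
    main = 2.0 * np.ones(n)
    off = -1.0 * np.ones(n - 1)
    return (np.diag(main) + np.diag(off, 1) + np.diag(off, -1)) / h**2

n = 64          # number of interior grid points (assumed for the example)
m = n // 2      # split the domain into a left half and a right half
A = poisson_matrix(n)
rng = np.random.default_rng(0)

# Training pair 1: random forcing on the right half only.
f_right = np.zeros(n)
f_right[m:] = rng.random(n - m)
u_from_right = np.linalg.solve(A, f_right)

# Training pair 2: random forcing on the left half only.
f_left = np.zeros(n)
f_left[:m] = rng.random(m)
u_from_left = np.linalg.solve(A, f_left)

# The solution operator G = A^{-1} is symmetric, and its off-diagonal block
# G12 (response on the left to forcing on the right) has rank one here.
u1 = u_from_right[:m]        # equals G12 @ f_right[m:]
u2 = u_from_left[m:]         # equals G12.T @ f_left[:m]
scale = f_left[:m] @ u1      # equals f_left[:m] @ G12 @ f_right[m:]
G12_est = np.outer(u1, u2) / scale

# Compare with the exact block, computed here only to check the estimate.
G12_true = np.linalg.inv(A)[:m, m:]
rel_err = np.linalg.norm(G12_est - G12_true) / np.linalg.norm(G12_true)
print(f"relative error of the recovered block: {rel_err:.2e}")
```

The short-range part of the operator (the blocks on the diagonal) does not have this simple structure and needs more care, which is where a method with mathematical guarantees matters.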

"It depends on the field, but for physics, we found that you can actually do a lot with a very limited amount of data," said Boullé. "It's surprising how little data you need to end up with a reliable model. Thanks to the mathematics of these equations, we can exploit their structure to make the models more efficient."

The researchers say that their techniques will allow data scientists to open the 'black box' of many machine learning models and design new ones that can be interpreted by humans, although further research is still needed.

"We need to make sure that models are learning the right things, but machine learning for physics is an exciting field—there are lots of interesting math[s] and physics questions that AI can help us answer," said Boullé.

More information: Nicolas Boullé et al, Elliptic PDE learning is provably data-efficient, Proceedings of the National Academy of Sciences (2023). DOI: 10.1073/pnas.2303904120

Journal information: Proceedings of the National Academy of Sciences

Provided by University of Cambridge

Citation: Machine learning models can produce reliable results even with limited training data (2023, September 19) retrieved 29 June 2024 from https://techxplore.com/news/2023-09-machine-reliable-results-limited.html

