February 21, 2024

Charting new paths in AI learning: How changing two variables leads to vastly different outcomes

by Ecole Polytechnique Federale de Lausanne

In an era where artificial intelligence (AI) is transforming industries from health care to finance, understanding how these digital brains learn is more crucial than ever. Now, two researchers from EPFL, Antonia Sclocchi and Matthieu Wyart, have shed light on this process, focusing on a popular method known as Stochastic Gradient Descent (SGD).

At the heart of an AI's learning process are algorithms: sets of rules that guide AIs to improve based on the data they're fed. SGD is one of these algorithms, like a guiding star that helps AIs navigate a complex landscape of information to find the best possible solutions a bit at a time.

However, not all learning paths are equal. The EPFL study, published in Proceedings of the National Academy of Sciences reveals how different approaches to SGD can significantly affect the efficiency and quality of AI learning. Specifically, the researchers examined how changing two key variables can lead to vastly different learning outcomes.

The two variables were the size of the data samples the AI learns from at a single time (this is called the "batch size") and the magnitude of its learning steps (this is the "learning rate"). They identified three distinct scenarios ("regimes"), each with unique characteristics that affect the AI's learning process differently.

In the first scenario, like exploring a new city without a map, the AI takes small, random steps, using small batches and high learning rates, which allows it to stumble upon solutions it might not have found otherwise. This approach is beneficial for exploring a wide range of possibilities but can be chaotic and unpredictable.

The second scenario involves the AI taking a significant initial step based on its first impression, using larger batches and learning rates, followed by smaller, exploratory steps. This regime can speed up the learning process but risks missing out on better solutions that a more cautious approach might discover.

The third scenario is like using a detailed map to navigate directly to known destinations. Here, the AI uses large batches and smaller learning rates, making its learning process more predictable and less prone to random exploration. This approach is efficient but may not always lead to the most creative or optimal solutions.

The study offers a deeper understanding of the tradeoffs involved in training AI models, and highlights the importance of tailoring the learning process to the particular needs of each application. For example, medical diagnostics might benefit from a more exploratory approach where accuracy is paramount, while voice recognition might favor more direct learning paths for speed and efficiency.

More information: Antonio Sclocchi et al, On the different regimes of stochastic gradient descent, Proceedings of the National Academy of Sciences (2024). DOI: 10.1073/pnas.2316301121

Journal information: Proceedings of the National Academy of Sciences

Provided by Ecole Polytechnique Federale de Lausanne

Citation: Charting new paths in AI learning: How changing two variables leads to vastly different outcomes (2024, February 21) retrieved 27 April 2024 from https://techxplore.com/news/2024-02-paths-ai-variables-vastly-outcomes.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New research shows students' knowledge and perceptions of active learning declined during pandemic-era teaching

21 shares

Feedback to editors

Proof of concept study shows path to easier recycling of solar modules

16 hours ago

New circuit boards can be repeatedly recycled

17 hours ago

Researchers develop an automated benchmark for language-based task planners

17 hours ago

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

17 hours ago

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

18 hours ago

Researchers outline path forward for tandem solar cells

19 hours ago

Researcher develop high-performance amorphous p-type oxide semiconductor

20 hours ago

Scientists create new atomic clock that is both ultra-precise and sturdy

20 hours ago

A framework to compare lithium battery testing data and results during operation

23 hours ago

New approach could make reusing captured carbon far cheaper, less energy-intensive

Apr 26, 2024

Load comments (0)

Charting new paths in AI learning: How changing two variables leads to vastly different outcomes

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

New research shows students' knowledge and perceptions of active learning declined during pandemic-era teaching

A theoretical model for reliability assessment of machine learning systems

Online machine learning models accurately predict wastewater influent flow rate

When deep learning meets active learning in the era of foundation models

AI tackles the ABCD of skin cancer

Study looks to put engaged-learning to test

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Phys.org

Medical Xpress

Science X

Charting new paths in AI learning: How changing two variables leads to vastly different outcomes

Proof of concept study shows path to easier recycling of solar modules

New circuit boards can be repeatedly recycled

Researchers develop an automated benchmark for language-based task planners

Built-in bionic computing: Researchers develop method to control pneumatic artificial muscles

Custom-made catalyst leads to longer-lasting and more sustainable green hydrogen production

Researchers outline path forward for tandem solar cells

Researcher develop high-performance amorphous p-type oxide semiconductor

Scientists create new atomic clock that is both ultra-precise and sturdy

A framework to compare lithium battery testing data and results during operation

New approach could make reusing captured carbon far cheaper, less energy-intensive

Related Stories

New research shows students' knowledge and perceptions of active learning declined during pandemic-era teaching

A theoretical model for reliability assessment of machine learning systems

Online machine learning models accurately predict wastewater influent flow rate

When deep learning meets active learning in the era of foundation models

AI tackles the ABCD of skin cancer

Study looks to put engaged-learning to test

Recommended for you

Researchers develop an automated benchmark for language-based task planners

Study explores why human-inspired machines can be perceived as eerie

Adobe's VideoGigaGAN uses AI to make blurry videos sharp and clear

Emulating neurodegeneration and aging in artificial intelligence systems

Microsoft claims that small, localized language models can be powerful as well

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Your Privacy