
The solution space of the spherical negative perceptron model is star-shaped, researchers find

Sketch of the negative perceptron's solution landscape satisfying the required constraints (blue and white) or with nonzero energy (orange). Credit: Annesi et al.

Recent numerical studies of neural networks have shown that the solutions typically reached by modern machine learning algorithms lie in complex, extended regions of the loss landscape. Within these regions, zero-energy paths can be traced between pairs of distant solutions.

Researchers at Bocconi University, Politecnico di Torino and Bocconi Institute for Data Science and Analytics recently carried out a study aimed at exploring these regions using one of the simplest non-convex neural network models, known as the negative spherical perceptron. Their paper, published in Physical Review Letters, found that solutions of this neural network model are arranged in a star-shaped geometry.

"Recent research on the landscape of neural networks has shown that independent stochastic gradient descent (SGD) trajectories often land in the same low-loss basin, and often no barrier is found along the linear interpolation between them," Luca Saglietti, co-author of the paper, told Tech Xplore. "We started to think if we could reproduce analytically this phenomenology in a simple model of neural network."

In their paper, Saglietti and his colleagues introduced an analytical method that can be used to compute energy barriers along the linear interpolation between pairs of solutions. As part of their recent study, they applied this method to the so-called spherical negative perceptron, which is essentially a paradigmatic toy model of a neural network.

"This model is particularly interesting because it is both continuous and non-convex," Clarissa Lauditi, co-author of the paper said. "It involves a set of parameters (called weights) that need to be tuned in such a way to satisfy a training set of input-output associations. In the negative perceptron the constraints can be relaxed, and in this overparameterized regime the geometry of the solution space becomes surprisingly rich."

The researchers probed this model and its rich solution landscape by characterizing the energy barriers along the linear interpolation between pairs of solutions using the so-called replica method, a well-established technique commonly applied in statistical physics.
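
As a purely numerical illustration of the kind of probe described above (and not the analytical replica computation used in the paper), the sketch below finds two zero-energy configurations with a crude hinge-style gradient descent and then counts violated constraints along the re-normalized straight line joining them; the parameter values and helper names are hypothetical choices made only for this example.

```python
import numpy as np

rng = np.random.default_rng(1)
N, P, kappa = 200, 100, -0.5          # illustrative sizes and negative margin

xi = rng.standard_normal((P, N))      # random patterns
y = rng.choice([-1.0, 1.0], size=P)   # random labels

def energy(w):
    """Number of violated constraints for weights w on the sphere |w|^2 = N."""
    return int(np.sum(y * (xi @ w) / np.sqrt(N) < kappa))

def find_solution(seed, steps=20000, lr=0.05):
    """Crude hinge-style gradient descent, re-projecting onto the sphere at each step."""
    w = np.random.default_rng(seed).standard_normal(N)
    w *= np.sqrt(N) / np.linalg.norm(w)
    for _ in range(steps):
        stab = y * (xi @ w) / np.sqrt(N)
        violated = stab < kappa
        if not violated.any():         # zero energy reached: w is a solution
            break
        grad = -(y[violated, None] * xi[violated]).sum(axis=0) / np.sqrt(N)
        w -= lr * grad
        w *= np.sqrt(N) / np.linalg.norm(w)
    return w

w_a, w_b = find_solution(2), find_solution(3)

# Energy profile along the (re-normalized) linear interpolation between the two solutions
ts = np.linspace(0.0, 1.0, 21)
profile = []
for t in ts:
    w_t = (1 - t) * w_a + t * w_b
    w_t *= np.sqrt(N) / np.linalg.norm(w_t)
    profile.append(energy(w_t))
print("energies along the path:", profile)
print("barrier height:", max(profile))
```

In the star-shaped picture described below, a generic pair of "tip" solutions shows a nonzero barrier along such a line, whereas a "core" solution can be joined to almost all other solutions at zero energy.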

"Unexpectedly, we unveiled that the solutions are arranged in a star-shaped geometry," Enrico Malatesta, co-author of the paper said.

"Most of them are located on the tips of the star, but there exists a subset of solutions (located in the core of the star) that are connected through a straight line to almost all the other solutions. We have found that this shape actually impacts the behavior of the training algorithms: common algorithms used in have a bias towards the solutions located in the core of the star. Those solutions have desirable properties, e.g. better robustness and generalization capabilities."

The recent work by this research team offers new insight into the geometry of the solution space of the spherical negative perceptron. The team found that the arrangement of the solutions affects the behavior of training algorithms, which preferentially select solutions located at the core of the star-like geometry, and that these core solutions tend to exhibit advantageous characteristics.

"Our future research aims to understand to what extent the star-shaped geometry may be a universal property of overparameterized and weakly constrained ," Gabriele Perugini, co-author of the paper added. "The possibility that also completely different non-convex optimization problems can develop simple connectivity properties when overparameterized is certainly intriguing."

More information: Brandon Livio Annesi et al, Star-Shaped Space of Solutions of the Spherical Negative Perceptron, Physical Review Letters (2023). DOI: 10.1103/PhysRevLett.131.227301.

Journal information: Physical Review Letters

© 2024 Science X Network

Citation: The solution space of the spherical negative perceptron model is star-shaped, researchers find (2024, January 17) retrieved 27 April 2024 from https://techxplore.com/news/2024-01-solution-space-spherical-negative-perceptron.html
