February 21, 2024

In-depth analysis: Automated machine learning from the perspective of bilevel optimization

In-depth analysis: AutoML (Automated Machine Learning) from the perspective of bilevel optimization — (a) Illustration of key issues of ML task. (b) Formulation of AutoML paradigm from the perspective of bilevel optimization. Credit: Science China Press

Recently, professors Risheng Liu from Dalian University of Technology and Zhouchen Lin from Peking University collaborated on an opinion article published in the National Science Review (NSR). Their article delves deeply into AutoML from the perspective of bilevel optimization, achieving unified modeling of various AutoML tasks while exploring challenges and opportunities. This article will be included in the NSR's special topic on "Automating Machine Learning."

Generally, AutoML requires the automation of three key tasks, including meta-feature learning, neural network architecture search, and hyperparameter optimization. Bilevel Optimization (BLO) is an effective mathematical tool for modeling these tasks, providing a unified AutoML framework. This framework achieves the core objective of AutoML: constructing high-performance models with minimal human intervention.

Specifically, in the upper-level optimization, the core variables are "meta-parameters," aiming to seek the optimal "methodology" to achieve performance optimization of machine learning models on the validation set (such as meta-features, network structures, and hyperparameters). On the other hand, the core variables in lower-level optimization are "model parameters," focusing on optimizing model performance on the training set.

Currently, ML/AutoML technologies, represented by gradient-based BLO algorithms, have gradually gained prominence. However, they still face numerous challenges in practical applications.

For instance, some algorithms heavily rely on the singularity and convexity of lower-level problems, limiting their practicality in real-world scenarios. Additionally, when employing approximate substitution methods in practical applications, there is a lack of theoretical analysis regarding the rigorous convergence of algorithms.

In the future, the challenges faced by BLO in the field of AutoML and promising research directions mainly include the following aspects:

Compute Acceleration: As the scale of datasets expands and task complexity grows, there is an urgent need to accelerate the computational speed of BLO algorithms in handling large-scale, high-dimensional AutoML tasks. Parallel/distributed computing technologies could serve as an effective approach to address this issue.

Theoretical Breakthroughs: Presently, gradient-based BLO methods heavily rely on stringent theoretical assumptions, such as the assumption of submodularity and convexity in lower-level problems. To meet the demands of real-world applications, there is a necessity to construct new theoretical analysis frameworks and efficient computational methods to handle better more challenging practical scenarios involving non-convexity and discreteness.

Optimization-Derived Learning: From the new perspective of bi-level optimization, we can explore disruptive AutoML technologies that integrate Simulation Learning Methodology (SLeM), especially when integrated with large models. This exploration involves delving deeper into the underlying logic of AutoML to design more efficient and precise learning strategies.

In summary, this article has achieved unified modeling of different AutoML tasks from the perspective of BLO. It extensively analyzes the current state and future directions of AutoML centered around the development of BLO algorithms. The novel viewpoints presented in this article contribute to advancing AutoML, empowering artificial intelligence technology to progress toward more intelligent and efficient realms.

More information: Risheng Liu et al, Bilevel optimization for automated machine learning: a new perspective on framework and algorithm, National Science Review (2023). DOI: 10.1093/nsr/nwad292

Provided by Science China Press

Citation: In-depth analysis: Automated machine learning from the perspective of bilevel optimization (2024, February 21) retrieved 29 June 2024 from https://techxplore.com/news/2024-02-depth-analysis-automated-machine-perspective.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Cracking open the black box of automated machine learning

8 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

In-depth analysis: Automated machine learning from the perspective of bilevel optimization

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Cracking open the black box of automated machine learning

Natural evolutionary strategy boosts quantum computing performance

General deep learning framework for emissivity engineering

A new approach to overcome multi-model forgetting in deep neural networks

Evolutionary reinforcement learning promises further advances in machine learning

When AI is made by AI, results are impressive

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Phys.org

Medical Xpress

Science X

In-depth analysis: Automated machine learning from the perspective of bilevel optimization

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Cracking open the black box of automated machine learning

Natural evolutionary strategy boosts quantum computing performance

General deep learning framework for emissivity engineering

A new approach to overcome multi-model forgetting in deep neural networks

Evolutionary reinforcement learning promises further advances in machine learning

When AI is made by AI, results are impressive

Recommended for you

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Your Privacy