October 30, 2018 feature

A new dynamic ensemble active learning method based on a non-stationary bandit

by Ingrid Fadelli , Tech Xplore

Researchers at the University of Edinburgh, University College London (UCL) and Nara Institute of Science and Technology have developed a new ensemble active learning approach based on a non-stationary multi-armed bandit and an expert advice algorithm. Their method, presented in a paper pre-published on arXiv, could reduce the time and effort invested in the manual annotation of data.

"Conventional supervised machine learning is data-hungry, and labelled data can be a bottleneck when data annotation is expensive," Timothy Hospedales, one of the researchers who carried out the study told Tech Xplore. "Active learning supports supervised learning by predicting the most informative data points to annotate so that good models can be trained with a reduced annotation budget."

Active learning is a particular area of machine learning in which a learning algorithm can actively choose the data it wants to learn from. This typically results in better performance, with significantly smaller training datasets.

Researchers have developed a variety of active learning algorithms that could reduce the costs of annotation, but so far, none of these solutions has proved to be effective for all problems. Other studies have hence used bandit algorithms to identify the best active learning algorithm for a given dataset.

"The term 'bandit' refers to a multi-armed bandit slot machine, which is a convenient mathematical abstraction for exploration/exploitation problems," Hospedales explained. "A bandit algorithm finds a good balance between effort spent on exploring all slot machines to find out which is paying out most, with effort spent on exploiting the best slot machine found so far."

The efficacy of active learning algorithms varies both across problems and over time at different stages of learning. This observation is analogous to playing slot machines, where payout probability changes over time.

"The aim of our study was to develop a new bandit algorithm that improves performance by accounting for this aspect of the active learning problem," Hospedales said.

To tackle this limitation, the researchers proposed a dynamic ensemble active learner (DEAL) based on a non-stationary bandit. This learner builds up an estimate of each active learning algorithm's efficacy online, based on the reward (importance-weighted accuracy) obtained after every annotation of data.

"It does this by using the preference expressed for that point by each active learning algorithm," Kunkun Pang, another researcher who carried out the study, told Tech Xplore. "To deal with the issue of the changing efficacy of active learners over time, we periodically restart the learning algorithm to refresh its active learner preference. With this capability, if the most effective active learning algorithm changes between early and late stages of learning, we can quickly adapt to this change."

The researchers tested their approach on 13 popular datasets, achieving highly encouraging results. Their DEAL algorithm has a mathematical performance guarantee, meaning that it there is a high degree of confidence in how well it will work.

"The guarantee relates the performance of our algorithm, which is that of an ideal oracle that always knows the right choice for the active learner," Hospedales explained. "It provides a bound on the performance gap between such a best-case algorithm and ours."

The empirical evaluation carried out by Hospedales and his colleagues confirmed that their DEAL algorithm improves active learning performance on a suite of benchmarks. It does this by continuously identifying the most effective active learning algorithm for different tasks and at different stages of training.

"Today, while active learning is appealing, its impact on machine learning practices is limited due to the hassle of matching algorithms to problems and to stages of learning," Hospedales said. "DEAL eliminates this difficulty and provides an approach to tackle many problems and all stages of learning. By making active learning easier to use, we hope it can have a bigger impact on reducing annotation cost in machine learning practice."

Despite the very promising results, the technique devised by the researchers still has a significant limitation. DEAL does all the learning within a single problem and this results in a 'cold start,' meaning that the algorithm approaches all new problems with a blank slate.

"In ongoing work, we are learning how to annotate on many different problems and eventually transfer this knowledge to a new problem, in order to perform effective annotation immediately with no warm-up requirements," Pang said. "Our preliminary work on this topic has been published and also won the Best Paper prize at ICML 2018 AutoML workshop."

More information: Dynamic ensemble active learning: A non-stationary bandit with expert advice. arXiv: 1810.07778 [cs.LG]. arxiv.org/abs/1810.07778

Meta-Learning Transferable Active Learning Policies by Deep Reinforcement Learning. arxiv:1806.04798 [cs.LG] arxiv.org/abs/1806.04798

Citation: A new dynamic ensemble active learning method based on a non-stationary bandit (2018, October 30) retrieved 17 July 2024 from https://techxplore.com/news/2018-10-dynamic-ensemble-method-based-non-stationary.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New algorithm limits bias in machine learning

59 shares

Feedback to editors

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

13 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

15 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

17 hours ago

Large language models make human-like reasoning mistakes, researchers find

18 hours ago

Unveiling a new class of synthetic fuels

18 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

18 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

19 hours ago

New system enables intuitive teleoperation of a robotic manipulator in real-time

22 hours ago

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

Jul 16, 2024

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Jul 15, 2024

Load comments (0)

A new dynamic ensemble active learning method based on a non-stationary bandit

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

New algorithm limits bias in machine learning

New algorithm can more quickly predict LED materials

Restoring balance in machine learning datasets

Improving machine learning with an old approach

Baidu researchers develop a new auto-tuning framework for autonomous vehicles

Machine learning used for helping farmers select optimal products suited for their operation

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Phys.org

Medical Xpress

Science X

A new dynamic ensemble active learning method based on a non-stationary bandit

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Related Stories

New algorithm limits bias in machine learning

New algorithm can more quickly predict LED materials

Restoring balance in machine learning datasets

Improving machine learning with an old approach

Baidu researchers develop a new auto-tuning framework for autonomous vehicles

Machine learning used for helping farmers select optimal products suited for their operation

Recommended for you

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Your Privacy