July 28, 2021 report

Machine learning applications need less data than has been assumed

by Bob Yirka , Tech Xplore

A combined team of researchers from the University of British Columbia and the University of Alberta has found that at least some machine learning applications can learn from far fewer examples than has been assumed. In their paper published in the journal Nature Machine Intelligence, the group describes testing they carried out with machine learning applications created to predict certain types of molecular structures.

Machine learning can be used in a wide variety of applications—one of the most well-known is learning to spot people or objects in photographs. Such applications typically require huge amounts of data for training. In this new effort, the researchers have found that in some instances, machine learning applications do not need such huge amounts of data to be useful.

The researchers were initially looking for ways to predict the structure of illegal designer drugs. Doing so would help medical researchers prepare for them should people consuming them begin showing up in hospital emergency rooms. The team realized their job would be much easier if they could use a machine learning application; unfortunately, there are only 1,700 known designer drugs that could be used to train such a system. Undaunted, the researchers wondered if it might be possible to figure out just how much data would be required for such a system to be useful, or if there might be a way to modify an algorithm or the data that was used to train it to allow for less available data.

To find out, the researchers created 8,500 models and trained each of them on differently sized datasets taken from the 500,000 molecules in the simplified molecular-input line-entry system. Then they used the models to predict possible molecular types. In so doing, they found many of the models worked quite well with the limited dataset. They also found that most of them began to level off in their predictive abilities after just 10,000 to 20,000 data records. When they used the best-performing models to conduct their initial research, they found the results were correct approximately 50% of the time.

More information: Michael A. Skinnider et al, Chemical language models enable navigation in sparsely populated chemical space, Nature Machine Intelligence (2021). DOI: 10.1038/s42256-021-00368-1

Journal information: Nature Machine Intelligence

Citation: Machine learning applications need less data than has been assumed (2021, July 28) retrieved 25 April 2024 from https://techxplore.com/news/2021-07-machine-applications-assumed.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Chemists show how bias can crop up in machine learning algorithm results

546 shares

Feedback to editors

Study explores why human-inspired machines can be perceived as eerie

1 hour ago

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

15 hours ago

Study shows potential of super grids when hurricanes overshadow solar panels

16 hours ago

Rubber-like stretchable energy storage device fabricated with laser precision

16 hours ago

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

16 hours ago

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

18 hours ago

Why can't robots outrun animals?

18 hours ago

Virtual sensors help aerial vehicles stay aloft when rotors fail

19 hours ago

New insights lead to better next-gen solar cells

19 hours ago

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

19 hours ago

Load comments (0)

Machine learning applications need less data than has been assumed

Study explores why human-inspired machines can be perceived as eerie

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

New insights lead to better next-gen solar cells

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Chemists show how bias can crop up in machine learning algorithm results

Machine learning aids in simulating dynamics of interacting atoms

Combining machine learning with smartphone tracking data to forecast the spread of the flu

Researchers use machine learning to rank cancer drugs in order of efficacy

Machine learning at speed with in-network aggregation

Researchers build models using machine learning technique to enhance predictions of COVID-19 outcomes

Study explores why human-inspired machines can be perceived as eerie

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Microsoft claims that small, localized language models can be powerful as well

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

Emulating neurodegeneration and aging in artificial intelligence systems

A new framework to generate human motions from language prompts

Phys.org

Medical Xpress

Science X

Machine learning applications need less data than has been assumed

Study explores why human-inspired machines can be perceived as eerie

High-energy-density capacitors with 2D nanomaterials could significantly enhance energy storage

Study shows potential of super grids when hurricanes overshadow solar panels

Rubber-like stretchable energy storage device fabricated with laser precision

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

New tech could help traveling VR gamers experience 'ludicrous speed' without motion sickness

Why can't robots outrun animals?

Virtual sensors help aerial vehicles stay aloft when rotors fail

New insights lead to better next-gen solar cells

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Related Stories

Chemists show how bias can crop up in machine learning algorithm results

Machine learning aids in simulating dynamics of interacting atoms

Combining machine learning with smartphone tracking data to forecast the spread of the flu

Researchers use machine learning to rank cancer drugs in order of efficacy

Machine learning at speed with in-network aggregation

Researchers build models using machine learning technique to enhance predictions of COVID-19 outcomes

Recommended for you

Study explores why human-inspired machines can be perceived as eerie

Scientists pioneer new X-ray microscopy method for data analysis 'on the fly'

Microsoft claims that small, localized language models can be powerful as well

On the trail of deepfakes, researchers identify 'fingerprints' of AI-generated video

Emulating neurodegeneration and aging in artificial intelligence systems

A new framework to generate human motions from language prompts

Your Privacy