July 25, 2024

Using AI to train AI: Model collapse could be coming for LLMs, say researchers

The high-level description of the feedback mechanism in the learning process. Credit: *Nature* (2024). DOI: 10.1038/s41586-024-07566-y

Using AI-generated datasets to train future generations of machine learning models may pollute their output, a phenomenon known as model collapse, according to a new paper published in Nature. The research shows that within a few generations, original content is replaced by unrelated nonsense, which underscores the importance of using reliable data to train AI models.

Generative AI tools such as large language models (LLMs) have grown in popularity and have been primarily trained using human-generated inputs. However, as these AI models continue to proliferate across the Internet, computer-generated content may be used to train other AI models—or themselves—in a recursive loop.

Ilia Shumailov and colleagues present mathematical models to illustrate how AI models may experience model collapse. The authors demonstrate that an AI may overlook certain outputs (for example, less common lines of text) in training data, causing it to train itself on only a portion of the dataset.
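The loss of less common outputs can be illustrated with a toy simulation. The sketch below is not the authors' model: it assumes a hypothetical 50-token vocabulary with a Zipf-like distribution, and a "model" that simply re-estimates token frequencies from a finite sample of the previous generation's output. The names `next_generation` and `simulate`, the vocabulary, and the sample size are all illustrative assumptions.

```python
import random
from collections import Counter


def next_generation(probs, sample_size, rng):
    """Sample from `probs`, then re-estimate probabilities from that sample."""
    tokens = list(probs)
    weights = [probs[t] for t in tokens]
    sample = rng.choices(tokens, weights=weights, k=sample_size)
    counts = Counter(sample)
    # Tokens that were never sampled get no entry, so they can never return.
    return {t: c / sample_size for t, c in counts.items()}


def simulate(generations=10, sample_size=200, seed=0):
    """Return the number of distinct surviving tokens after each generation."""
    rng = random.Random(seed)
    # Hypothetical "human" data: 50 tokens with Zipf-like frequencies.
    raw = {f"tok{i}": 1.0 / (i + 1) for i in range(50)}
    total = sum(raw.values())
    probs = {t: w / total for t, w in raw.items()}
    support_sizes = [len(probs)]
    for _ in range(generations):
        probs = next_generation(probs, sample_size, rng)
        support_sizes.append(len(probs))
    return support_sizes


if __name__ == "__main__":
    print(simulate())
```

In this toy setting a token that is missed once has zero estimated probability from then on, so the vocabulary can only shrink from one generation to the next, and the rare tokens are the first to go.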

Shumailov and colleagues also investigated how AI models responded to a training dataset that was predominantly created with artificial intelligence. They found that feeding a model AI-generated data causes successive generations to degrade in their ability to learn, eventually leading to model collapse.
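A second toy sketch shows how errors can compound across generations. It assumes a deliberately simple setting that is not taken from the article: each generation fits a one-dimensional Gaussian to a small sample drawn from the previous generation's fitted Gaussian. The function name `recursive_gaussian_fit`, the sample size, and the number of generations are illustrative choices.

```python
import random
import statistics


def recursive_gaussian_fit(generations=500, sample_size=10, seed=0):
    """Repeatedly fit a Gaussian to samples drawn from the previous fit.

    Returns the fitted standard deviation after each generation,
    starting with the original value of 1.0.
    """
    rng = random.Random(seed)
    mu, sigma = 0.0, 1.0  # generation 0: the original "human" distribution
    stds = [sigma]
    for _ in range(generations):
        # Draw a finite training set from the current model.
        sample = [rng.gauss(mu, sigma) for _ in range(sample_size)]
        # Fit the next model using only that synthetic sample.
        mu = statistics.mean(sample)
        sigma = statistics.stdev(sample)
        stds.append(sigma)
    return stds


if __name__ == "__main__":
    stds = recursive_gaussian_fit()
    print(f"initial std: {stds[0]}, final std: {stds[-1]}")
```

With small samples, the estimation error from each fit feeds into the next one, and over many generations the fitted spread tends to drift toward zero, so the final model describes far less variety than the original data contained.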

Nearly all of the recursively trained language models they tested tended to display repeating phrases. For example, in a test that used text about medieval architecture as the original input, the output by the ninth generation was a list of jackrabbits.

The authors propose that model collapse is an inevitable outcome of AI models that use training datasets created by previous generations. Shumailov and colleagues suggest that training a model with AI-generated data is not impossible, but the filtering of that data must be taken seriously.

At the same time, tech firms that rely on human-generated content may be able to train AI models that are more effective than those of their competitors.

More information: Ilia Shumailov et al, AI models collapse when trained on recursively generated data, Nature (2024). DOI: 10.1038/s41586-024-07566-y

Journal information: Nature

Provided by Nature Publishing Group

Citation: Using AI to train AI: Model collapse could be coming for LLMs, say researchers (2024, July 25) retrieved 25 July 2024 from https://techxplore.com/news/2024-07-ai-collapse-llms.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
