December 2, 2021

Research shows how statistics can aid in the fight against misinformation

An American University math professor and his team have created a statistical model that can detect misinformation in social media posts. The model also avoids the black-box problem common in machine learning.

Machine learning, which relies on algorithms and computer models, is playing a growing role in helping to stop the spread of misinformation. A main challenge for scientists, however, is the black box of unknowability: researchers often don't understand how the machine arrives at the same decision as its human trainers.

Using a Twitter dataset with misinformation tweets about COVID-19, Zois Boukouvalas, assistant professor in AU's Department of Mathematics and Statistics, College of Arts and Sciences, shows how statistical models can detect misinformation in social media during events like a pandemic or a natural disaster. In newly published research, Boukouvalas and his colleagues, including AU student Caitlin Moroney and Computer Science Prof. Nathalie Japkowicz, also show how the model's decisions align with those made by humans.

"We would like to know what a machine is thinking when it makes decisions, and how and why it agrees with the humans that trained it," Boukouvalas said. "We don't want to block someone's social media account because the model makes a biased decision."

Boukouvalas's method is a type of machine learning using statistics. It's not as popular a field of study as deep learning, the complex, multi-layered type of machine learning and artificial intelligence. Statistical models are effective and provide another, somewhat untapped, way to fight misinformation, Boukouvalas said.

For a testing set of 112 real and misinformation tweets, the model achieved high prediction performance, classifying nearly 90 percent of them correctly. (Using such a compact dataset was an efficient way to verify how the method detected the misinformation tweets.)
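As a rough illustration of what an accuracy figure like this means, the sketch below computes accuracy as the share of tweets whose predicted label matches the human label. The article does not report the exact number of correct predictions or the balance between real and misinformation tweets, so the even split and the 100 correct predictions used here are assumptions chosen only to land near 90 percent. The function name `accuracy` is likewise hypothetical.

```python
def accuracy(true_labels, predicted_labels):
    """Return the fraction of predictions that match the human-assigned labels."""
    if len(true_labels) != len(predicted_labels):
        raise ValueError("label lists must have the same length")
    if not true_labels:
        raise ValueError("label lists must not be empty")
    correct = sum(t == p for t, p in zip(true_labels, predicted_labels))
    return correct / len(true_labels)


# Assumed, illustrative test set: 112 tweets, split evenly between classes.
# These counts are NOT taken from the paper.
true_labels = ["misinformation"] * 56 + ["real"] * 56

# Start from perfect predictions, then flip 12 of them to simulate errors,
# leaving 100 of 112 correct.
predicted_labels = list(true_labels)
for i in range(12):
    predicted_labels[i] = "real"

test_accuracy = accuracy(true_labels, predicted_labels)
print(f"Accuracy: {test_accuracy:.1%}")
```

With only 112 tweets, each misclassified tweet moves the accuracy by almost a full percentage point, which is one reason a small test set is better suited to checking how a method behaves than to ranking competing methods.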

"What's significant about this finding is that our model achieved accuracy while offering transparency about how it detected the tweets that were misinformation," Boukouvalas added. "Deep learning methods cannot achieve this kind of accuracy with transparency."

Before testing the model on the dataset, researchers first prepared to train the model. Models are only as good as the information humans provide. Human biases get introduced (one of the reasons behind bias in facial recognition technology) and black boxes get created.

Researchers carefully labeled the tweets as either misinformation or real, using a set of pre-defined rules about the language of misinformation to guide their choices. They also considered the nuances of human language and the linguistic features linked to misinformation, such as heavier use of proper nouns, punctuation and special characters. A socio-linguist, Prof. Christine Mallinson of the University of Maryland Baltimore County, reviewed the tweets for writing styles associated with misinformation, bias, and less reliable sources in news media. Then it was time to train the model.
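To make the linguistic features mentioned above more concrete, here is a minimal sketch that counts punctuation, special characters and capitalized words in a post. It is not the researchers' feature pipeline. The function `style_features` is hypothetical, the sample text is invented, and counting capitalized tokens is only a crude stand-in for proper-noun detection, which would normally rely on a part-of-speech tagger.

```python
import string


def style_features(text):
    """Count simple surface features of a post (illustrative only)."""
    tokens = text.split()

    # Crude proxy for proper nouns: capitalized tokens other than the first one.
    capitalized = sum(
        1 for i, tok in enumerate(tokens) if i > 0 and tok[:1].isupper()
    )

    # ASCII punctuation marks such as '!', '#', '?' and '.'.
    punctuation = sum(1 for ch in text if ch in string.punctuation)

    # Special characters: anything that is not a letter, digit, whitespace
    # or ASCII punctuation (for example symbols and emoji).
    special = sum(
        1
        for ch in text
        if not ch.isalnum() and not ch.isspace() and ch not in string.punctuation
    )

    return {
        "capitalized_tokens": capitalized,
        "punctuation": punctuation,
        "special_characters": special,
    }


# Invented example text, not a tweet from the study's dataset.
sample = "BREAKING!!! Doctors in Paris hide the #truth ★★"
features = style_features(sample)
print(features)
```

Counts like these could serve as inputs to a statistical model alongside the human labels. Which features matter, and how they should be weighted, is what training is meant to work out.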

"Once we add those inputs into the model, it is trying to understand the underlying factors that leads to the separation of good and bad information," Japkowicz said. "It's learning the context and how words interact."

For example, two of the tweets in the dataset contain "bat soup" and "COVID" together. The researchers labeled the tweets misinformation, and the model identified them as such. The model also identified the tweets as having hate speech, hyperbolic language, and strongly emotional language, all of which are associated with misinformation. This suggests that, for each of these tweets, the model picked up on the human decision behind the labeling and abided by the researchers' rules.

The next steps are to improve the model's user interface and to extend the model so that it can detect misinformation in social media posts that include images or other multimedia. To do so, the statistical model will have to learn how a variety of elements in social media posts interact to create misinformation. In its current form, the model is best suited to social scientists and others who are researching ways to detect misinformation.

In spite of the advances in machine learning to help fight misinformation, Boukouvalas and Japkowicz agreed that human intelligence and news literacy remain the first line of defense in stopping the spread of misinformation.

"Through our work, we design tools based on machine learning to alert and educate the public in order to eliminate misinformation, but we strongly believe that humans need to play an active role in not spreading misinformation in the first place," Boukouvalas said.

More information: Caitlin Moroney et al, The Case for Latent Variable Vs Deep Learning Methods in Misinformation Detection: An Application to COVID-19, Discovery Science (2021). DOI: 10.1007/978-3-030-88942-5_33

Provided by American University

Citation: Research shows how statistics can aid in the fight against misinformation (2021, December 2) retrieved 17 July 2024 from https://techxplore.com/news/2021-12-statistics-aid-misinformation.html

