July 16, 2024

'Extreme boosting' AI model can cut through social media 'noise'

by Aaron Kupec, University of Massachusetts Amherst

Social media offers a treasure trove of data for researchers to understand how organizations and individuals use the technology to communicate with and grow their base of followers. However, manually analyzing the content can be time-consuming or, in some cases, simply impossible because of the volume of data. While machine-learning models can help, they present their own set of challenges.

Viviana Chiu Sik Wu, assistant professor of public policy at the University of Massachusetts Amherst, conducted a systematic review of 43 studies that analyzed social media data from philanthropic and nonprofit organizations. She then devised and tested a model that pairs machine learning with human oversight to analyze content more effectively.

The study appears in the Journal of Chinese Governance.

Wu found that most of the studies relied heavily on manual coding to analyze relatively small datasets, missing out on the benefits of automation and scalability offered by artificial intelligence. In cases where AI was used, it was often stymied by language nuances and other variables that arise during the training process for large language models, she says.

"We have been seeing a lot of research using topic modeling, but without properly training the data, those unsupervised models can introduce biases and noise into the results," Wu explains.

In addition, she notes, many studies omitted entire categories of social media data, which fall into three groups: text (message content), engagement (likes, comments, retweets, etc.) and network data (how followers, friends, etc. are interconnected).

Wu used a manually coded sample to develop what she calls an "extreme boosting" model, which couples computational power with human judgment to classify messages into predefined categories, an approach known as supervised machine learning.

While unsupervised machine learning can identify hidden patterns and relationships, for content analysis "it can be highly unreliable without a substantial set of training examples to begin with," the study warns.
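The supervised approach described above can be illustrated with a toy example. The sketch below is not the study's model: the article does not describe the algorithm's internals, so a plain AdaBoost-style ensemble of one-word decision stumps stands in here as a simple boosting method. The six labeled messages, the function names and the number of rounds are all invented for illustration.

```python
import math

# Hypothetical hand-coded sample: +1 = public engagement, -1 = other.
CODED_SAMPLE = [
    ("join our community forum tonight", 1),
    ("share your ideas at the town hall", 1),
    ("tell us what your neighborhood needs", 1),
    ("donate today to support our fund", -1),
    ("grant applications are due friday", -1),
    ("new scholarship fund accepting donations", -1),
]


def tokenize(text):
    """Lowercase a message and return its set of whitespace-separated words."""
    return set(text.lower().split())


def stump_predict(word, polarity, tokens):
    """A one-word rule: vote `polarity` if the word is present, else the opposite."""
    return polarity if word in tokens else -polarity


def train_boosted_stumps(samples, rounds=3):
    """AdaBoost-style training (a stand-in technique, not the study's model)."""
    docs = [(tokenize(text), label) for text, label in samples]
    vocab = sorted(set().union(*(tokens for tokens, _ in docs)))
    weights = [1.0 / len(docs)] * len(docs)
    model = []  # list of (word, polarity, alpha)
    for _ in range(rounds):
        # Pick the stump with the lowest weighted error on the coded sample.
        best = None
        for word in vocab:
            for polarity in (1, -1):
                err = sum(w for w, (tokens, label) in zip(weights, docs)
                          if stump_predict(word, polarity, tokens) != label)
                if best is None or err < best[0]:
                    best = (err, word, polarity)
        err, word, polarity = best
        err = min(max(err, 1e-10), 1 - 1e-10)  # avoid log(0) or division by zero
        alpha = 0.5 * math.log((1 - err) / err)
        model.append((word, polarity, alpha))
        # Raise the weight of misclassified messages, lower the rest.
        weights = [w * math.exp(-alpha * label * stump_predict(word, polarity, tokens))
                   for w, (tokens, label) in zip(weights, docs)]
        total = sum(weights)
        weights = [w / total for w in weights]
    return model


def classify(model, text):
    """Weighted vote of all stumps; returns +1 or -1."""
    tokens = tokenize(text)
    score = sum(alpha * stump_predict(word, polarity, tokens)
                for word, polarity, alpha in model)
    return 1 if score >= 0 else -1


model = train_boosted_stumps(CODED_SAMPLE)
print(classify(model, "tell us your ideas"))   # 1
print(classify(model, "donate to our fund"))   # -1
```

Each round concentrates on the messages the previous rules got wrong, which is the core idea shared by boosting methods. A real analysis would need far more coded examples and a held-out test set before trusting the predictions.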

To test her model, Wu collected 66,749 tweets posted in 2017–18 by the Twitter/X accounts of 192 community foundations in the U.S. She manually analyzed 15% of the messages and used them to train and test various algorithms, looking for the best predictive model to automatically analyze the remaining 56,718 tweets.
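The arithmetic behind those figures can be checked in a few lines. This is a minimal sketch that assumes the hand-coded sample is simply the total minus the tweets left for automated analysis. The 80/20 train/test split, the seed and the function name are hypothetical, since the article does not say how the coded sample was divided.

```python
import random

TOTAL_TWEETS = 66_749      # reported in the article
REMAINING_TWEETS = 56_718  # reported in the article

# Assumption: the coded sample is everything not left for automated analysis.
CODED_TWEETS = TOTAL_TWEETS - REMAINING_TWEETS
CODED_SHARE = CODED_TWEETS / TOTAL_TWEETS


def split_coded_sample(ids, test_fraction=0.2, seed=0):
    """Shuffle coded tweet ids and hold out a fraction for testing.

    test_fraction and seed are illustrative defaults, not values from the study.
    """
    shuffled = list(ids)
    random.Random(seed).shuffle(shuffled)
    n_test = int(len(shuffled) * test_fraction)
    return shuffled[n_test:], shuffled[:n_test]


train_ids, test_ids = split_coded_sample(range(CODED_TWEETS))
print(CODED_TWEETS)                   # 10031
print(f"{CODED_SHARE:.1%}")           # 15.0%
print(len(train_ids), len(test_ids))  # 8025 2006
```

The computed share matches the 15% quoted above, which suggests the two reported counts are consistent with each other.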

The model was tasked with identifying posts related to public engagement. Such posts are particularly hard to distinguish from messages about fundraising, grants and other topics because their content often overlaps.

The model identified 6,331 public engagement tweets, which were then verified. Though the "extreme boosting" model shows promise, Wu cautions that it requires further refinement to achieve the highest accuracy.

What is clear, she says, is that combining manual content analysis with automated machine learning can be a powerful tool to analyze social media datasets that are simply too large to be processed manually.

"The findings can be extended to situations in other fields well beyond nonprofits to analyze massive observational datasets on social media," Wu says.

However, she points out that accessing this data has become more challenging in recent years as some platforms, including Twitter/X and Facebook, have placed additional limits on the data they make available to researchers and the public.

The changes have scholars looking at other platforms, such as Reddit and TikTok.

"We need to be more creative and innovative at getting the data," she says.

More information: Viviana Chiu Sik Wu, Leveraging computational methods for nonprofit social media research: a systematic review and methodological framework, Journal of Chinese Governance (2024). DOI: 10.1080/23812346.2024.2365008

Provided by University of Massachusetts Amherst

Citation: 'Extreme boosting' AI model can cut through social media 'noise' (2024, July 16) retrieved 17 July 2024 from https://techxplore.com/news/2024-07-extreme-boosting-ai-social-media.html

