September 7, 2022

Cleaning up social media with machine learning

by David Bradley, Inderscience

Adult, or pornographic, content spam is a growing problem on social media. New research in the International Journal of Business Intelligence and Data Mining discusses how such content might be detected and removed quickly.

Deepali Dhaka, Surbhi Kakar, and Monica Mehrotra of Jamia Millia Islamia (Central University) in Jamia Nagar, New Delhi, India, explain how the user experience in general, and that of younger people using social media in particular, might be improved if obscene spam content can be filtered effectively and quickly. Machine learning tools are often the way forward in detecting particular types of content, and the team has demonstrated that one such tool, XGBoost, can detect adult spam content with more than 90% accuracy. This was the most effective of the six classification algorithms the team tested and adapted for detecting pornographic spam on Twitter.

As such, fewer than ten in every hundred updates would be misclassified. The team's approach needed to analyze just a small number of features (value system, the entropy of words, lexical diversity, and word embeddings) to pluck adult spam updates from the general stream of updates on Twitter, one of the most well-known social media platforms.
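To make the accuracy figure concrete, here is a minimal sketch of how accuracy is computed from a classifier's confusion counts, alongside the share of flagged updates that really are spam (precision). The two are related but separate quantities. The counts below are hypothetical and chosen only for illustration; they are not results from the paper.

```python
def accuracy(tp, fp, fn, tn):
    """Share of all updates that were classified correctly."""
    return (tp + tn) / (tp + fp + fn + tn)


def precision(tp, fp):
    """Share of updates flagged as spam that really are spam."""
    return tp / (tp + fp)


# Hypothetical counts for 1,000 updates (illustrative only, not from the paper):
# tp = spam flagged as spam, fp = normal updates flagged as spam,
# fn = spam that was missed, tn = normal updates left alone.
tp, fp, fn, tn = 180, 15, 45, 760

acc = accuracy(tp, fp, fn, tn)
prec = precision(tp, fp)
print(f"accuracy:  {acc:.3f}")
print(f"precision: {prec:.3f}")
print(f"false positives among flagged updates: {1 - prec:.3f}")
```

With these made-up counts, the classifier is right on 940 of 1,000 updates, while 15 of the 195 updates it flags are false positives.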

Detection works because everyday users of the platform generally discuss a wide variety of topics in different contexts and write and share in what might be referred to as an organic manner. In contrast, spammers, and pornographic spammers in this case, tend to have a fixed or even entirely automated approach to their updates, limited diversity of subject matter, as one would expect, and a very limited lexicon. These and other characteristics of spam messages make them recognizable to the algorithm.
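The contrast between organic and repetitive writing can be illustrated with two of the features the team used, word entropy and lexical diversity. The sketch below is an assumption about how such features might be computed, not the authors' implementation: it takes word entropy to be the Shannon entropy of the word-frequency distribution and lexical diversity to be the ratio of unique words to total words. The function names and the two sample messages are hypothetical.

```python
import math
from collections import Counter


def tokenize(text):
    """Naive whitespace tokenizer (an assumption; real pipelines do more)."""
    return text.lower().split()


def word_entropy(tokens):
    """Shannon entropy, in bits, of the word-frequency distribution."""
    if not tokens:
        return 0.0
    total = len(tokens)
    counts = Counter(tokens)
    return -sum((c / total) * math.log2(c / total) for c in counts.values())


def lexical_diversity(tokens):
    """Unique words divided by total words (type-token ratio)."""
    if not tokens:
        return 0.0
    return len(set(tokens)) / len(tokens)


# Hypothetical sample messages, made up for illustration.
organic = tokenize("went hiking today and the view from the ridge was amazing")
repetitive = tokenize("click here click here click here now")

for name, tokens in [("organic", organic), ("repetitive", repetitive)]:
    print(name, round(word_entropy(tokens), 2), round(lexical_diversity(tokens), 2))
```

The repetitive message scores lower on both measures, which is the kind of signal a classifier could learn from. In practice these values would be combined with the other features before being passed to a classifier.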

More information: Monica Mehrotra et al, Detection of Spammers disseminating obscene content on Twitter, International Journal of Business Intelligence and Data Mining (2021). DOI: 10.1504/IJBIDM.2022.10040432

Provided by Inderscience

Citation: Cleaning up social media with machine learning (2022, September 7) retrieved 17 July 2024 from https://techxplore.com/news/2022-09-social-media-machine.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.
