July 22, 2024

Study showcases new method for better grouping in data analysis

by Tepper School of Business, Carnegie Mellon University

Researchers at Carnegie Mellon University and UC Berkeley have developed a new method to improve how computers organize and analyze large datasets. This advancement improves the ability to extract information from knowledge graphs, impacting the ability to analyze social networks and customer behavior.

The new method explained in a study led by Benjamin Moseley, Carnegie Bosch Associate Professor of Operations Research at the Tepper School of Business at Carnegie Mellon, can group similar items together more effectively while keeping different items apart.

The paper will appear in ICALP, the International Colloquium on Automata, Languages, and Programming conference, which took place in July 2024.

"Our new algorithm can significantly enhance how we analyze large data sets, whether it's for improving social media platforms by accurately detecting user communities or advancing medical research by better understanding genetic interactions," Moseley said.

He noted that a key trend in business analytics is the ability to work with knowledge graphs, which show information like customer behavior or business processes. This paper focuses on clustering, a common method for extracting information from these graphs. The new method in this study can group similar items more effectively while keeping different items apart.

Organizing massive amounts of data correctly is challenging due to inconsistencies and the sheer volume of information. Moseley and his team focused on creating an algorithm that can quickly and accurately group data points. They used mathematical structures consisting of nodes, which represent data points, and edges, which are connections between nodes. The algorithm works by evaluating these connections and determining the best way to group similar nodes.

The results showed that their algorithm is faster and more accurate than previous methods. It can handle large data sets more efficiently, making it practical for real-world applications.

"Our new method is faster than any previous methods at minimizing mistakes when grouping data," said Sami Davies, a research scientist in theoretical computer science at the University of California, Berkeley. "Our method is also more flexible, in the sense that we can group data in a way that is good for many different objectives simultaneously."

The researchers plan to continue refining their method and exploring its applications in different fields. This ongoing work could lead to even more accurate and insightful data analysis.

Heather Newman, a Ph.D. candidate in the Algorithms, Combinatorics, and Optimization doctoral program at the Tepper School was also a co-author.

More information: Sami Davies et al, Simultaneously Approximating All ℓp-norms in Correlation Clustering, arXiv (2023). DOI: 10.48550/arxiv.2308.01534

Journal information: arXiv

Provided by Tepper School of Business, Carnegie Mellon University

Citation: Study showcases new method for better grouping in data analysis (2024, July 22) retrieved 22 July 2024 from https://techxplore.com/news/2024-07-showcases-method-grouping-analysis.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New machine learning method predicts future data patterns to optimize data storage

4 shares

Feedback to editors

Researchers discover mathematical expression for 'blockchain trilemma'

1 hour ago

Converting captured carbon to fuel: Study assesses what's practical and what's not

1 hour ago

Engineers develop safe and long-cyclable lithium metal battery for high temperatures

1 hour ago

Interfacial fracture of perovskite light emitting devices

1 hour ago

Foldable pouch actuator improves finger extension in soft rehabilitation gloves

2 hours ago

Carbon fiber composite sensors offer solution for efficient traffic monitoring

4 hours ago

Scientists use AI to predict a wildfire's next move

4 hours ago

Less is more: Efficient hydrogen production with less precious metals

5 hours ago

Micro-sized optical spectrometer operates across visible spectrum with sub-5-nm resolution

Jul 20, 2024

Researchers develop framework to merge AI and human intelligence for process safety

Jul 20, 2024

Load comments (0)

Study showcases new method for better grouping in data analysis

Researchers discover mathematical expression for 'blockchain trilemma'

Converting captured carbon to fuel: Study assesses what's practical and what's not

Engineers develop safe and long-cyclable lithium metal battery for high temperatures

Interfacial fracture of perovskite light emitting devices

Foldable pouch actuator improves finger extension in soft rehabilitation gloves

Carbon fiber composite sensors offer solution for efficient traffic monitoring

Scientists use AI to predict a wildfire's next move

Less is more: Efficient hydrogen production with less precious metals

Micro-sized optical spectrometer operates across visible spectrum with sub-5-nm resolution

Researchers develop framework to merge AI and human intelligence for process safety

New machine learning method predicts future data patterns to optimize data storage

How machine learning enables computers to think faster and work smarter

New study offers a better way to make AI fairer for everyone

Decoding consumer preference: Advanced algorithms enhance brand loyalty

New research explores innovative methods in data visualization with Jaya algorithm

Advanced DeepLabv3+ algorithm enhances safflower filament harvesting with high accuracy

Scientists use AI to predict a wildfire's next move

Researchers develop framework to merge AI and human intelligence for process safety

New framework allows robots to learn via online human demonstration videos

Enhancing adaptive radar with AI and an enormous open-source dataset

Neural network learns to build maps using Minecraft

Machine learning unlocks secrets to advanced alloys

Phys.org

Medical Xpress

Science X

Study showcases new method for better grouping in data analysis

Researchers discover mathematical expression for 'blockchain trilemma'

Converting captured carbon to fuel: Study assesses what's practical and what's not

Engineers develop safe and long-cyclable lithium metal battery for high temperatures

Interfacial fracture of perovskite light emitting devices

Foldable pouch actuator improves finger extension in soft rehabilitation gloves

Carbon fiber composite sensors offer solution for efficient traffic monitoring

Scientists use AI to predict a wildfire's next move

Less is more: Efficient hydrogen production with less precious metals

Micro-sized optical spectrometer operates across visible spectrum with sub-5-nm resolution

Researchers develop framework to merge AI and human intelligence for process safety

Related Stories

New machine learning method predicts future data patterns to optimize data storage

How machine learning enables computers to think faster and work smarter

New study offers a better way to make AI fairer for everyone

Decoding consumer preference: Advanced algorithms enhance brand loyalty

New research explores innovative methods in data visualization with Jaya algorithm

Advanced DeepLabv3+ algorithm enhances safflower filament harvesting with high accuracy

Recommended for you

Scientists use AI to predict a wildfire's next move

Researchers develop framework to merge AI and human intelligence for process safety

New framework allows robots to learn via online human demonstration videos

Enhancing adaptive radar with AI and an enormous open-source dataset

Neural network learns to build maps using Minecraft

Machine learning unlocks secrets to advanced alloys

Your Privacy