October 6, 2022

Students help NASA find landslides by training computers to read Reddit

University of British Columbia graduate students have trained computers to "read" news articles about landslides on Reddit to bolster a NASA database, which could improve predictions of when and where these natural disasters will occur.

For their Master of Data Science in Computational Linguistics capstone project, Badr Jaidi and his team, the Social Landslides group, trained computers to automatically extract useful information from relevant news articles about landslides that were posted to Reddit. In this Q&A, he discusses how this tool could end up saving lives.

Why do we need this tool?

According to the World Health Organization, landslides are more widespread than any other geological event. They're so destructive, and we don't have that much data about them. The more accurate landslide data you have, the more it's possible to accurately predict which places have higher risk, which could ultimately save lives.

NASA collects such information in a public database called the Cooperative Open Online Repository, or COOLR, and uses this to predict when and where landslides will occur. But people have had to manually submit landslide information or search for news articles and data one by one, which is pretty tedious. Our tool automates that process, completing in minutes what previously might have taken months.

That would free up resources for more important research, and would also mean we get more data, faster, potentially improving research in landslides generally, as well as NASA's landslide predictions.

How does it work?

Guided by BGC Engineering Inc. and NASA for our capstone project, our team designed a tool that scans Reddit for news articles about landslides within a given period of time and then extracts relevant information.

First, a computer model works out whether the article is indeed about landslides, rather than say, an election where someone wins "by a landslide," or as we also found, articles about Pokémon with earth techniques like "rock slide."

Then, we trained a natural language processing model on landslide data, teaching it to recognize the information we wanted from an article. This kind of model can understand language, including analyzing sentences. So, we would give it a news article, and ask where a landslide might have happened. The model would predict the answer based on the language involved, for example, "The landslide most likely happened here, according to this sentence," and we would let it know if it was correct or not.

In this way, the computer learns what information to automatically and accurately extract, including when a landslide happened and where, what caused it, and how many fatalities were involved.

This all happens fairly quickly: It returns a month's worth of articles in about 15 minutes, compared with going through them manually to find those pieces of information. The data can then be fed into COOLR. This took us about two months to build. NASA is currently assessing whether the tool can be run as-is or needs some adjustments to use.

Could the tool be used on other social media sites?

We used Reddit because it's free to access their application programming interface (API). For instance, Twitter's API has a lot of restrictions, and it's quite expensive to access. Also, the amount of data would be enormous.

We wanted to start small and prove it works with Reddit. But it could be expanded to bigger platforms and sources, provided they have news articles. You could even expand the tool to use it for other disasters such as earthquakes, using the same methodology by training the models with similar datasets.

Improving upon the model and adding more sources from which landslides can be extracted other than Reddit would ultimately help NASA have more data points, faster. I'll keep my eye on it.

More information: BGC-NASA Landslide Detection tool.

Provided by University of British Columbia

Citation: Students help NASA find landslides by training computers to read Reddit (2022, October 6) retrieved 17 July 2024 from https://techxplore.com/news/2022-10-students-nasa-landslides-reddit.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researchers upgrade international nomenclature of landslide geometry

42 shares

Feedback to editors

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

10 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

13 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

15 hours ago

Large language models make human-like reasoning mistakes, researchers find

15 hours ago

Unveiling a new class of synthetic fuels

16 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

16 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

17 hours ago

New system enables intuitive teleoperation of a robotic manipulator in real-time

19 hours ago

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

21 hours ago

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Jul 15, 2024

Load comments (0)

Students help NASA find landslides by training computers to read Reddit

Why do we need this tool?

How does it work?

Could the tool be used on other social media sites?

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Researchers upgrade international nomenclature of landslide geometry

New nationwide strategy brings scientists and communities together to help reduce landslide risks

NASA study finds climate extremes affect landslides in surprising ways

Slope stability model can help predict landslides to protect communities, save lives

Keeping current with landslide prediction tools

Landslides increasingly threaten the world's urban poor

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Phys.org

Medical Xpress

Science X

Students help NASA find landslides by training computers to read Reddit

Why do we need this tool?

How does it work?

Could the tool be used on other social media sites?

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Related Stories

Researchers upgrade international nomenclature of landslide geometry

New nationwide strategy brings scientists and communities together to help reduce landslide risks

NASA study finds climate extremes affect landslides in surprising ways

Slope stability model can help predict landslides to protect communities, save lives

Keeping current with landslide prediction tools

Landslides increasingly threaten the world's urban poor

Recommended for you

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Your Privacy