AI tool summarizes lengthy papers in a sentence

Scholars have a nifty way of alerting colleagues to lengthy treatises that they find simply not worth their time to read.

They tag such documents "tl;dr"—too long, didn't read.

It's kind of a 21st century spin on the 420-year-old notion Shakespeare's Polonius relayed to the king and queen in "Hamlet": "Brevity," he suggested, "is the soul of wit."

The Allen Institute for Artificial Intelligence in Seattle has taken both sentiments to heart. This week it unveiled a system that offers extreme condensation of lengthy computer-science reports, slashing the time it takes to review such literature.

Semantic Scholar is an AI-powered search tool for scientific literature. Its new summarization feature surveys massive numbers of research papers and reduces each to a one-sentence summary. More than 7 million users a month access Semantic Scholar.

Currently, there are 10 million computer-science papers in Semantic Scholar's database. According to Dan Weld, who oversees the database, papers from other disciplines will gradually be added.

The system offers a great advantage to researchers who up to now have had to rely on scanning numerous titles and often lengthy abstracts, an especially trying task on mobile devices. Following early tests, reaction has been positive. "People seem to really like it," Weld said.

A variety of natural language processing programs have been developed over the years to summarize documents. They generally take one of two approaches. The extractive approach selects representative text and uses it verbatim in the summary. For instance, Paper Digest, developed in 2018, appears to extract key sentences rather than rewriting findings in its own words.

The other approach is abstractive; it uses natural language generation algorithms to create summaries with original wording. Improvements in AI natural language generation in recent years have made this approach the favored one among programmers.
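To make the distinction concrete, here is a minimal sketch of the extractive idea in Python. It is a toy illustration only, not the method used by Paper Digest or Semantic Scholar: the function name `extractive_summary`, the word-frequency scoring, and the crude length-based word filter are all assumptions chosen for brevity.

```python
import re
from collections import Counter


def extractive_summary(text: str) -> str:
    """Return the one sentence whose words are, on average, most frequent in the text.

    Hypothetical helper for illustration; real extractive systems use far
    richer signals than raw word counts.
    """
    # Split on whitespace that follows sentence-ending punctuation.
    sentences = [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]

    # Count words across the whole document, skipping very short words
    # as a crude stand-in for a stop-word list.
    words = re.findall(r"[a-z']+", text.lower())
    freq = Counter(w for w in words if len(w) > 3)

    def score(sentence: str) -> float:
        tokens = [w for w in re.findall(r"[a-z']+", sentence.lower()) if len(w) > 3]
        return sum(freq[w] for w in tokens) / max(len(tokens), 1)

    # The chosen sentence is copied verbatim, which is what makes this extractive.
    return max(sentences, key=score)
```

Because the result is always a sentence lifted unchanged from the input, the summary can never be shorter than the shortest sentence in the paper. An abstractive system has no such limit, since it generates new wording.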

Semantic Scholar is notable for achieving the greatest compression rate of all summarizing tools. With scientific papers averaging 5,000 words, Semantic Scholar's summaries are around 21 words. That works out to summaries about 1/238th the size of the reports. The closest Semantic Scholar competitor compresses documents to only 1/36th of the report size.
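The arithmetic behind those figures can be checked in a few lines of Python. The function name `compression_ratio` is made up for this illustration, and the competitor's summary length is inferred here by applying its 1/36 ratio to the same 5,000-word average, which is an assumption rather than a reported number.

```python
def compression_ratio(source_words: int, summary_words: int) -> float:
    """How many times shorter the summary is than the source."""
    return source_words / summary_words


# Figures quoted in the article: 5,000-word papers, 21-word summaries.
ratio = compression_ratio(5000, 21)
print(f"Summary is about 1/{round(ratio)} of the paper")

# Assumption: apply the competitor's 1/36 ratio to the same 5,000-word paper.
competitor_words = 5000 / 36
print(f"A 1/36 summary of the same paper would run about {round(competitor_words)} words")
```

Under that assumption, the competitor's output would be a short paragraph rather than a single sentence.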

Jevin West, an information scientist at the University of Washington in Seattle who tested the new program, said, "I predict that this kind of tool will become a standard feature of scholarly search in the near future. Actually, given the need, I am amazed it has taken this long to see it in practice."

He noted that the tool is not yet perfect, "but it's definitely a step in the right direction."

The Allen Institute team is making its code available for free. It has also set up a demonstration site open to all at scitldr.apps.allenai.org/

Currently, only papers written in English are being accepted. But the program's authors hope to include documents in other languages eventually.

More information: github.com/allenai/scitldr
