April 9, 2024

New code mines microscopy images in scientific articles

by Joseph E. Harmon, Argonne National Laboratory

Deep learning is a form of artificial intelligence transforming society by teaching computers to process information using artificial neural networks that mimic the human brain. It is now used in facial recognition, self-driving cars and even in the playing of complex games like Go. In general, the success of deep learning has depended on using large datasets of labeled images for training purposes.

A potential gold mine of labeled images resides within scientific literature, with over a million articles published each year. Most have many figures woven into the text. To date, these figures have not been amenable to deep learning models. This is, in part, due to their complex layouts. Each figure typically contains multiple embedded images, graphs and illustrations. Also lacking has been an adequate means to search the literature for images matching specific content.

Addressing this challenge, researchers at the U.S. Department of Energy's (DOE) Argonne National Laboratory and Northwestern University have created the EXSCLAIM! software tool. The name stands for extraction, separation and caption-based natural language annotation of images.

The findings are published in the journal Patterns.

"Images generated by electron microscopes down to the billionths of a meter are one of the most important kinds of figures in materials science literature," said Maria Chan, scientist in Argonne's Center for Nanoscale Materials, a DOE Office of Science user facility. "These images are essential to the understanding and development of new materials in many different fields. Our goal with EXSCLAIM! is to unlock the untapped potential of these imaging data."

What sets EXSCLAIM! apart is its unique focus on a query-to-dataset approach, similar to how a prompt is used with generative AI tools such as ChatGPT and DALL-E. It is thus capable of extracting individual images with very specific content from figures, as it both classifies the image content and recognizes the degree of magnification. It can then create descriptive labels for each image. This innovative software tool is expected to become a valuable asset for scientists researching new materials at the nanoscale.

"While existing methods often struggle with the compound layout problem, EXSCLAIM! employs a new approach to overcome this," said lead author Eric Schwenker, a former Argonne graduate student. "Our software is effective at identifying sharp image boundaries, and it excels in capturing irregular image arrangements."

EXSCLAIM! has already demonstrated its effectiveness by constructing a self-labeled electron microscopy dataset of over 280,000 nanostructure images. While initially developed around materials microscopy images, EXSCLAIM! is adaptable to any scientific field that produces high volumes of papers with images. The software thus promises to revolutionize the use of published scientific images across various disciplines.

"Researchers now have a powerful image-mining tool to advance their understanding of complex visual information," Chan said.

More information: Eric Schwenker et al, EXSCLAIM!: Harnessing materials science literature for self-labeled microscopy datasets, Patterns (2023). DOI: 10.1016/j.patter.2023.100843

Journal information: Patterns

Provided by Argonne National Laboratory

Citation: New code mines microscopy images in scientific articles (2024, April 9) retrieved 16 August 2024 from https://techxplore.com/news/2024-04-code-microscopy-images-scientific-articles.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New research suggests AI image generation using DALL-E 2 has promising future in radiology

30 shares

Feedback to editors

Engineers design tiny batteries for powering cell-sized robots

10 hours ago

Leaf-like solar concentrators promise major boost in solar efficiency

11 hours ago

Why does AI beat humans at the strategy game Diplomacy?

11 hours ago

New technique prints metal oxide thin film circuits at room temperature

12 hours ago

Studies highlight challenges and solutions in making large language models trustworthy

13 hours ago

Finding security flaws in Android ahead of malicious hackers

14 hours ago

Robot planning tool accounts for human carelessness

14 hours ago

From shrimp to steel: Introducing nature-inspired metalworking

15 hours ago

'AI Scientist' model designed to conduct scientific research autonomously

15 hours ago

Global AI adoption is outpacing risk understanding, researchers warn

16 hours ago

Load comments (0)

New code mines microscopy images in scientific articles

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

New research suggests AI image generation using DALL-E 2 has promising future in radiology

High-precision superimposition of X-ray fluoroscopic images and 3D CT data

Researcher develops filter to tackle 'unsafe' AI-generated images

Detecting academic plagiarism through image processing and semantic mapping

Addressing copyright, compensation issues in generative AI

Artificial intelligence magnifies the utility of electron microscopes

A two-stage framework to improve LLM-based anomaly detection and reactive planning

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Why does AI beat humans at the strategy game Diplomacy?

Studies highlight challenges and solutions in making large language models trustworthy

How working with AI impacts the collective attention of teams

Phys.org

Medical Xpress

Science X

New code mines microscopy images in scientific articles

Engineers design tiny batteries for powering cell-sized robots

Leaf-like solar concentrators promise major boost in solar efficiency

Why does AI beat humans at the strategy game Diplomacy?

New technique prints metal oxide thin film circuits at room temperature

Studies highlight challenges and solutions in making large language models trustworthy

Finding security flaws in Android ahead of malicious hackers

Robot planning tool accounts for human carelessness

From shrimp to steel: Introducing nature-inspired metalworking

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Related Stories

New research suggests AI image generation using DALL-E 2 has promising future in radiology

High-precision superimposition of X-ray fluoroscopic images and 3D CT data

Researcher develops filter to tackle 'unsafe' AI-generated images

Detecting academic plagiarism through image processing and semantic mapping

Addressing copyright, compensation issues in generative AI

Artificial intelligence magnifies the utility of electron microscopes

Recommended for you

A two-stage framework to improve LLM-based anomaly detection and reactive planning

'AI Scientist' model designed to conduct scientific research autonomously

Global AI adoption is outpacing risk understanding, researchers warn

Why does AI beat humans at the strategy game Diplomacy?

Studies highlight challenges and solutions in making large language models trustworthy

How working with AI impacts the collective attention of teams

Your Privacy