September 24, 2019

How an AI trained to read scientific papers could predict future discoveries

"Can machines think?", asked the famous mathematician, code breaker and computer scientist Alan Turing almost 70 years ago. Today, some experts have no doubt that artificial intelligence (AI) will soon be able to develop the kind of general intelligence that humans have. But others argue that machines will never measure up. Although AI can already outperform humans on certain tasks, just as calculators can, it can't be taught human creativity.

After all, our ingenuity, which is sometimes driven by passion and intuition rather than logic and evidence, has enabled us to make spectacular discoveries, ranging from vaccines to fundamental particles. Surely an AI won't ever be able to compete? Well, it turns out it might. A paper recently published in Nature reports that an AI has now managed to predict future scientific discoveries simply by extracting meaningful data from research publications.

Language has a deep connection with thinking, and it has shaped human societies, relationships and, ultimately, intelligence. Therefore, it is not surprising that the holy grail of AI research is the full understanding of human language in all its nuances. Natural Language Processing (NLP), which falls under the much larger umbrella of machine learning, aims to assess, extract and evaluate information from textual data.

Children learn by interacting with the surrounding world via trial and error. Learning how to ride a bicycle often involves a few bumps and falls. In other words, we make mistakes and we learn from them. This is precisely the way machine learning operates, sometimes with some extra "educational" input (supervised machine learning).

For example, an AI can learn to recognise objects in images by building up a picture of an object from many individual examples. Here, a human must show it images that either do or do not contain the object. The computer then guesses whether each image contains it, and adjusts its statistical model according to the accuracy of the guess, as judged by the human. However, we can also leave the computer program to do all the relevant learning by itself (unsupervised machine learning). Here, the AI automatically learns to detect patterns in data. In either case, a computer program needs to find a solution by evaluating how wrong it is and then adjusting itself to minimise that error.
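The guess-and-adjust loop described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the method used in the study: each "image" is reduced to a single made-up number, the update is a simple perceptron-style rule, and the names `predict` and `train` are invented here.

```python
# Toy supervised learner: guess a label, compare it with the label supplied
# by a human, and nudge the model to reduce the error. All data is made up.

def predict(weight, bias, x):
    """Guess 1 (object present) or 0 (object absent) from one feature."""
    return 1 if weight * x + bias > 0 else 0

def train(examples, epochs=20, rate=0.1):
    """Perceptron-style training on (feature, label) pairs."""
    weight, bias = 0.0, 0.0
    for _ in range(epochs):
        for x, label in examples:
            # error is 0 for a correct guess, +1 or -1 for a wrong one
            error = label - predict(weight, bias, x)
            weight += rate * error * x
            bias += rate * error
    return weight, bias

# Hypothetical labelled data: larger feature values mean "object present".
examples = [(0.1, 0), (0.3, 0), (0.4, 0), (0.7, 1), (0.8, 1), (0.9, 1)]
weight, bias = train(examples)
guesses = [predict(weight, bias, x) for x, _ in examples]
```

After training, `guesses` should match the human-provided labels on this toy data, because the model only changes when a guess is wrong.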

Suppose we want to understand some properties of a specific material. The obvious step is to search for information in books, web pages and any other appropriate resources. However, this is time-consuming, as it may involve hours of web searching and reading articles and specialised literature. This is where NLP can help. Using sophisticated methods and techniques, computer programs can identify concepts, mutual relationships, general topics and specific properties from large textual datasets.

In the new study, an AI learned to retrieve information from scientific literature via unsupervised learning. This has remarkable implications. Until now, most existing automated NLP-based methods have been supervised, requiring input from humans. Although this is an improvement on a purely manual approach, it is still a labour-intensive job.

However, in the new study, the researchers created a system that could accurately identify and extract information independently. It used sophisticated techniques based on statistical and geometrical properties of data to identify chemical names, concepts and structures. This was based on about 1.5 million abstracts of scientific papers on materials science.
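To give a feel for what "statistical and geometrical properties" of text can mean, here is a minimal sketch. It swaps in plain co-occurrence counts and cosine similarity as a stand-in, which is an assumption made for illustration and not the system described in the study. The three "abstracts" and the function names `cooccurrence_vectors` and `cosine` are hypothetical.

```python
import math
from collections import Counter, defaultdict

# Hypothetical mini-corpus standing in for abstracts; not real data.
abstracts = [
    "selenide compound shows high thermoelectric efficiency",
    "telluride compound shows high thermoelectric efficiency",
    "polymer binder improves electrode adhesion",
]

def cooccurrence_vectors(texts):
    """Map each word to counts of the other words that share its text."""
    vectors = defaultdict(Counter)
    for text in texts:
        words = text.split()
        for word in words:
            for other in words:
                if other != word:
                    vectors[word][other] += 1
    return vectors

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[k] * b[k] for k in a if k in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0

vectors = cooccurrence_vectors(abstracts)
similar = cosine(vectors["selenide"], vectors["telluride"])
unrelated = cosine(vectors["selenide"], vectors["polymer"])
```

Words that appear in the same contexts ("selenide" and "telluride" here) end up pointing in nearly the same direction, while words that never share a context score zero. No human labels are involved, which is the sense in which such learning is unsupervised.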

A machine learning program then classified words in the data based on specific features such as "elements", "energetics" and "binders". For example, "heat" was classified as part of "energetics", and "gas" as "elements". This helped connect certain compounds with types of magnetism and with similar materials, among other things, providing insight into how the words were connected, with no human intervention required.

Scientific discoveries

This method could capture complex relationships and identify different layers of information, a task that would be virtually impossible for humans to carry out. It provided insights well in advance of what scientists can currently predict. In fact, the AI could recommend materials for functional applications several years before their actual discovery. There were five such predictions, all based on papers published before 2009. For example, the AI managed to identify a substance known as CsAgGa2Se4 as a thermoelectric material, which scientists only discovered in 2012. So if the AI had been around in 2009, it could have speeded up the discovery.

It made the prediction by connecting the compound with words such as "chalcogenide" (material containing "chalcogen elements" such as sulfur or selenium), "optoelectronic" (electronic devices that source, detect and control light) and "photovoltaic applications". Many thermoelectric materials share such properties, and the AI was quick to show that.
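The idea of recommending a material because its word associations resemble those of a target property can be sketched as a simple ranking. Everything below is hypothetical: the vectors are made-up numbers, the material names are placeholders, and `rank_candidates` is an invented helper, so this shows the general shape of the idea and not the study's actual procedure.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)

# Made-up vectors. The three axes loosely stand for how strongly a word is
# associated with "chalcogenide", "optoelectronic" and "photovoltaic".
target = [0.9, 0.8, 0.7]  # assumed vector for the word "thermoelectric"
candidates = {
    "material_A": [0.8, 0.9, 0.6],
    "material_B": [0.1, 0.0, 0.2],
    "material_C": [0.5, 0.1, 0.9],
}
# Placeholder for materials already described as thermoelectric in the text.
already_reported = {"material_C"}

def rank_candidates(target, candidates, already_reported):
    """Rank not-yet-reported materials by similarity to the target word."""
    scored = [
        (cosine(target, vector), name)
        for name, vector in candidates.items()
        if name not in already_reported
    ]
    return [name for _, name in sorted(scored, reverse=True)]

ranking = rank_candidates(target, candidates, already_reported)
```

Excluding materials that are already reported alongside the target word is what turns a similarity search into a prediction: the top of the list contains candidates the literature has not yet explicitly linked to the property.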

This suggests that latent knowledge regarding future discoveries is to a large extent embedded in past publications. AI systems are becoming more and more independent. And there is nothing to fear. They can help us enormously to navigate through the huge amount of data and information, which is being continuously created by human activities. Despite concerns related to privacy and security, AI is changing our societies. I believe it will lead us to make better decisions, improve our daily lives and ultimately make us smarter.

Provided by The Conversation

This article is republished from The Conversation under a Creative Commons license. Read the original article.

Citation: How an AI trained to read scientific papers could predict future discoveries (2019, September 24) retrieved 18 July 2024 from https://techxplore.com/news/2019-09-ai-scientific-papers-future-discoveries.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.


