December 16, 2021

New model improves accuracy of machine learning in COVID-19 diagnosis while preserving privacy

Researchers in the UK and China have developed an artificial intelligence (AI) model that can diagnose COVID-19 as well as a panel of professional radiologists, while preserving the privacy of patient data.

The international team, led by the University of Cambridge and the Huazhong University of Science and Technology, used a technique called federated learning to build their model. Using federated learning, an AI model in one hospital or country can be independently trained and verified using a dataset from another hospital or country, without data sharing.

The researchers based their model on more than 9,000 CT scans from approximately 3,300 patients in 23 hospitals in the UK and China. Their results, reported in the journal Nature Machine Intelligence, provide a framework where AI techniques can be made more trustworthy and accurate, especially in areas such as medical diagnosis where privacy is vital.

AI has provided a promising solution for streamlining COVID-19 diagnoses and future public health crises. However, concerns surrounding security and trustworthiness impede the collection of large-scale representative medical data, posing a challenge for training a model that can be used worldwide.

In the early days of the COVID-19 pandemic, many AI researchers worked to develop models that could diagnose the disease. However, many of these models were built using low-quality data, "Frankenstein' datasets, and a lack of input from clinicians. Many of the same researchers from the current study highlighted that these earlier models were not fit for clinical use in the spring of 2021.

"AI has a lot of limitations when it comes to COVID-19 diagnosis, and we need to carefully screen and curate the data so that we end up with a model that works and is trustworthy," said co-first author Hanchen Wang from Cambridge's Department of Engineering. "Where earlier models have relied on arbitrary open-sourced data, we worked with a large team of radiologists from the NHS and Wuhan Tongji Hospital Group to select the data, so that we were starting from a strong position."

The researchers used two well-curated external validation datasets of appropriate size to test their model and ensure that it would work well on datasets from different hospitals or countries.

"Before COVID-19, people didn't realize just how much data you needed to collect in order to build medical AI applications," said co-author Dr. Michael Roberts from AstraZeneca and Cambridge's Department of Applied Mathematics and Theoretical Physics. "Different hospitals, different countries all have their own ways of doing things, so you need the datasets to be as large as possible in order to make something that will be useful to the widest range of clinicians."

The researchers based their framework on three-dimensional CT scans instead of two-dimensional images. CT scans offer a much higher level of detail, resulting in a better model. They used 9,573 CT scans from 3,336 patients collected from 23 hospitals located in China and the UK.

The researchers also had to mitigate for bias caused by the different datasets, and used federated learning to train a better generalized AI model, while preserving the privacy of each data center in a collaborative setting.

For a fair comparison, the researchers validated all the models on the same data, without overlapping with the training data. The team had a panel of radiologists make diagnostic predictions based on the same set of CT scans, and compared the accuracy of the AI models and human professionals.

The researchers say their model is useful not just for COVID-19, but for any other diseases that can be diagnosed using a CT scan. "The next time there's a pandemic, and there's every reason to believe that there will be, we'll be in a much better position to leverage AI techniques quickly so that we can understand new diseases faster," said Mr Wang.

"We've shown that encrypting medical data is possible, so we can build and use these tools while preserving patient privacy across internal and external borders," said Dr. Roberts. "By working with other countries, we can do so much more than we can alone."

The researchers are now collaborating with the newly-established WHO Hub for Pandemic and Epidemic Intelligence, to explore the possibility of advancing the privacy-preserving digital healthcare frameworks.

More information: Xiang Bai et al, Advancing COVID-19 diagnosis with privacy-preserving collaboration in artificial intelligence, Nature Machine Intelligence (2021). DOI: 10.1038/s42256-021-00421-z

Journal information: Nature Machine Intelligence

Provided by University of Cambridge

Citation: New model improves accuracy of machine learning in COVID-19 diagnosis while preserving privacy (2021, December 16) retrieved 17 July 2024 from https://techxplore.com/news/2021-12-accuracy-machine-covid-diagnosis-privacy.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Machine learning models for diagnosing COVID-19 are not yet suitable for clinical use: study

304 shares

Feedback to editors

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

14 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

16 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

18 hours ago

Large language models make human-like reasoning mistakes, researchers find

18 hours ago

Unveiling a new class of synthetic fuels

19 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

19 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

20 hours ago

New system enables intuitive teleoperation of a robotic manipulator in real-time

22 hours ago

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

Jul 16, 2024

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Jul 15, 2024

Load comments (0)

New model improves accuracy of machine learning in COVID-19 diagnosis while preserving privacy

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Machine learning models for diagnosing COVID-19 are not yet suitable for clinical use: study

World first for AI and machine learning to treat COVID-19 patients worldwide

Researchers build models using machine learning technique to enhance predictions of COVID-19 outcomes

New imaging resource assists AI in the COVID-19 fight

A model to classify financial texts while protecting users' privacy

New machine learning method allows hospitals to share patient data—privately

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Phys.org

Medical Xpress

Science X

New model improves accuracy of machine learning in COVID-19 diagnosis while preserving privacy

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Related Stories

Machine learning models for diagnosing COVID-19 are not yet suitable for clinical use: study

World first for AI and machine learning to treat COVID-19 patients worldwide

Researchers build models using machine learning technique to enhance predictions of COVID-19 outcomes

New imaging resource assists AI in the COVID-19 fight

A model to classify financial texts while protecting users' privacy

New machine learning method allows hospitals to share patient data—privately

Recommended for you

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

A new neural network makes decisions like a human would

Your Privacy