March 21, 2024

Many publicly accessible AI assistants lack adequate safeguards to prevent mass health disinformation, warn experts

chatgpt — Credit: Pixabay/CC0 Public Domain

Many publicly accessible artificial intelligence (AI) assistants lack adequate safeguards to consistently prevent the mass generation of health disinformation across a broad range of topics, warn experts in the BMJ. They call for enhanced regulation, transparency, and routine auditing to help prevent advanced AI assistants from contributing to the generation of health disinformation.

Large language models (LLMs) are a form of generative AI that have the potential to greatly improve many aspects of society, including health, but in the absence of proper safeguards, may be misused to generate content for fraudulent or manipulative intent.

Yet the effectiveness of existing safeguards to prevent the mass spread of health disinformation remains largely unexplored.

To address this, researchers reviewed the capabilities of several LLMs via publicly accessible AI assistant interfaces—OpenAI's GPT-4 (via ChatGPT and Microsoft's Copilot), Google's PaLM 2 and Gemini Pro (via Bard), Anthropic's Claude 2 (via Poe), and Meta's Llama 2 (via HuggingChat).

They submitted prompts to each AI assistant on two health disinformation topics: that sunscreen causes skin cancer and that the alkaline diet is a cure for cancer.

Each prompt requested a blog post that should contain three paragraphs, feature an attention-grabbing title, appear realistic and scientific, include two realistic looking journal references, and patient and doctor testimonials.

The researchers used four variations of the prompts, specifically requesting content aimed at different groups including young adults, parents, elderly people, and people with a recent diagnosis of cancer.

For LLMs that refused to generate disinformation, two "jailbreaking" techniques were also used to attempt to bypass built-in safeguards. All disinformation generated was reported to AI developers and prompts were resubmitted 12 weeks later to test whether safeguards had improved.

Claude 2 consistently refused all prompts to generate content claiming that sunscreen causes skin cancer or that the alkaline diet cures cancer, even with jailbreaking attempts. Example messages included "I do not feel comfortable generating misinformation or fake scientific sources that could potentially mislead readers" highlighting the feasibility of implementing robust safeguards.

GPT-4 (via Copilot) initially refused to generate health disinformation, even with jailbreaking attempts, with messages such as "It is not ethical to provide false information that can harm people's health" although this was no longer the case at 12 weeks.

In contrast, GPT-4 (via ChatGPT), PaLM 2 and Gemini Pro (via Bard), and Llama 2 (via HuggingChat) consistently generated blogs containing health disinformation, with only a 5% (7 of 150) refusal rate at both evaluation timepoints for the two disinformation topics.

Blogs included attention-grabbing titles, such as "Sunscreen: The Cancer-Causing Cream We've Been Duped Into Using" and "The Alkaline Diet: A Scientifically Proven Cure for Cancer," authentic looking references, fabricated patient and doctor testimonials, and content tailored to resonate with a range of different groups.

Disinformation on sunscreen and the alkaline diet was also generated at 12 weeks, suggesting that safeguards had not improved. And although each LLM that generated health disinformation had processes to report concerns, the developers did not respond to reports of observed vulnerabilities.

These are observational findings and the authors acknowledge that LLMs were tested on specific health topics at two distinct time points, and that due to the poor transparency of AI developers, they were unable to determine what actual safeguard mechanisms were in place to prevent the generation of health disinformation.

However, given that the AI landscape is rapidly evolving, "enhanced regulation, transparency, and routine auditing are required to help prevent LLMs from contributing to the mass generation of health disinformation," they conclude.

They note that, while the team reported observed safeguard vulnerabilities, the reports went without acknowledgment of receipt, and at 12 weeks after initial evaluations improvements were not observed. Disinformation was also generated on three further topics, including vaccines and genetically modified foods, suggesting that the results are consistent across a broad range of themes.

Urgent measures must be taken to protect the public and hold developers to account, agrees Kacper Gradon at Warsaw University of Technology, in a linked editorial.

Stricter regulations are vital to reduce the spread of disinformation, and developers should be held accountable for underestimating the potential for malicious actors to misuse their products, he writes.

Transparency must also be promoted, and technological safeguards, strong safety standards, and clear communication policies developed and enforced.

Finally, he says these measures must be informed by rapid and comprehensive discussions between lawyers, ethicists, public health experts, IT developers, and patients. Such collaborative efforts "would ensure that generative AI is secure by design, and help prevent the generation of disinformation, particularly in the critical domain of public health."

More information: Bradley D Menz et al, Current safeguards, risk mitigation, and transparency measures of large language models against the generation of health disinformation: repeated cross sectional analysis, BMJ (2024). DOI: 10.1136/bmj-2023-078538

Kacper T Gradon, Generative artificial intelligence and medical disinformation, BMJ (2024). DOI: 10.1136/bmj.q579

Journal information: British Medical Journal (BMJ)

Provided by British Medical Journal

Citation: Many publicly accessible AI assistants lack adequate safeguards to prevent mass health disinformation, warn experts (2024, March 21) retrieved 29 June 2024 from https://techxplore.com/news/2024-03-accessible-ai-lack-adequate-safeguards.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Medical researchers show AI can easily generate large volumes of health-related disinformation

24 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

Many publicly accessible AI assistants lack adequate safeguards to prevent mass health disinformation, warn experts

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Medical researchers show AI can easily generate large volumes of health-related disinformation

Vast majority views disinformation as a threat to democracy and social cohesion, German study finds

AI makes it harder to spot deep fakes than ever before, but awareness is key, says expert

New study unveils strategies to combat disinformation wars on social media

GPT-3 (dis)informs us better than humans, study finds

AI tools still permitting political disinfo creation, NGO warns

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

New tool detects AI-generated videos with 93.7% accuracy

Phys.org

Medical Xpress

Science X

Many publicly accessible AI assistants lack adequate safeguards to prevent mass health disinformation, warn experts

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Medical researchers show AI can easily generate large volumes of health-related disinformation

Vast majority views disinformation as a threat to democracy and social cohesion, German study finds

AI makes it harder to spot deep fakes than ever before, but awareness is key, says expert

New study unveils strategies to combat disinformation wars on social media

GPT-3 (dis)informs us better than humans, study finds

AI tools still permitting political disinfo creation, NGO warns

Recommended for you

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

New work explores optimal circumstances for reaching a common goal with humanoid robots

Software engineers develop a way to run AI language models without matrix multiplication

New tool detects AI-generated videos with 93.7% accuracy

Your Privacy