June 24, 2024

New prompt-based technique to enhance AI security

by Higher Education Press

Researchers have developed a new approach to AI security that employs text prompts to better protect AI systems from cyber threats. This method focuses on the creation of adversarial examples to prevent AI from being misled by inputs that are typically undetectable to humans.

The prompt-based technique streamlines the generation of these adversarial inputs, allowing for quicker response to potential threats without extensive computations. Preliminary testing has shown that this method can effectively safeguard AI responses with minimal direct interaction with the AI systems.

Dr. Feifei Ma, the lead researcher, outlines the process: "Our approach involved initially crafting malicious prompts to identify vulnerabilities in AI models. Following this identification, these prompts were utilized as training data, helping the AI to resist similar attacks in the future."

Subsequent experiments indicated that this training approach improved the robustness of AI systems. Models trained with adversarial prompts were less likely to succumb to similar attacks, demonstrating an enhancement in their defensive capabilities.

"This method allows us to expose and then mitigate vulnerabilities in AI models, which is especially critical in sectors like finance and health care," Dr. Ma noted.

The research, published in Frontiers of Computer Science, indicates that AI systems trained with these adversarial prompts are more capable of resisting similar manipulation tactics in the future, potentially improving their overall robustness against cyber threats.

It is a collaborative work between Chinese Academy of Sciences, University of Chinese Academy of Sciences, Stanford University, and National University of Singapore.

More information: Yuting Yang et al, A prompt-based approach to adversarial example generation and robustness enhancement, Frontiers of Computer Science (2023). DOI: 10.1007/s11704-023-2639-2

Provided by Higher Education Press

Citation: New prompt-based technique to enhance AI security (2024, June 24) retrieved 17 July 2024 from https://techxplore.com/news/2024-06-prompt-based-technique-ai.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

New ransomware attack based on an evolutional generative adversarial network can evade security measures

16 shares

Feedback to editors

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

11 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

13 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

15 hours ago

Large language models make human-like reasoning mistakes, researchers find

16 hours ago

Unveiling a new class of synthetic fuels

16 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

16 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

17 hours ago

New system enables intuitive teleoperation of a robotic manipulator in real-time

20 hours ago

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

21 hours ago

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Jul 15, 2024

Load comments (0)

New prompt-based technique to enhance AI security

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

New ransomware attack based on an evolutional generative adversarial network can evade security measures

A simple technique to defend ChatGPT against jailbreak attacks

Scientists uncover quantum-inspired vulnerabilities in neural networks

New method for comparing neural networks exposes how artificial intelligence works

Researcher suggests how to effectively utilize large language models

AI can write a wedding toast or summarize a paper, but what happens if it's asked to build a bomb?

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Large language models make human-like reasoning mistakes, researchers find

Phys.org

Medical Xpress

Science X

New prompt-based technique to enhance AI security

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Related Stories

New ransomware attack based on an evolutional generative adversarial network can evade security measures

A simple technique to defend ChatGPT against jailbreak attacks

Scientists uncover quantum-inspired vulnerabilities in neural networks

New method for comparing neural networks exposes how artificial intelligence works

Researcher suggests how to effectively utilize large language models

AI can write a wedding toast or summarize a paper, but what happens if it's asked to build a bomb?

Recommended for you

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Large language models make human-like reasoning mistakes, researchers find

Your Privacy