July 31, 2023

Analyzing generative AI's copyright crisis

by Shawn Ballard, Washington University in St. Louis

The recent explosion of artificial intelligence tools such as ChatGPT and Copilot have supercharged the assistance available to programmers. However, AI assistants may strip out comments embedded in code to convey copyright and attribution guidelines, leaving human coders none the wiser yet still on the hook legally for intellectual property infringement.

To combat this problem, computer science and engineering researchers in the McKelvey School of Engineering at Washington University in St. Louis have developed CodeIPPrompt, the first automated testing platform to evaluate how much language models generate IP-violating code. The team includes Ning Zhang and Chenguang Wang, both assistant professors; Yevgeniy Vorobeychik, professor; Zhiyuan Yu, a graduate student in Zhang's lab and first author on the paper; and Chaowei Xiao, assistant professor of computer science at Arizona State University.

Yu presented the work July 23 at the International Conference on Machine Learning in Honolulu. Notably, the team's analysis showed that copyright infringement issues are prevalent across state-of-the-art open-source models including CodeRl, CodeGen and CodeParrot, as well as in commercial products including Copilot, ChatGPT and GPT-4.

"We developed this tool to help people understand that if they're using these large language models to help write code, there's a good chance they might generate IP infringing content," Zhang said. "As users, we have a responsibility to use AI ethically. That's influenced by how we understand AI technology and the content it produces."

Though CodeIPPrompt can't say for sure if AI-generated code constitutes an IP violation—Zhang notes that issue is ultimately a legal question that will play out in the courts as cases are brought against the users of AI tools for copyright infringement—it can give users a risk score that indicates how similar generated code is to copyright protected content. Zhang anticipates that the tool will help guide the ongoing development of AI and point to potential mitigation strategies and other protections against IP violations in the future.

More information: CODEIPPROMPT: Intellectual Property Infringement Assessment of Code Language Models. openreview.net/pdf?id=zdmbZl0ia6

Provided by Washington University in St. Louis

Citation: Analyzing generative AI's copyright crisis (2023, July 31) retrieved 29 June 2024 from https://techxplore.com/news/2023-07-generative-ai-copyright-crisis.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

Researchers discover new vulnerability in large language models

57 shares

Feedback to editors

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Jun 28, 2024

Researchers develop the fastest possible flow algorithm

Jun 28, 2024

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Jun 28, 2024

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Jun 27, 2024

Wireless receiver blocks interference for better mobile device performance

Jun 27, 2024

Researchers successfully develop domestic 6G antenna measurement system

Jun 27, 2024

Research shows how common plastics could passively cool and heat buildings with the seasons

Jun 27, 2024

Researchers suggest smart solution to harness waste heat from industry

Jun 27, 2024

Robotic hand with tactile fingertips achieves new dexterity feat

Jun 27, 2024

Help or hindrance? ER robots have potential to aid health care workers

Jun 27, 2024

Load comments (0)

Analyzing generative AI's copyright crisis

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Researchers discover new vulnerability in large language models

If ChatGPT wrote it, who owns the copyright? It depends on where you live, but in Australia it's complicated

New language model boasts hundreds of billions of parameters

New platform allows easier, cheaper, and safer interactions with large language models like ChatGPT

Right to be Forgotten laws must extend to generative AI, say researchers

Microsoft's GitHub to add OpenAI chat functions to coding tool

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Phys.org

Medical Xpress

Science X

Analyzing generative AI's copyright crisis

Researchers develop novel 3D printing strategy with controllable gradients porous structures

Researchers develop the fastest possible flow algorithm

Real-time modeling of 3D temperature distributions within nuclear microreactors to improve safety systems

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Wireless receiver blocks interference for better mobile device performance

Researchers successfully develop domestic 6G antenna measurement system

Research shows how common plastics could passively cool and heat buildings with the seasons

Researchers suggest smart solution to harness waste heat from industry

Robotic hand with tactile fingertips achieves new dexterity feat

Help or hindrance? ER robots have potential to aid health care workers

Related Stories

Researchers discover new vulnerability in large language models

If ChatGPT wrote it, who owns the copyright? It depends on where you live, but in Australia it's complicated

New language model boasts hundreds of billions of parameters

New platform allows easier, cheaper, and safer interactions with large language models like ChatGPT

Right to be Forgotten laws must extend to generative AI, say researchers

Microsoft's GitHub to add OpenAI chat functions to coding tool

Recommended for you

Researchers develop the fastest possible flow algorithm

Is ChatGPT the key to stopping deepfakes? Study asks LLMs to spot AI-generated images

Robotic hand with tactile fingertips achieves new dexterity feat

Sony introduces AI for single-instrument accompaniment generation in music production

Mechanical computer relies on kirigami cubes, not electronics

New work explores optimal circumstances for reaching a common goal with humanoid robots

Your Privacy