October 16, 2023 report

Using a large-scale dataset holding a million real-world conversations to study how people interact with LLMs

by Bob Yirka, Tech Xplore

A team of computer scientists at the University of California Berkeley, working with one colleague from the University of California San Diego and another from Carnegie Mellon University, has created a large-scale dataset of 1 million real-world conversations to study how people interact with large language models (LLMs). They have published a paper describing their work and findings on the arXiv preprint server.

Over the past few years, LLMs such as ChatGPT have burst into the public realm, giving users across the world an opportunity to interact with chatbots backed by artificial intelligence. Such access has led to millions of "intelligent" conversations between humans and chatbots, resulting in not only discussions but also assistance with activities like programming, text writing and test taking.

In this new study, the research team wanted to know what sorts of interactions are occurring with AI chatbots, broken down by category: for example, what percentage of such conversations are about programming or a related topic. To find out, they obtained the texts of more than 1 million real-world conversations between people and 25 different AI chatbots and then parsed them by subject type.

The conversations were global in nature, involving people and their chatbots speaking 150 languages. To learn more about the nature of such conversations, the researchers used a program to randomly choose 100,000 of them for study.
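The sampling and category-percentage steps described above can be illustrated with a minimal Python sketch. It is not the researchers' actual pipeline: the record layout, the `topic` field, the helper names and the toy data are all assumptions made up for illustration, and the sketch presumes each conversation already carries a topic label.

```python
import random
from collections import Counter


def sample_conversations(conversations, k, seed=0):
    """Return k conversations chosen uniformly at random, without replacement."""
    rng = random.Random(seed)  # fixed seed so the same subset is drawn each run
    return rng.sample(conversations, k)


def topic_shares(conversations):
    """Map each topic label to its percentage share of the given conversations."""
    counts = Counter(c["topic"] for c in conversations)
    total = sum(counts.values())
    return {topic: 100.0 * n / total for topic, n in counts.items()}


# Toy stand-in data (hypothetical): 1,000 records with an assumed "topic" field.
toy = (
    [{"id": i, "topic": "programming"} for i in range(600)]
    + [{"id": 600 + i, "topic": "writing"} for i in range(300)]
    + [{"id": 900 + i, "topic": "other"} for i in range(100)]
)

subset = sample_conversations(toy, k=100, seed=42)
shares = topic_shares(subset)

# Print topics from most to least common in the sampled subset.
for topic, pct in sorted(shares.items(), key=lambda kv: -kv[1]):
    print(f"{topic}: {pct:.1f}%")
```

Because the subset is drawn at random, its percentages only approximate those of the full collection; a larger sample narrows that gap.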

The research team found that roughly half of all the AI chatbot conversations were centered on what they describe as "safe" topics, such as computer programming, requests for help in writing text, or even gardening. The most popular topic involved software errors and their solutions.

They also found that approximately 10% of such conversations involved what the team describes as "unsafe" topics, meaning those with sexual or violent content. They found, for example, many instances of people asking their chatbot to provide them with erotic stories or to engage with them in sexual role playing.

The researchers suggest that studying real-world conversations between people and LLMs can help makers of such systems define how they want their products to be used and also find out how well controls designed to prevent "unsafe" use of such products are working.

More information: Lianmin Zheng et al, LMSYS-Chat-1M: A Large-Scale Real-World LLM Conversation Dataset, arXiv (2023). DOI: 10.48550/arxiv.2309.11998

Journal information: arXiv

Citation: Using a large-scale dataset holding a million real-world conversations to study how people interact with LLMs (2023, October 16) retrieved 30 June 2024 from https://techxplore.com/news/2023-10-large-scale-dataset-million-real-world-conversations.html

