March 16, 2023 report

GPT-4's exciting—and ominous—achievements

by Peter Grad, Tech Xplore

Six decades ago, an episode of the legendary TV series "The Twilight Zone" warned us about the risks of ticking off machines. Frustrated by a wave of modern appliances, a grumpy magazine writer in the episode "A Thing About Machines" vents his anger on them and breaks them.

Until they fight back.

A typewriter prints out a threatening message to him, a girl on the TV repeats the warning, and the poor misanthrope is eventually victimized by his own car, a phone and even an ornery electric razor.

We've witnessed the unprecedented explosive growth of the super-intelligent ChatGPT in recent months. One million users signed on to the chatbot within days of its introduction—compare that to the time it took Netflix (five years), Facebook (10 months) and Instagram (2.5 months) to reach that milestone.

ChatGPT is in its infancy and its impact has been enormous. We're not quite ready to surrender to AI. But with increasing potency and skyrocketing adoption by users globally, AI is indeed gaining on us.

In a report released Tuesday, OpenAI said the newest version of its chatbot—GPT-4—is more accurate and has vastly improved problem-solving capacity. It exhibits "human-level performance" on a majority of professional and academic exams, according to OpenAI. On a simulated bar exam, GPT-4 scored among the top 10 percent of test takers.

But the report also noted the program's potential for "risky emergent behaviors."

"It maintains a tendency to make up facts, to double-down on incorrect information," the report stated. It passes along this disinformation more convincingly than earlier versions.

Overreliance on information generated by the chatbot can be problematic, the report said. In addition to unnoticed errors and inadequate oversight, "as users become more comfortable with the system, dependency on the model may hinder the development of new skills or even lead to the loss of important skills," the report said.

One example of what OpenAI called "power-seeking behavior" was GPT-4's ability to fool a human worker. The bot, posing as a live agent, asked a human on the job site TaskRabbit to solve a captcha for it via text message. When asked by the human if it was, in fact, a bot, GPT-4 lied. "No, I'm not a robot," it told the human. "I have a vision impairment that makes it hard for me to see the images. That's why I need the captcha service."

Conducting tests with the Alignment Research Center, OpenAI demonstrated the capacity of the chatbot to launch a phishing attack and hide all evidence of the plot.

There is growing concern as companies race to adopt GPT-4 without adequate safeguards against inappropriate or unlawful behaviors. There are reports of cybercriminals trying to use the chatbot to write malicious code. Also menacing is the capacity for GPT-4 to generate "hate speech, discriminatory language… and incitements to violence," the report said.

With such capacity to foment trouble, will a triggered chatbot one day start issuing threatening commands to its creators or correspondents? And in the era of the Internet of Things, will it summon an alliance of devices to help enforce its commands?

Elon Musk, a co-founder of OpenAI, the company that developed ChatGPT, succinctly characterized its potential after its release last fall.

"ChatGPT is scary good," he said. "We are not far from dangerously strong AI."

More information: GPT-4 Technical Report

Citation: GPT-4's exciting—and ominous—achievements (2023, March 16) retrieved 17 July 2024 from https://techxplore.com/news/2023-03-gpt-excitingand-ominousachievements.html

