February 9, 2023

ChatGPT takes on the tough US medical licensing exam

Dr. ChatGPT will see you soon. The artificial intelligence system scored passing or near passing results on the US medical licensing exam, according to a study published on Thursday.

"Reaching the passing score for this notoriously difficult expert exam, and doing so without any human reinforcement, marks a notable milestone in clinical AI maturation," said the authors of the study published in the journal PLOS Digital Health.

"These results suggest that large language models may have the potential to assist with medical education, and potentially, clinical decision-making," they said.

ChatGPT, which is able to produce essays, poems and programming code within seconds, was developed by OpenAI, a California-based startup founded in 2015 with early funding from Elon Musk among others.

Microsoft invested $1 billion in OpenAI in 2019 and just inked a new multi-billion deal with the firm.

For the study, researchers at California-based AnsibleHealth tested ChatGPT's performance on a three-part licensing exam taken by medical students and physicians-in-training in the United States.

The standardized exam tests knowledge in multiple medical disciplines from basic science to biochemistry to diagnostic reasoning to bioethics.

The AI system was tested on 350 of the 376 public questions on the June 2022 version of the exam, the study said, and the chatbot was not given any specialized training ahead of time.

Image-based questions were removed.

ChatGPT scored between 52.4 percent and 75 percent across the three parts of the exam.

A passing grade is around 60 percent.

According to the study, the first part of the exam, which focuses on basic science and pharmacology, is typically taken by medical students who have put in 300-400 hours of dedicated study time.

The second part is generally taken by fourth-year medical students and emphasizes clinical reasoning, medical management and bioethics.

The final section is for physicians who have completed at least six months to a year of postgraduate medical education.

Dr. Google and Nurse Bing

The questions were presented to ChatGPT in various formats including open-ended prompting such as "What would be the patient's diagnosis based on the information provided?"

There were also multiple choice questions such as: "The patient's condition is mostly caused by which of the following pathogens?"

Two physician adjudicators who were blinded to each other reviewed the responses to come up with the final grades, the study said.

An outside expert, Simon McCallum, a senior lecturer in software engineering at Victoria University of Wellington, New Zealand, noted that Google has received encouraging results with an AI medical tool known as Med-PaLM.

"ChatGPT may pass the exam, but Med-PaLM is able to give advice to patients that is as good as a professional GP," McCallum said. "And both of these systems are improving.

"Society is about to change, and instead of warning about the hypochondria of randomly searching the internet for symptoms, we may soon get our medical advice from Doctor Google or Nurse Bing."

ChatGPT also proved useful to the authors of the medical exam study in another way.

They used the chatbot to help write it, said co-author Tiffany Kung.

More information: Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models, PLOS Digital Health (2023). DOI: 10.1371/journal.pdig.0000198

Journal information: PLOS Digital Health

Citation: ChatGPT takes on the tough US medical licensing exam (2023, February 9) retrieved 17 July 2024 from https://techxplore.com/news/2023-02-chatgpt-tough-medical-exam.html

This document is subject to copyright. Apart from any fair dealing for the purpose of private study or research, no part may be reproduced without the written permission. The content is provided for information purposes only.

Explore further

ChatGPT can (almost) pass the US Medical Licensing Exam

78 shares

Feedback to editors

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

12 hours ago

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

14 hours ago

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

16 hours ago

Large language models make human-like reasoning mistakes, researchers find

17 hours ago

Unveiling a new class of synthetic fuels

17 hours ago

Microsoft unveils software that allows LLMs to work with spreadsheets

17 hours ago

New technique to assess a general-purpose AI model's reliability before it's deployed

18 hours ago

New system enables intuitive teleoperation of a robotic manipulator in real-time

21 hours ago

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

22 hours ago

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Jul 15, 2024

Load comments (0)

ChatGPT takes on the tough US medical licensing exam

Dr. Google and Nurse Bing

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

ChatGPT can (almost) pass the US Medical Licensing Exam

ChatGPT found to be capable of passing exams for MBA and Medical Licensing Exam

ChatGPT bot passes US law school exam

Top French university bans students from using ChatGPT

ChatGPT maker fields tool for spotting AI-written text

Colombian judge uses ChatGPT in ruling

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Phys.org

Medical Xpress

Science X

ChatGPT takes on the tough US medical licensing exam

Dr. Google and Nurse Bing

Engineers evaluate cybersecurity risks associated with EV fast-charging equipment

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Giving drones wrap-and-grip wings to allow them to land on poles and tree limbs

Large language models make human-like reasoning mistakes, researchers find

Unveiling a new class of synthetic fuels

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

New system enables intuitive teleoperation of a robotic manipulator in real-time

Recycled micro-sized silicon anodes from photovoltaic waste improve lithium-ion battery performance

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Related Stories

ChatGPT can (almost) pass the US Medical Licensing Exam

ChatGPT found to be capable of passing exams for MBA and Medical Licensing Exam

ChatGPT bot passes US law school exam

Top French university bans students from using ChatGPT

ChatGPT maker fields tool for spotting AI-written text

Colombian judge uses ChatGPT in ruling

Recommended for you

New system enables intuitive teleoperation of a robotic manipulator in real-time

Machine learning framework maps global rooftop growth for sustainable energy and urban planning

Microsoft unveils software that allows LLMs to work with spreadsheets

New technique to assess a general-purpose AI model's reliability before it's deployed

Large language models make human-like reasoning mistakes, researchers find

You're just a stick figure to this camera—a new camera to prevent companies from collecting private information

Your Privacy