All schools in Upshur County will close at 1 p.m. today. All B-UHS sporting events today are canceled.

WVU researchers test AI’s limits in emergency room diagnoses

WVU researchers found that while AI like ChatGPT can help emergency room doctors diagnose typical cases accurately, it often fails with patients who have atypical symptoms, emphasizing the need for more diverse data and human oversight.
West Virginia University researchers have determined AI technology can use input from physicians’ exam notes to assist in diagnosing diseases for patients with classic symptoms. (WVU Photo/Greg Ellis)

Artificial intelligence tools can assist emergency room physicians in accurately predicting disease but only for patients with typical symptoms, West Virginia University scientists have found.

Gangqing “Michael” Hu, assistant professor in the WVU School of Medicine Department of Microbiology, Immunology and Cell Biology and director of the WVU Bioinformatics Core facility, led a study that compared the precision and accuracy of four ChatGPT models in making medical diagnoses and explaining their reasoning.

His findings, published in the journal Scientific Reports, demonstrate the need for incorporating greater amounts of different types of data in training AI technology to assist in disease diagnosis.

More data can make the difference in whether AI gives patients the correct diagnoses for what are called “challenging cases,” which don’t exhibit classic symptoms. As an example, Hu pointed to a trio of scenarios from his study involving patients who had pneumonia without the typical fever.

“In these three cases, all of the GPT models failed to give an accurate diagnosis,” Hu said. “That made us dive in to look at the physicians’ notes and we noticed the pattern of these being challenging cases. ChatGPT tends to get a lot of information from different resources on the internet, but these may not cover atypical disease presentation.”

The study analyzed data from 30 public emergency department cases, which for reasons of privacy did not include demographics.

Hu explained that in using ChatGPT to assist with diagnosis, physicians’ notes are uploaded, and the tool is asked to provide its top three diagnoses. Results varied for the versions Hu tested: the GPT-3.5, GPT-4, GPT-4o and o1 series.

“When we looked at whether the AI models gave the correct diagnosis in any of their top three results, we didn’t see a significant improvement between the new version and the older version,” he said. “But when we look at each model’s number one diagnosis, the new version is about 15% to 20% higher in accuracy than the older version.”

Given AI models’ current low performance on complex and atypical cases, Hu said human oversight is a necessity for high-quality, patient-centered care when using AI as an assistive tool.

“We didn’t do this study out of curiosity to see if the new model will give better results. We wanted to establish a basis for future studies that involve additional input,” Hu said. “Currently, we input physician notes only. In the future we want to improve the accuracy by including images and findings from laboratory tests.”

Hu also plans to expand on findings from one of his recent studies in which he applied the ChatGPT-4 model to the task of role playing a physiotherapist, psychologist, nutritionist, artificial intelligence expert and athlete in a simulated panel discussion about sports rehabilitation.

He said he believes a model like that can improve AI’s diagnostic accuracy by taking a conversational approach in which multiple AI agents interact.

“From a position of trust, I think it’s very important to see the reasoning steps,” Hu said. “In this case, high-quality data including both typical and atypical cases helps build the trust.”

Hu emphasized that while ChatGPT is promising, it is not a certified medical device. He said if health care providers were to include images or other data in a clinical setting, the AI model would be an open-source system and installed in a hospital cluster to comply with privacy laws.

Other contributors to the study were Jinge Wang, a postdoctoral fellow, and Kenneth Shue, a lab volunteer from Montgomery County, Maryland, both in the School of Medicine Department of Microbiology, Immunology and Cell Biology; as well as Li Liu, Arizona State University. The work was supported by funding from the National Institutes of Health and National Science Foundation.

Hu said future research on using ChatGPT in emergency departments could examine whether enhancing AIs’ abilities to explain their reasoning could contribute to triage or decisions about patient treatment.

Share this story:

Local Businesses

RECENT Stories

Police seize $500,000 in banned products from vape stores in Upshur and Randolph counties

Authorities seized about $500,000 worth of banned vaping products and cash from five stores in Upshur and Randolph counties after receiving reports of illegal sales to minors, with arrests expected as the investigation continues.

Knabenshue named to Second Team All-State Softball Team

Buckhannon-Upshur freshman Madison Knabenshue, who led her team in hitting, scoring, and stolen bases, was named to the Second Team of the West Virginia All-State High School Softball Team.

Hank Ellis Insurance takes lead in men’s golf league standings

Hank Ellis Insurance surged into first place in the Buckhannon Country Club Bob Cupp Memorial Golf League after a decisive win, while First Community Bank leaped from last to a tie for fourth with Feola’s Flowers.

Former Buckhannon mayor sentenced to decade in prison for child pornography offense

David Walter McCauley, former Buckhannon mayor, was sentenced to 10 years in federal prison and 10 years supervised release for child pornography involving a 17-year-old boy, following an FBI-led investigation.

American Legion Post 7 cancels Flag Retirement Ceremony due to rain

The American Legion Post 7 has canceled its June 14 Flag Retirement Ceremony because of expected rain and thunderstorms, and will reschedule the event for a later date.

Sandra Louise Collins Garrison

Sandra Louise Collins Garrison, a longtime educator in Buckhannon, is remembered by family and former students following her death at age 73, with funeral services set for June 18 at Way of Holiness Church.
LUCAS Truck

LUCAS to offer lung cancer screening in Rock Cave, Weston, and Buckhannon

LUCAS, WVU Medicine’s mobile lung cancer screening unit, will offer low-dose CT scans in Rock Cave, Weston, and Buckhannon in July for insured and eligible uninsured patients by appointment.

Man arrested for allegedly attempting to cash unauthorized check at BC Bank in Buckhannon

Christian Lyons, 47, of Ashford, was arrested on a felony charge after allegedly attempting to cash a check without authorization at a Buckhannon bank, with surveillance and account details helping police identify him.
Summer basketball

Summer Basketball League continues to grow under UCRC director Mitchell Shaw

The Upshur County Summer Basketball League, overseen by UCRC director Mitchell Shaw, is thriving and expanding with 18 teams and 183 players, offering organized, community-focused competition and new sponsorship-driven prizes in Buckhannon.