Leveraging AI to Assist Clinicians with Physical Exams

Physical examinations are important diagnostic tools that can reveal critical insights into a patient's health, but complex conditions may be overlooked if a clinician lacks specialized training in the relevant area. While previous research has investigated large language models (LLMs) as aids in making diagnoses, their potential in physical exams remains unexplored. To address this gap, researchers from Mass General Brigham prompted the LLM GPT-4 to recommend physical exam instructions based on patient symptoms. The study suggests that LLMs could serve as aids for clinicians during physical exams. Results are published in the Journal of Medical Artificial Intelligence.

"Medical professionals early in their career may face challenges in performing the appropriate patient-tailored physical exam because of their limited experience or other context-dependent factors, such as lower resourced settings," said senior author Marc D. Succi, MD, strategic innovation leader at Mass General Brigham Innovation, associate chair of innovation and commercialization for enterprise radiology and executive director of the Medically Engineered Solutions in Healthcare (MESH) Incubator at Mass General Brigham. "LLMs have the potential to serve as a bridge and parallel support physicians and other medical professionals with physical exam techniques and enhance their diagnostic abilities at the point of care."

Succi and his colleagues prompted GPT-4 to recommend physical exam instructions based on the patient’s primary symptom, such as a painful hip. Three attending physicians then rated GPT-4’s responses on a scale of 1 to 5 points for accuracy, comprehensiveness, readability and overall quality. The researchers found that GPT-4 performed well at providing instructions, scoring at least 80% of the possible points. The highest score was for "Leg Pain Upon Exertion" and the lowest was for "Lower Abdominal Pain."
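To make the "percentage of possible points" figure concrete, here is a minimal sketch of how such a score could be computed. It assumes the points from every rater and criterion are simply summed and divided by the maximum attainable total; the article does not describe the study's actual aggregation method, and the function name and the ratings below are hypothetical, not data from the paper.

```python
# Illustrative sketch only: the aggregation rule and all ratings are assumptions.
CRITERIA = ("accuracy", "comprehensiveness", "readability", "overall quality")
MIN_SCORE, MAX_SCORE = 1, 5


def percent_of_possible(ratings):
    """Return the share of possible points earned, as a percentage.

    ratings: one dict per rater, mapping each criterion to a score from 1 to 5.
    """
    total = 0
    for rater in ratings:
        for criterion in CRITERIA:
            score = rater[criterion]
            if not MIN_SCORE <= score <= MAX_SCORE:
                raise ValueError(f"score out of range for {criterion}: {score}")
            total += score
    # Maximum attainable: top score on every criterion from every rater.
    possible = MAX_SCORE * len(CRITERIA) * len(ratings)
    return 100.0 * total / possible


# Hypothetical ratings from three raters for a single symptom prompt.
example_ratings = [
    {"accuracy": 5, "comprehensiveness": 4, "readability": 5, "overall quality": 4},
    {"accuracy": 4, "comprehensiveness": 4, "readability": 5, "overall quality": 4},
    {"accuracy": 5, "comprehensiveness": 3, "readability": 4, "overall quality": 4},
]

print(percent_of_possible(example_ratings))
```

With these made-up ratings, the three raters award 51 of 60 possible points, which would clear the 80% mark reported in the study.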

"GPT-4 performed well in many respects, yet its occasional vagueness or omissions in critical areas, like diagnostic specificity, remind us of the necessity of physician judgment to ensure comprehensive patient care," said lead author Arya Rao, a student researcher in the MESH Incubator attending Harvard Medical School.

Although GPT-4 provided detailed responses, the researchers found that it occasionally left out key instructions or was overly vague, indicating the need for a human evaluator. According to the researchers, the LLM’s strong performance suggests that it could become a tool to help fill gaps in physicians’ knowledge and aid in diagnosing medical conditions.

Rao, Arya S et al.
A Large Language Model-Guided Approach to the Focused Physical Exam.
Journal of Medical Artificial Intelligence, 2024. doi: 10.21037/jmai-24-275
