TY - JOUR
T1 - Evaluating the efficacy of major language models in providing guidance for hand trauma nerve laceration patients
T2 - a case study on Google’s AI BARD, Bing AI, and ChatGPT
AU - Lim, Bryan
AU - Seth, Ishith
AU - Bulloch, Gabriella
AU - Xie, Yi
AU - Hunter-Smith, David J.
AU - Rozen, Warren M.
N1 - Publisher Copyright: © The Author(s) 2023.
PY - 2023
Y1 - 2023
AB - This study evaluated three prominent large language models (LLMs), Google’s AI BARD, Bing’s AI, and ChatGPT-4, in providing patient advice for hand laceration. Each model was prompted with five simulated patient inquiries on hand trauma. A panel of Board-certified plastic surgical residents evaluated the responses for accuracy, comprehensiveness, and appropriate sources. Responses were also compared against existing literature and guidelines. The results suggest that ChatGPT outperforms BARD and Bing AI in providing reliable, evidence-based clinical advice, but all three models still face limitations in depth and specificity. Healthcare professionals remain essential in interpreting LLM recommendations, and future research should improve LLM performance by integrating specialized databases and human expertise to advance nerve injury management and optimize patient-centred care.
KW - Artificial intelligence
KW - BARD
KW - Bing AI
KW - ChatGPT
KW - large language model
KW - nerve injury
UR - http://www.scopus.com/inward/record.url?scp=85172765435&partnerID=8YFLogxK
U2 - 10.20517/2347-9264.2023.70
DO - 10.20517/2347-9264.2023.70
M3 - Article
AN - SCOPUS:85172765435
SN - 2347-9264
VL - 10
JO - Plastic and Aesthetic Research
JF - Plastic and Aesthetic Research
M1 - 43
ER -