TY - JOUR
T1 - Artificial Intelligence Language Model Performance for Rapid Intraoperative Queries in Plastic Surgery
T2 - ChatGPT and the Deep Inferior Epigastric Perforator Flap
AU - Atkinson, Connor J.
AU - Seth, Ishith
AU - Xie, Yi
AU - Ross, Richard J.
AU - Hunter-Smith, David J.
AU - Rozen, Warren M.
AU - Cuomo, Roberto
N1 - Publisher Copyright:
© 2024 by the authors.
PY - 2024/2/1
Y1 - 2024/2/1
N2 - Background: The integration of artificial intelligence in healthcare has led to the development of large language models that can address various medical queries, including intraoperatively. This study investigates the potential of ChatGPT in addressing intraoperative questions during the deep inferior epigastric perforator flap procedure. Methods: A series of six intraoperative questions specific to the DIEP flap procedure, derived from real-world clinical scenarios, were proposed to ChatGPT. A panel of four experienced board-certified plastic surgeons evaluated ChatGPT’s performance in providing accurate, relevant, and comprehensible responses. Results: The Likert scale demonstrated to be medically accurate, systematic in presentation, and logical when providing alternative solutions. The mean readability score of the Flesch Reading Ease Score was 28.7 (±0.8), the Flesch–Kincaid Grade Level was 12.4 (±0.5), and the Coleman–Liau Index was 14.5 (±0.5). Suitability-wise, the DISCERN score of ChatGPT was 48 (±2.5) indicating suitable and comprehensible language for experts. Conclusions: Generative AI tools such as ChatGPT can serve as a supplementary tool for surgeons to offer valuable insights and foster intraoperative problem-solving abilities. However, it lacks consideration of individual patient factors and surgical nuances. Nevertheless, further refinement of its training data and rigorous scrutiny under experts to ensure the accuracy and up-to-date nature of the information holds the potential for it to be utilized in the surgical field.
AB - Background: The integration of artificial intelligence in healthcare has led to the development of large language models that can address various medical queries, including intraoperatively. This study investigates the potential of ChatGPT in addressing intraoperative questions during the deep inferior epigastric perforator flap procedure. Methods: A series of six intraoperative questions specific to the DIEP flap procedure, derived from real-world clinical scenarios, were proposed to ChatGPT. A panel of four experienced board-certified plastic surgeons evaluated ChatGPT’s performance in providing accurate, relevant, and comprehensible responses. Results: The Likert scale demonstrated to be medically accurate, systematic in presentation, and logical when providing alternative solutions. The mean readability score of the Flesch Reading Ease Score was 28.7 (±0.8), the Flesch–Kincaid Grade Level was 12.4 (±0.5), and the Coleman–Liau Index was 14.5 (±0.5). Suitability-wise, the DISCERN score of ChatGPT was 48 (±2.5) indicating suitable and comprehensible language for experts. Conclusions: Generative AI tools such as ChatGPT can serve as a supplementary tool for surgeons to offer valuable insights and foster intraoperative problem-solving abilities. However, it lacks consideration of individual patient factors and surgical nuances. Nevertheless, further refinement of its training data and rigorous scrutiny under experts to ensure the accuracy and up-to-date nature of the information holds the potential for it to be utilized in the surgical field.
KW - artificial intelligence
KW - ChatGPT
KW - DIEP
KW - intraoperative
KW - large language model
KW - plastic surgery
UR - http://www.scopus.com/inward/record.url?scp=85184726594&partnerID=8YFLogxK
U2 - 10.3390/jcm13030900
DO - 10.3390/jcm13030900
M3 - Article
C2 - 38337594
AN - SCOPUS:85184726594
SN - 2077-0383
VL - 13
JO - Journal of Clinical Medicine
JF - Journal of Clinical Medicine
IS - 3
M1 - 900
ER -