West China Journal of Stomatology ›› 2024, Vol. 42 ›› Issue (6): 810-815. doi: 10.7518/hxkq.2024.2024144

• Digital Stomatology Column •

Application value of generative artificial intelligence in the field of stomatology

Ye Yuanlong, Zeng Wei, Chen Jinlong, Liu Lei

  1. State Key Laboratory of Oral Diseases & National Center for Stomatology & National Clinical Research Center for Oral Diseases & Dept. of Traumatic and Plastic Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, China
  • Received: 2024-04-12 Revised: 2024-10-28 Online: 2024-12-01 Published: 2024-11-29
  • Contact: Liu Lei E-mail: 1401881557@qq.com; drliulei@163.com
  • About the author: Ye Yuanlong, physician, Ph.D., E-mail: 1401881557@qq.com
  • Supported by:
    National Key Clinical Specialty Project (2023)

Abstract:

Objective This study compares three generative artificial intelligence (GAI) tools to assess their application value and existing problems in the field of stomatology in the Chinese context, and thereby to provide a reference for their use. Methods A total of 36 questions were designed, covering the specialties of stomatology and spanning tasks such as medical record writing, answering professional knowledge questions, and article translation and polishing. The questions were submitted to ChatGPT4-turbo, Gemini (2024.2), and ERNIE Bot 4.0. Three experienced stomatologists then rated the answers in a blinded evaluation using a four-point Likert scale, and the value of GAI in each application scenario was assessed. Results For clinical documentation and image production, Gemini scored 45, ERNIE Bot 38, and ChatGPT 33. For research assistance, Gemini scored 45, ERNIE Bot 39, and ChatGPT 35. For teaching assistance, ERNIE Bot scored 54, Gemini 50, and ChatGPT 48. For patient consultation and guidance, Gemini scored 78, ERNIE Bot 59, and ChatGPT 48. The total scores were 218 for Gemini, 190 for ERNIE Bot, and 164 for ChatGPT. Among the application scenarios, the three highest-scoring were article translation and polishing (26), doctor-patient communication writing (23), and popular science writing (23). The two lowest-scoring were searching for and reporting on specified literature (13) and image generation (12). Conclusion In the Chinese context, application value in stomatology ranked, from highest to lowest, Gemini, ERNIE Bot, and ChatGPT. Overall, GAI showed considerable value in translation and polishing, doctor-patient communication writing, and popular science writing, and the lowest value in searching for and reporting on specified literature and in image generation.

Key words: generative artificial intelligence, Gemini, ERNIE Bot, ChatGPT, stomatology
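The total scores in the abstract can be checked against the four category scores it reports. The short Python sketch below does that arithmetic and derives the ranking. The dictionary layout and variable names are illustrative assumptions, not part of the study, and the abstract does not describe how the individual Likert ratings were aggregated into these category scores.

```python
# Category scores per model, copied from the Results in the abstract.
# The data structure and names here are illustrative, not the authors' own.
scores = {
    "Gemini": {
        "clinical documentation and image production": 45,
        "research assistance": 45,
        "teaching assistance": 50,
        "patient consultation and guidance": 78,
    },
    "ERNIE Bot": {
        "clinical documentation and image production": 38,
        "research assistance": 39,
        "teaching assistance": 54,
        "patient consultation and guidance": 59,
    },
    "ChatGPT": {
        "clinical documentation and image production": 33,
        "research assistance": 35,
        "teaching assistance": 48,
        "patient consultation and guidance": 48,
    },
}

# Sum the four category scores for each model.
totals = {model: sum(by_category.values()) for model, by_category in scores.items()}

# Rank models from highest to lowest total.
ranking = sorted(totals, key=totals.get, reverse=True)

print(totals)
print(ranking)
```

The sums reproduce the reported totals (218, 190, 164). The ranking holds in every category except teaching assistance, where ERNIE Bot scored highest.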

CLC Number: