West China Journal of Stomatology ›› 2023, Vol. 41 ›› Issue (2): 208-217.doi: 10.7518/hxkq.2023.2022301

Previous Articles     Next Articles

Construction of a caries diagnosis model based on microbiome novelty score

Sun Yanfei1,2(), Lu Jie3, Yang Jiazhen4, Liu Yuhan5, Liu Lu6, Zeng Fei7, Niu Yufen2,8, Dong Lei2,8, Yang Fang1,2()   

  1. 1.School of Stomatology, Qingdao University, Qingdao 266003, China
    2.Dept. of Pediatric Dentistry, Center of Stomatology, Municipal Hospital, Qingdao 266071, China
    3.Dept. of Stomatology, Pujiang Stomatological Hospital, Jinhua 322299, China
    4.Dept. of Pediatric Dentistry, Stomatological Hospital of Qingdao, Qingdao 266000, China
    5.Central Laboratory, Stomatological Hospital of Qing-dao, Qingdao 266000, China
    6.Qingdao Institute of Bioenergy and Bioprocess Technology, Chinese Academy of Sciences, Qingdao 266101, China
    7.Dept. of Stomatology, Affiliated Hospital of Jining Medical University, Jining 272000, China
    8.School of Stomatology, Dalian Medical University, Dalian 116044, China
  • Received:2022-08-06 Revised:2022-12-30 Online:2023-04-01 Published:2023-04-14
  • Contact: Yang Fang E-mail:yanfei1201@163.com;yangf82@qdu.edu.cn
  • Supported by:
    The National Natural Science Foundation of China(81670979);Correspondence: Yang Fang, E-mail: yangf82@qdu.edu.cn

Abstract:

Objective This study aimed to analyze the bacteria in dental caries and establish an optimized dental-ca-ries diagnosis model based on 16S ribosomal RNA (rRNA) data of oral flora. Methods We searched the public databa-ses of microbiomes including NCBI, MG-RAST, EMBL-EBI, and QIITA and collected data involved in the relevant research on human oral microbiomes worldwide. The samples in the caries dataset (1 703) were compared with healthy ones (20 540) by using the microbial search engine (MSE) to obtain the microbiome novelty score (MNS) and construct a caries diagnosis model based on this index. Nonparametric multivariate ANOVA was used to analyze and compare the impact of different host factors on the oral flora MNS, and the model was optimized by controlling related factors. Finally, the effect of the model was evaluated by receiver operating characteristic (ROC) curve analysis. Results 1) The oral microbiota distribution obviously differed among people with various oral-health statuses, and the species richness and species diversity index decreased. 2) ROC curve was used to evaluate the caries data set, and the area under ROC curve was AUC=0.67. 3) Among the five hosts’ factors including caries status, country, age, decayed missing filled tooth (DMFT) indices, and sampling site displayed the strongest effect on MNS of samples (P=0.001). 4) The AUC of the model was 0.87, 0.74, 0.74, and 0.75 in high caries, medium caries, low caries samples in Chinese children, and mixed dental plaque samples after controlling host factors, respectively. Conclusion The model based on the analysis of 16S rRNA data of oral flora had good diagnostic efficiency.

Key words: oral microbiome, big-data, caries, high-throughput sequencing, microbiome

CLC Number: