West China Journal of Stomatology (华西口腔医学杂志) ›› 2020, Vol. 38 ›› Issue (2): 149-154. doi: 10.7518/hxkq.2020.02.007

• Clinical Research •

Establishment and application of a Mandarin Chinese cleft palate speech database

马平川1, 毛渤淳1, 郭春丽1, 于晨浩1, 李若琳1, 何凌2, 尹恒1

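  1. State Key Laboratory of Oral Diseases, National Clinical Research Center for Oral Diseases, Dept. of Cleft Lip and Palate Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041
    2. School of Electrical Engineering and Information, Sichuan University, Chengdu 610065
  • Received: 2019-01-09 Revised: 2019-12-27 Online: 2020-04-01 Published: 2020-04-15
  • Corresponding author: 尹恒 (Yin Heng) E-mail: yinheng@scu.edu.cn
  • About the author: 马平川 (Ma Pingchuan), bachelor's degree, E-mail: mapingchuan1997@126.com
  • Funding:
    Youth Program of the National Natural Science Foundation of China (61503264)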

Establishment and application of a Mandarin cleft palate speech database

Ma Pingchuan1, Mao Bochun1, Guo Chunli1, Yu Chenhao1, Li Ruoling1, He Ling2, Yin Heng1

  1. State Key Laboratory of Oral Diseases & National Clinical Research Center for Oral Diseases & Dept. of Cleft Lip and Palate Surgery, West China Hospital of Stomatology, Sichuan University, Chengdu 610041, China
    2. School of Electrical Engineering and Information, Sichuan University, Chengdu 610065, China
  • Received: 2019-01-09 Revised: 2019-12-27 Online: 2020-04-01 Published: 2020-04-15
  • Contact: Heng Yin E-mail: yinheng@scu.edu.cn
  • Supported by:
    Youth Program of National Natural Science Foundation of China (61503264)

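Abstract:

Objective To collect speech samples from patients with cleft palate and, after uniform annotation, grading, and organization, to establish a Mandarin Chinese cleft palate speech database that lays a foundation for the diagnosis of hypernasality and of cleft palate speech more broadly, for clinical teaching, for the standardized training of speech therapists specializing in cleft palate, and for research on cleft palate speech. Methods A total of 768 patients and volunteers from the Speech Therapy Center of West China Hospital of Stomatology, Sichuan University, were enrolled between May 2016 and March 2018. Speech samples were collected using standard Mandarin assessment materials. All samples were segmented and classified by hypernasality grade, then organized and stored in the cleft lip and palate bioinformatics database platform of West China Hospital of Stomatology, Sichuan University. Results The database includes 768 subjects: 456 children (227 male, 229 female) and 312 adults (178 male, 134 female). Of these, 369 had normal resonance, 155 mild hypernasality, 102 moderate hypernasality, and 142 severe hypernasality. The database contains 64 512 words, 24 576 phonemes, and 7 680 digits, completing the construction of the Mandarin hypernasality speech database. Conclusion This study is the first to establish a database of hypernasality in Mandarin cleft palate speech. It has already supplied source data for several speech signal studies and has been put to use in clinical teaching, and it is expected to be an important aid and guide for diagnostic teaching and related research on Mandarin-speaking patients with cleft palate.

Key words: database, cleft palate speech, hypernasality, Mandarin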

Abstract:

Objective This research aims to collect speech samples from patients with cleft palate, establish a Mandarin database of cleft palate speech after sample annotation and classification, and provide a reference for the diagnosis of hypernasality and cleft palate speech, for clinical education, for the standardized training of professional speech therapists, and for related research. Methods Speech samples were collected from a total of 768 patients and volunteers at the Speech Therapy Center, West China Hospital of Stomatology, between May 2016 and March 2018. The samples were segmented and categorized by hypernasality grade before being saved into the cleft lip and palate biologic information database. Results A Mandarin database of cleft palate speech was established from 768 subjects: 456 children (227 male, 229 female) and 312 adults (178 male, 134 female). Of these, 369 had normal resonance, 155 mild hypernasality, 102 moderate hypernasality, and 142 severe hypernasality. The database contains 64 512 words, 24 576 phonemes, and 7 680 digits. Conclusion This study is the first to establish a Mandarin database of cleft palate speech, which is of value for teaching the speech pathology of cleft palate in Mandarin and for further research.
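
The counts reported in the Results can be cross-checked with a few lines of arithmetic. The Python sketch below is illustrative only: the variable and function names are hypothetical, and the per-subject figures it derives assume that every subject recorded the same set of items, which the abstract does not state explicitly.

```python
# Counts as reported in the abstract.
TOTAL_SUBJECTS = 768
age_groups = {
    "children": {"male": 227, "female": 229},
    "adults": {"male": 178, "female": 134},
}
hypernasality = {"normal": 369, "mild": 155, "moderate": 102, "severe": 142}
items = {"words": 64512, "phonemes": 24576, "digits": 7680}


def check_counts():
    """Return the subject totals implied by the age/sex and grade breakdowns."""
    age_total = sum(sum(group.values()) for group in age_groups.values())
    grade_total = sum(hypernasality.values())
    return age_total, grade_total


def per_subject(item_totals, n_subjects):
    """Split each item total into (items per subject, remainder).

    Assumption (not stated in the abstract): every subject recorded
    the same material, so each total should divide evenly.
    """
    return {name: divmod(total, n_subjects) for name, total in item_totals.items()}


if __name__ == "__main__":
    print(check_counts())
    print(per_subject(items, TOTAL_SUBJECTS))
```

Both breakdowns sum to the stated 768 subjects, and each item total divides evenly by 768, which is consistent with (but does not prove) a uniform recording protocol of 84 words, 32 phonemes, and 10 digits per subject.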

Key words: database, cleft palate speech, hypernasality, Mandarin

Chinese Library Classification (CLC) number: