计算机工程与应用 ›› 2012, Vol. 48 ›› Issue (5): 118-120.

• 数据库、信号与信息处理 • 上一篇    下一篇

多用途汉语方言语音数据库的设计

高 原1,顾明亮1,2,孙 平2,王 侠2,张长水3   

  1. 1.徐州师范大学 语言科学学院,江苏 徐州 221116
    2.徐州师范大学 物理与电子工程学院,江苏 徐州 221116
    3.清华大学 自动化系,北京 100084
  • 收稿日期:1900-01-01 修回日期:1900-01-01 出版日期:2012-02-11 发布日期:2012-02-11

Design of general-purpose Chinese dialect speech database

GAO Yuan1, GU Mingliang1,2, SUN Ping2, WANG Xia2, ZHANG Changshui3   

  1. 1.School of Linguistic Science, Xuzhou Normal University, Xuzhou, Jiangsu 221116, China
    2.School of Physics and Electronic Engineering, Xuzhou Normal University, Xuzhou, Jiangsu 221116, China
    3.Department of Automation, Tsinghua University, Beijing 100084, China
  • Received:1900-01-01 Revised:1900-01-01 Online:2012-02-11 Published:2012-02-11

摘要: 建立了一个多用途汉语方言语音数据库,用于说话人信息处理、方言特征词识别、语音识别等领域的研究。以多通道的方式采集时长106小时的语音数据,包括七种主要的汉语方言区语音,对数据进行预处理。在此基础上提出了汉语方言数据库的设计标准以及实施方案,有助于推动汉语语音库特别是方言语音库的建立。

关键词: 汉语方言数据库, 说话人信息处理, 方言特征词识别

Abstract: This paper describes a general-purpose Chinese dialect speech database, which can be applied to speaker information analysis, character-words recognition, speech recognition etc. The speech database, which includes seven kinds of most common Chinese dialects, has reached one hundred and six hours by multi-channel record modes and has already preprocessed. Based on the work, the design criteria and implementation scheme of Chinese dialects speech database are proposed, which is useful for the establishment of Chinese speech database, especially Chinese dialect speech database.

Key words: Chinese dialect speech database, speaker information analysis, character-words recognition