嵌入式语音识别系统中拒识算法的研究及实现

CASIA OpenIR > 毕业生 > 硕士学位论文

	嵌入式语音识别系统中拒识算法的研究及实现
其他题名	The Research And Realization of Rejection Algorithm in Embedded Speech Recognition Systems
	方敏
	2004-06-01
学位类型	工学硕士
中文摘要	语音识别技术经过多年发展正日趋成熟，各种语音识别系统也不断被开发出来。走向产品化是语音识别技术的终极目标，为此需要解决抗噪声、环境适应性、集外词检测与拒识等关键问题，以增强系统的鲁棒性和可靠性。其中，由于现在大多数识别器存在词表限制，高效拒识功能的加入对提高系统性能尤为重要。本文首先回顾了拒识算法研究所依据的统计语音识别理论与方法，并给出两种主流的汉语孤立词语音识别器：进而对语音识别拒识问题的本质及各种解决方法进行深入剖析，包括主流的显式建模方法和隐式建模（即可信度度量）方法：由于拒识算法目前大多受识别平台限制、与任务高度相关，因此本文针对两种不同的孤立词识别器，分别研究对应的平台相关拒识算法。在上述研究的基础上，本文进行平台无关拒识算法方面的一些探索，提出基于音节段模型识别的拒识方法。语音识别技术在嵌入式平台的应用是语音识别实用化、产品化的主要方向。嵌入式语音识别系统（ESRS）的研究开发涉及软件和硬件、资源和性能等多方面的协调，还涉及嵌入式环境下的特定算法，因而是一项富有挑战性的课题。本文对此进行系统分析和研究，并结合具体实例详细阐述嵌入式语音识别系统的设计原则、软硬件开发相关问题等。最后，对嵌入式语音识别系统的未来工作进行了展望。
英文摘要	After many years' development, Automatic Speech Recognition (ASR) technology has been mature; and many ASR systems have been developed. However, there are still many challenges confronting researchers, such as noise reducing, environment adaptation and Out-Of-Vocabulary (OOV) words detection and rejection etc. It is important to solve these problems to enhance the robustness and reliability of ASR systems. In this paper we briefly look back on stochastic speech recognition fundamentals and the framework of ASR systems. Then we focus on the rejection problem in ASR. Our discussion of rejection problem ranges from its definition, essence and evaluation to the popular solutions nowadays. We note that researchers Usually solve it in two ways, namely explicit modeling and implicit modeling (mainly by confidence measure). Due to the constraints of specific ASR tasks and specific recognizer to rejection, we propose several platform-dependent rejection approaches in two ASR platforms which models whole word and mandarin tri-phone respectively. In addition, we try some platform-independent rejection algorithms based on Syllable Segmental Models (SSM). The application of ASR technology in embedded systems is an important direction for its popularization. The development of Embedded Speech Recognition Systems (ESRS) involves in many factors such as hardware and software, resources and performances etc. In this paper, we analyze and expound the main problems of ESRS in details and give several examples we made during postgraduate stage. Finally we discuss the promising prospect of ASR SOC.
关键词	语音识别统计拒识嵌入式系统 Speech Recognition Stochastic Methods Rejection Embedded Systems
语种	中文
文献类型	学位论文
条目标识符	http://ir.ia.ac.cn/handle/173211/6770
专题	毕业生_硕士学位论文
推荐引用方式 GB/T 7714	方敏. 嵌入式语音识别系统中拒识算法的研究及实现[D]. 中国科学院自动化研究所. 中国科学院研究生院,2004.