CASIA OpenIR  > 09年以前成果
Statistic model based dynamic channel compensation for telephony speech recognition
Zhang, HY; Han, ZB; Xu, B
2004-10-01
发表期刊CHINESE JOURNAL OF ELECTRONICS
卷号13期号:4页码:665-670
文章类型Article
摘要The degradation of speech recognition performance in real-life environments and through transmission channels is a main embarrassment for many speech-based applications around the world, especially when non-stationary noise and changing channel exist. Previous works have shown that the main reason for this performance degradation is the variational mismatch caused by different telephone channels between the testing and training sets. In this paper, we propose a statistic model based implementation to dynamically compensate this mismatch. Firstly, we focus on a Maximum-likelihood (ML) estimation algorithm for telephone channels. In experiments on Mandarin Large vocabulary continuous speech recognition (LVCSR) over telephone lines, the Character error rate (CER) decreases more than 20%. The average delay is about 300similar to400ms. Secondly, we will extend it by introducing a phone-conditioned prior statistic model for the channels and applying Maximum a posteriori (MAP) estimation technique. Compared to the ML based method, the MAP based algorithm follows with the variations within channels more effectively. Average delay of the algorithm is decreased to 200ms. An additional 7similar to8% CER relative reduction is observed in LVCSR.
关键词Automatic Speech Recognition (Asr) Telephone Channel Compensation Statistic Model Maximum Likelihood Estimation Maximum a Posteriori Estimation
WOS标题词Science & Technology ; Technology
收录类别SCI
语种英语
WOS研究方向Engineering
WOS类目Engineering, Electrical & Electronic
WOS记录号WOS:000224787200024
引用统计
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/8927
专题09年以前成果
作者单位1.Chinese Acad Sci, Inst Automat, Hightech Innovat Ctr, Beijing 100080, Peoples R China
2.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100080, Peoples R China
推荐引用方式
GB/T 7714
Zhang, HY,Han, ZB,Xu, B. Statistic model based dynamic channel compensation for telephony speech recognition[J]. CHINESE JOURNAL OF ELECTRONICS,2004,13(4):665-670.
APA Zhang, HY,Han, ZB,&Xu, B.(2004).Statistic model based dynamic channel compensation for telephony speech recognition.CHINESE JOURNAL OF ELECTRONICS,13(4),665-670.
MLA Zhang, HY,et al."Statistic model based dynamic channel compensation for telephony speech recognition".CHINESE JOURNAL OF ELECTRONICS 13.4(2004):665-670.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zhang, HY]的文章
[Han, ZB]的文章
[Xu, B]的文章
百度学术
百度学术中相似的文章
[Zhang, HY]的文章
[Han, ZB]的文章
[Xu, B]的文章
必应学术
必应学术中相似的文章
[Zhang, HY]的文章
[Han, ZB]的文章
[Xu, B]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。