Knowledge Commons of Institute of Automation, CAS
Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech | |
Li, Peng; Guan, Yong; Xu, Bo; Liu, Wenju | |
Journal | IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING |
2006-11-01 | |
Volume | 14; Issue: 6; Pages: 2014-2023 |
Article type | Article |
Abstract | Monaural speech separation is a very challenging problem in speech signal processing. It has been studied extensively, and many separation systems based on computational auditory scene analysis (CASA) have been proposed in the last two decades. Although the research on CASA has tended to introduce high-level knowledge into separation processes using primitive data-driven methods, the knowledge on speech quality still has not been combined with it. This makes the performance evaluation of CASA mainly focused on the signal-to-noise ratio (SNR) improvement. Actually, the quality of the separated speech is not directly related to its SNR. In order to solve this problem, we propose a new method which combines CASA with objective quality assessment of speech (OQAS). In the grouping process of CASA, we use OQAS as the guide to instruct the CASA system. With this combination, the performance of the speech separation can be improved not only in SNR, but also in mean opinion score (MOS). Our system is systematically evaluated and compared with previous systems, and it yields substantially better performance, especially for the subjective perceptual quality of separated speech. |
Keywords | Computational Auditory Scene Analysis (CASA) ; Grouping ; Monaural Speech Separation ; Objective Quality Assessment of Speech (OQAS) ; Segmentation |
WOS headings | Science & Technology ; Technology |
Keywords [WOS] | PITCH ; NOISE ; MODEL |
Indexed by | SCI |
Language | English |
WOS research area | Acoustics ; Engineering |
WOS subject category | Acoustics ; Engineering, Electrical & Electronic |
WOS accession number | WOS:000241567200014 |
Document type | Journal article |
Item identifier | http://ir.ia.ac.cn/handle/173211/9330 |
Collection | Research output before 2009 |
Author affiliations | 1. Chinese Acad Sci, Hightech Innovat Ctr, Inst Automat, Beijing 100080, Peoples R China 2. Chinese Acad Sci, Natl Lab Pattern Recognit, Inst Automat, Beijing 100080, Peoples R China |
Recommended citation (GB/T 7714) | Li, Peng, Guan, Yong, Xu, Bo, et al. Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech[J]. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2006, 14(6): 2014-2023. |
APA | Li, Peng, Guan, Yong, Xu, Bo, & Liu, Wenju. (2006). Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech. IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 14(6), 2014-2023. |
MLA | Li, Peng, et al. "Monaural speech separation based on computational auditory scene analysis and objective quality assessment of speech." IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING 14.6 (2006): 2014-2023. |
Files in this item |
File name/size | Document type | Version type | Access type | License |
李鹏-2006-IEEE Transac(673KB) | Journal article | Author's accepted manuscript | Open access | CC BY-NC-SA |