Knowledge Commons of Institute of Automation,CAS
Monaural voiced speech segregation based on elaborate harmonic grouping strategies | |
Liu WenJu; Zhang XueLiang; Jiang Wei; Li Peng; Xu Bo | |
Journal | SCIENCE CHINA-INFORMATION SCIENCES |
Publication date | 2011-12-01 |
Volume | 54, Issue: 12, Pages: 2471-2480 |
Article type | Article |
Abstract | This paper proposes an enhanced algorithm for monaural voiced speech segregation based on several elaborate harmonic grouping strategies. Its main contributions lie in three aspects. First, the algorithm classifies time-frequency (T-F) units into resolved and unresolved ones by carrier-to-envelope energy ratio, which yields more accurate classification than cross-channel correlation. Second, resolved T-F units are grouped according to the minimum amplitude principle, which has been verified to exist in human perception, as well as the harmonic principle. Finally, an "enhanced" envelope autocorrelation function is employed to detect amplitude modulation rates, which substantially reduces half-frequency errors in the grouping of unresolved units. Systematic evaluation and comparison show that the proposed algorithm greatly improves separation performance. Specifically, the signal-to-noise ratio (SNR) is improved by 0.96 dB compared with the previous method. The algorithm also improves the PESQ score and the subjective perception score. |
Keywords | Computational Auditory Scene Analysis ; Voiced Speech Separation ; Harmonistic Principle ; Minimum Amplitude Principle ; Elaborate Harmonic Grouping Strategies |
WOS Headings | Science & Technology ; Technology |
Keywords [WOS] | BLIND SEPARATION ; MODULATION |
Indexed by | SCI |
Language | English |
WOS Research Area | Computer Science |
WOS Category | Computer Science, Information Systems |
WOS Accession Number | WOS:000297709400003 |
Document type | Journal article |
Item identifier | http://ir.ia.ac.cn/handle/173211/40922 |
Collection | Laboratory of Cognition and Decision-Making for Complex Systems_Auditory Models and Cognitive Computing |
Recommended citation GB/T 7714 | Liu WenJu, Zhang XueLiang, Jiang Wei, et al. Monaural voiced speech segregation based on elaborate harmonic grouping strategies[J]. SCIENCE CHINA-INFORMATION SCIENCES, 2011, 54(12): 2471-2480. |
APA | Liu WenJu, Zhang XueLiang, Jiang Wei, Li Peng, & Xu Bo. (2011). Monaural voiced speech segregation based on elaborate harmonic grouping strategies. SCIENCE CHINA-INFORMATION SCIENCES, 54(12), 2471-2480. |
MLA | Liu WenJu, et al. "Monaural voiced speech segregation based on elaborate harmonic grouping strategies". SCIENCE CHINA-INFORMATION SCIENCES 54.12 (2011): 2471-2480. |
Files in this item | There are no files associated with this item. |
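Note on the SNR figure in the abstract: the record does not state how the paper computes SNR. As a rough illustration only, the sketch below assumes the common definition that compares a segregated signal with a clean reference signal of the same length. The function name `snr_db` and the sample values are hypothetical and are not taken from the paper.

```python
import math


def snr_db(reference, estimate):
    """Return the SNR in dB of `estimate` measured against `reference`.

    Assumed definition (not confirmed by the record):
    10 * log10( sum(ref^2) / sum((ref - est)^2) ).
    """
    if len(reference) != len(estimate):
        raise ValueError("signals must have the same length")
    signal_energy = sum(r * r for r in reference)
    noise_energy = sum((r - e) ** 2 for r, e in zip(reference, estimate))
    if noise_energy == 0:
        return math.inf  # estimate matches the reference exactly
    if signal_energy == 0:
        return -math.inf  # silent reference, non-zero error
    return 10.0 * math.log10(signal_energy / noise_energy)


# Hypothetical toy signals: the estimate is the reference scaled by 0.9.
reference = [1.0, 0.0, -1.0, 0.0]
estimate = [0.9, 0.0, -0.9, 0.0]
print(round(snr_db(reference, estimate), 2))
```

Under this definition, a gain of 0.96 dB means the ratio of signal energy to residual error energy grows by a factor of 10^(0.096), roughly 1.25.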