Exploiting the directional coherence function for multichannel source extraction
Liang, Shan1; Li, Guanjun1,2; Nie, Shuai1; Yang, ZhanLei3; Liu, WenJu1; Tao, Jianhua1
发表期刊SPEECH COMMUNICATION
ISSN0167-6393
2021-04-01
卷号128页码:1-14
通讯作者Liu, WenJu(lwj@nlpr.ia.ac.cn)
摘要The desired speech detector plays an important role for controlling the speech distortion in spatial filtering based speech enhancement algorithms. However, the conventional complex coherence(CC) based algorithms can only distinguish the coherent speech and diffuse noise. To improve the performance on the scenarios that both the coherent interference and diffuse noise are present, we propose a directional coherence function(DCF) based detector in this paper. Based on a pair of complementary filters which can suppress the diffuse noise and the coherent interference respectively, the DCF is computed as the normalized correlation between the filters? outputs. Meanwhile, the filters are solved by convex programming method and satisfy the constraints on speech distortionless and white noise gain(WNG). Consequently, the value of DCF will be close to 1 only for the desired speech dominated time-frequency(T-F) bins and much smaller than 1 for the noise or interference dominated T-F bins. To extract the desired speech, the DCF based Desired Speech Presence Probability(DSPP) is used to control the adaptation in general sidelobe canceler(GSC), and subsequently used as the post-filtering weight. Systematical experiments on several scenarios show that the proposed algorithm achieves significantly and consistently better noise suppression performance than the narrowband direction-of-arrival(DOA) estimates based algorithms.
关键词Directional coherence function Coherent-to-Diffuse Ratio General sidelobe canceller Desired Speech Presence Probability
DOI10.1016/j.specom.2021.01.002
收录类别SCI
语种英语
资助项目National Key RD Plan of China[2016YFB1001404] ; China National Nature Science Foundation[61971419] ; China National Nature Science Foundation[61573357] ; China National Nature Science Foundation[61503382] ; China National Nature Science Foundation[61403370] ; China National Nature Science Foundation[61273267] ; China National Nature Science Foundation[91120303]
项目资助者National Key RD Plan of China ; China National Nature Science Foundation
WOS研究方向Acoustics ; Computer Science
WOS类目Acoustics ; Computer Science, Interdisciplinary Applications
WOS记录号WOS:000631974400001
出版者ELSEVIER
七大方向——子方向分类语音识别与合成
引用统计
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/44138
专题多模态人工智能系统全国重点实验室_智能交互
通讯作者Liu, WenJu
作者单位1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit NLPR, Beijing, Peoples R China
2.Univ Chinese Acad Sci, Sch Artificial Intelligence, Beijing, Peoples R China
3.Huawei Technol, Shanghai, Peoples R China
第一作者单位模式识别国家重点实验室
通讯作者单位模式识别国家重点实验室
推荐引用方式
GB/T 7714
Liang, Shan,Li, Guanjun,Nie, Shuai,et al. Exploiting the directional coherence function for multichannel source extraction[J]. SPEECH COMMUNICATION,2021,128:1-14.
APA Liang, Shan,Li, Guanjun,Nie, Shuai,Yang, ZhanLei,Liu, WenJu,&Tao, Jianhua.(2021).Exploiting the directional coherence function for multichannel source extraction.SPEECH COMMUNICATION,128,1-14.
MLA Liang, Shan,et al."Exploiting the directional coherence function for multichannel source extraction".SPEECH COMMUNICATION 128(2021):1-14.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Liang, Shan]的文章
[Li, Guanjun]的文章
[Nie, Shuai]的文章
百度学术
百度学术中相似的文章
[Liang, Shan]的文章
[Li, Guanjun]的文章
[Nie, Shuai]的文章
必应学术
必应学术中相似的文章
[Liang, Shan]的文章
[Li, Guanjun]的文章
[Nie, Shuai]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。