Knowledge Commons of Institute of Automation,CAS
AN INVESTIGATION OF SUMMED-CHANNEL SPEAKER RECOGNITION WITH MULTI-SESSION ENROLLMENT | |
Shanshan, Zhang; Ce, Zhang; Rong, Zheng; Xu, Bo; Shanshan,Zhang | |
2014 | |
会议名称 | 2014 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP) |
会议录名称 | IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) |
会议日期 | 2014 |
会议地点 | Florence |
摘要 | ;This paper describes a general framework of speaker recognition on summed-channel condition for both enrolling and test data. We present several methods for clustering the target speaker who is involved in multiple summed-channel enrolling excerpts. In our approach, each excerpt is segmented separately by a speaker diarization system as the first stage. Then segments belonging to the same speaker are clustered to train the target speaker model, and speaker verification is applied finally. We propose several effective objective functions to measure the purity of clustered segments in multi-session enrollment. Different confidence measures for summed-channel scoring are also presented. We report experimental results on female part in the NIST 2008 speaker recognition evaluation data, which show that our approach applied on summedchannel condition loses only 1% of the performance measured by equal error rates (EER) compared to the two-channel condition. |
关键词 | Speaker Recognition Summed-channel Speaker Clustering Multi-session |
收录类别 | EI |
语种 | 英语 |
文献类型 | 会议论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/41217 |
专题 | 复杂系统认知与决策实验室_听觉模型与认知计算 |
通讯作者 | Shanshan,Zhang |
推荐引用方式 GB/T 7714 | Shanshan, Zhang,Ce, Zhang,Rong, Zheng,et al. AN INVESTIGATION OF SUMMED-CHANNEL SPEAKER RECOGNITION WITH MULTI-SESSION ENROLLMENT[C],2014. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论