CASIA OpenIR  > 数字内容技术与服务研究中心  > 听觉模型与认知计算
An End-to-End Text-Independent Speaker Identification System on Short Utterances
Ji RF(吉瑞芳); Cai XY(蔡新元); Xu B(徐波)
2018-09
Conference NameInterspeech 2018
Conference Date2018-9-2——2018-9-6
Conference Place印度,海得拉巴
Abstract

In the field of speaker recognition, text-independent speaker identification on short utterances is still a challenging task, since it is rather tough to extract a robust and dicriminative speaker feature in short duration condition. This paper explores an end-to-end speaker identification system, which maps utterances to a speaker identity subspace where the similarity of speakers can be measured by Euclidean distance. To be specific, we apply GRU architectures to extract utterance-level feature. Then it is assumed that one’s various utterances can be viewed as transformations of a single object in an ideal speaker identity subspace. Based on this assumption, the ResCNN architecture is utilized to model the transformation, and the whole system is jointly optimized by speaker identity subspace loss. Experimental results demonstrate the effectiveness of our proposed system and superiority over pervious methods. For example, the GRU learned feature reduces the equal error rate by 27.53% relatively and the speaker identity subspace loss further brings 7.22% relative reduction compared to softmax loss.

Document Type会议论文
Identifierhttp://ir.ia.ac.cn/handle/173211/23545
Collection数字内容技术与服务研究中心_听觉模型与认知计算
Affiliation中国科学院大学,中科院自动化研究所
Recommended Citation
GB/T 7714
Ji RF,Cai XY,Xu B. An End-to-End Text-Independent Speaker Identification System on Short Utterances[C],2018.
Files in This Item: Download All
File Name/Size DocType Version Access License
吉瑞芳_[2018 Interspeec(605KB)会议论文 开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Ji RF(吉瑞芳)]'s Articles
[Cai XY(蔡新元)]'s Articles
[Xu B(徐波)]'s Articles
Baidu academic
Similar articles in Baidu academic
[Ji RF(吉瑞芳)]'s Articles
[Cai XY(蔡新元)]'s Articles
[Xu B(徐波)]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Ji RF(吉瑞芳)]'s Articles
[Cai XY(蔡新元)]'s Articles
[Xu B(徐波)]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: 吉瑞芳_[2018 Interspeech]An End-to-End Text-Independent Speaker Identification System on Short Utterances.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.