An iVector Extractor Using Pre-trained Neural Networks for Speaker Verification
Shanshan, Zhang; Rong, Zheng; Bo, Xu
2014
Conference name: International Symposium on Chinese Spoken Language Processing
Proceedings title: International Symposium on Chinese Spoken Language Processing
Conference date: 2014
Conference location: Singapore
Abstract
The iVector representation of speech utterances is currently
widely used in speaker and language recognition tasks. In this
paper, an iVector extractor using pre-trained neural networks
is proposed for speaker verification. It can be viewed as
an alternative to the classical total variability approach. In
the proposed system, a neural network with a bottleneck layer
is trained on speaker-labeled utterances, and the bottleneck
features of the network are then used to represent the input
utterance. As a new iVector representation, it shows performance
comparable to the state-of-the-art Total Variability Model
(TVM) based iVector extraction system on NIST 2008 SRE.
Combining the proposed extraction system with the TVM system
further achieves a 10% reduction in equal error rate.
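Keywords: iVector Extractor; Bottleneck Feature; Speaker Verification
Indexed by: EI
Language: English
Document type: Conference paper
Item identifier: http://ir.ia.ac.cn/handle/173211/11806
Collection: Research Center for Digital Content Technology and Services_Auditory Models and Cognitive Computing
Corresponding author: Shanshan, Zhang
Author affiliation: Interactive Digital Media Technology Research Center, Institute of Automation, Chinese Academy of Sciences
Recommended citation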
GB/T 7714
Shanshan, Zhang,Rong, Zheng,Bo, Xu. An iVector Extractor Using Pre-trained Neural Networks for Speaker Verification[C],2014.
Files in this item
File name/size: 301_Full_Paper.pdf (265KB); Document type: Conference paper; Access type: Open access; License: CC BY-NC-SA