FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization
Zhengkun Tian1,2; Jiangyan Yi1; Ye Bai1,2; Jianhua Tao1,2,3; Shuai Zhang1,2; Zhengqi Wen1
2021-06
会议名称INTERSPEECH
会议日期30 August – 3 September
会议地点Brno, Czechia
摘要

Transducer-based models, such as RNN-Transducer and transformer-transducer, have achieved great success in speech recognition. A typical transducer model decodes the output sequence conditioned on the current acoustic state and previously predicted tokens step by step. Statistically, The number of blank tokens in the prediction results accounts for nearly 90% of all tokens. It takes a lot of computation and time to predict the blank tokens, but only the non-blank tokens will appear in the final output sequence. Therefore, we propose a method named fast-skip regularization, which tries to align the blank position predicted by a transducer with that predicted by a connectionist temporal classification (CTC) model. During the inference, the transducer model can predict the blank tokens in advance by a simple CTC project layer without many complicated forward calculations of the transducer decoder and then skip them, which will reduce the computation and improve the inference speed greatly. All experiments are conducted on a public Chinese mandarin dataset AISHELL-1. The results show that the fast-skip regularization can indeed help the transducer model learn the blank position alignments. Besides, the inference with fast-skip can be speeded up nearly 4 times with only a little performance degradation.

收录类别EI
语种英语
文献类型会议论文
条目标识符http://ir.ia.ac.cn/handle/173211/48612
专题多模态人工智能系统全国重点实验室_智能交互
通讯作者Jianhua Tao
作者单位1.NLPR, Institute of Automation, Chinese Academy of Sciences
2.School of Artificial Intelligence, University of Chinese Academy of Sciences
3.CAS Center for Excellence in Brain Science and Intelligence Technology
第一作者单位模式识别国家重点实验室
通讯作者单位模式识别国家重点实验室
推荐引用方式
GB/T 7714
Zhengkun Tian,Jiangyan Yi,Ye Bai,et al. FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization[C],2021.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
tian21_interspeech.p(839KB)会议论文 开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Zhengkun Tian]的文章
[Jiangyan Yi]的文章
[Ye Bai]的文章
百度学术
百度学术中相似的文章
[Zhengkun Tian]的文章
[Jiangyan Yi]的文章
[Ye Bai]的文章
必应学术
必应学术中相似的文章
[Zhengkun Tian]的文章
[Jiangyan Yi]的文章
[Ye Bai]的文章
相关权益政策
暂无数据
收藏/分享
文件名: tian21_interspeech.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。