Institutional Repository of Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization | |
Zhengkun Tian1,2![]() ![]() ![]() ![]() ![]() ![]() | |
2021-06 | |
Conference Name | INTERSPEECH |
Conference Date | 30 August – 3 September |
Conference Place | Brno, Czechia |
Abstract | Transducer-based models, such as RNN-Transducer and transformer-transducer, have achieved great success in speech recognition. A typical transducer model decodes the output sequence conditioned on the current acoustic state and previously predicted tokens step by step. Statistically, The number of blank tokens in the prediction results accounts for nearly 90% of all tokens. It takes a lot of computation and time to predict the blank tokens, but only the non-blank tokens will appear in the final output sequence. Therefore, we propose a method named fast-skip regularization, which tries to align the blank position predicted by a transducer with that predicted by a connectionist temporal classification (CTC) model. During the inference, the transducer model can predict the blank tokens in advance by a simple CTC project layer without many complicated forward calculations of the transducer decoder and then skip them, which will reduce the computation and improve the inference speed greatly. All experiments are conducted on a public Chinese mandarin dataset AISHELL-1. The results show that the fast-skip regularization can indeed help the transducer model learn the blank position alignments. Besides, the inference with fast-skip can be speeded up nearly 4 times with only a little performance degradation. |
Indexed By | EI |
Language | 英语 |
Document Type | 会议论文 |
Identifier | http://ir.ia.ac.cn/handle/173211/48612 |
Collection | 模式识别国家重点实验室_智能交互 |
Corresponding Author | Jianhua Tao |
Affiliation | 1.NLPR, Institute of Automation, Chinese Academy of Sciences 2.School of Artificial Intelligence, University of Chinese Academy of Sciences 3.CAS Center for Excellence in Brain Science and Intelligence Technology |
First Author Affilication | Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China |
Corresponding Author Affilication | Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China |
Recommended Citation GB/T 7714 | Zhengkun Tian,Jiangyan Yi,Ye Bai,et al. FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization[C],2021. |
Files in This Item: | Download All | |||||
File Name/Size | DocType | Version | Access | License | ||
tian21_interspeech.p(839KB) | 会议论文 | 开放获取 | CC BY-NC-SA | View Download |
Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.
Edit Comment