CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection 会议论文
, Singapore, Singapore, 2022.05
作者:  Minglun Han;  Linhao Dong;  Zhenlin Liang;  Meng Cai;  Shiyu Zhou;  Zejun Ma;  Bo Xu
Adobe PDF(463Kb)  |  收藏  |  浏览/下载:161/45  |  提交时间:2023/05/29
Automatic Speech Recognition  Context Biasing  Speech Recognition Customization  Continuous Integrate-and-Fire Mechanism  
TWO-STAGE PRE-TRAINING FOR SEQUENCE TO SEQUENCE SPEECH RECOGNITION 会议论文
, 线上会议, 2021-7-18
作者:  Fan ZY(范志赟);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(230Kb)  |  收藏  |  浏览/下载:161/43  |  提交时间:2022/09/17
pre-training  speech recognition  encoder-decoder  sequence-to-sequence  
Exploring wav2vec 2.0 on speaker verification and language identification 会议论文
, 线上会议, 2021-8-30
作者:  Fan ZY(范志赟);  Li M(李蒙);  Zhou SY(周世玉);  Xu B(徐波)
Adobe PDF(2081Kb)  |  收藏  |  浏览/下载:164/31  |  提交时间:2022/09/17
self-supervised  speaker verification  language identification  multi-task learning  wav2vec 2.0  
CIF-Based Collaborative Decoding for End-to-End Contextual Speech Recognition 会议论文
, Toronto, Canada, 2021-06-06
作者:  Minglun Han;  Linhao Dong;  Shiyu Zhou;  Bo Xu
Adobe PDF(469Kb)  |  收藏  |  浏览/下载:123/37  |  提交时间:2023/05/29
Contextual Speech Recognition  Automatic Speech Recognition  Context Biasing  
Extending Recurrent Neural Aligner for Streaming End-to-End Speech Recognition in Mandarin 会议论文
, Hyderabad, India, 2018-09
作者:  Dong, Linhao;  Zhou, Shiyu;  Chen, Wei;  Xu, Bo
浏览  |  Adobe PDF(321Kb)  |  收藏  |  浏览/下载:239/69  |  提交时间:2020/06/13
speech recognition  recurrent neural aligner  mandarin  end-to-end  
Syllable-Based Sequence-to-Sequence Speech Recognition with the Transformer in Mandarin Chinese 会议论文
Interspeech, 印度的海德拉巴, 2018
作者:  Shiyu Zhou;  Linhao Dong;  Shuang Xu;  Bo Xu
收藏  |  浏览/下载:84/0  |  提交时间:2020/10/27
Asr  Multi-head Attention  Syllable Based Acoustic Modeling  Sequence-to-sequence  
A Comparison of Modeling Units in Sequence-to-Sequence Speech Recognition with the Transformer on Mandarin Chinese 会议论文
ICONIP, Siem Reap, Cambodia, 2018
作者:  Shiyu Zhou;  Linhao Dong;  Shuang Xu;  Bo Xu
收藏  |  浏览/下载:75/0  |  提交时间:2020/10/27
Asr  Multi-head Attention  Modeling Units  Sequence-to-sequence  Transformer  
Word-level Permutation and Improved Lower Frame Rate for RNN-Based Acoustic Modeling 会议论文
iconip2017, Guangzhou, China, November 14-18, 2017
作者:  Yuanyuan Zhao;  Shiyu Zhou;  Shuang Xu;  Bo Xu
收藏  |  浏览/下载:73/0  |  提交时间:2020/10/27
Rnn-based Acoustic Model  Acoustic Trajectory  Lower Frame Rate  Word-level Permutation  
Multilingual Recurrent Neural Networks with Residual Learning for Low-Resource Speech Recognition 会议论文
Interspeech, Stockholm, 2017
作者:  Shiyu Zhou;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
收藏  |  浏览/下载:56/0  |  提交时间:2020/10/27
Lstm  Multilingual Speech Recognition  Low-resource  Residual Learning  Shared-hidden-layer