CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共25条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
A Time Delay Neural Network with Shared Weight Self-Attention for Small-Footprint Keyword Spotting 会议论文
, graz, 2019
作者:  Ye Bai;  Jiangyan Yi;  Zhengqi Wen;  Zhengkun Tian;  Chenghao Zhao;  Cunhang Fan
Adobe PDF(290Kb)  |  收藏  |  浏览/下载:131/46  |  提交时间:2021/06/25
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 期号: 29, 页码: 1897 - 1911
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  收藏  |  浏览/下载:175/49  |  提交时间:2021/06/25
端到端语音识别、迁移学习、知识蒸馏、老师-学生学习、BERT、非自回归语音识别  
Deep Time Delay Neural Network for Speech Enhancement with Full Data Learning 会议论文
, Hong Kong, 24-27 Jan. 2021
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi;  Song, Leichao
Adobe PDF(934Kb)  |  收藏  |  浏览/下载:198/47  |  提交时间:2021/06/01
Utterance-level Permutation Invariant Training with Discriminative Learning for Single Channel Speech Separation 会议论文
, Taipei, Taiwan, 26-29 Nov. 2018
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Wen, Zhengqi;  Yi, Jiangyan;  Bai, Ye
Adobe PDF(208Kb)  |  收藏  |  浏览/下载:104/40  |  提交时间:2021/06/01
Joint Training for Simultaneous Speech Denoising and Dereverberation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(158Kb)  |  收藏  |  浏览/下载:175/52  |  提交时间:2021/06/01
Gated Recurrent Fusion of Spatial and Spectral Features for Multi-channel Speech Separation with Deep Embedding Representations 会议论文
, Shanghai, China, October 25–29, 2020
作者:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(260Kb)  |  收藏  |  浏览/下载:161/44  |  提交时间:2021/06/01
Discriminative Learning for Monaural Speech Separation Using Deep Embedding Features 会议论文
, Graz, Austria, September 15–19, 2019
作者:  Fan, Cunhang;  Liu, Bin;  Tao, Jianhua;  Yi, Jiangyan;  Wen, Zhengqi
Adobe PDF(320Kb)  |  收藏  |  浏览/下载:114/39  |  提交时间:2021/06/01
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:345/47  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
Improving BLSTM RNN Based Mandarin Speech Recognition Using Accent Dependent Bottleneck Features 会议论文
, Jeju, Korea, December 13-16, 2016
作者:  Yi, Jiangyan;  Ni, Hao;  Wen, Zhengqi;  Tao, Jianhua
收藏  |  浏览/下载:34/0  |  提交时间:2020/10/27
The Parameterized Phoneme Identity Feature as a Continuous Real-Valued Vector for Neural Network based Speech Synthesis 会议论文
INTERSPEECH, San Francisco,USA, Sep 8-12, 2016
作者:  Wen ZQ(温正棋);  Li Y(李雅);  Tao JH(陶建华);  Wen, Zhengqi
收藏  |  浏览/下载:68/0  |  提交时间:2020/10/27
Phoneme Embedded Vector  Word Embedding  Speech Synthesis  Blstm-rnn