CASIA OpenIR

Browse/Search results: 86 items in total, items 1-10

VLP2MSA: Expanding vision-language pre-training to multimodal sentiment analysis (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2024, Volume: 283, Pages: 9
Authors:  Yi, Guofeng;  Fan, Cunhang;  Zhu, Kang;  Lv, Zhao;  Liang, Shan;  Wen, Zhengqi;  Pei, Guanxiong;  Li, Taihao;  Tao, Jianhua
Views/Downloads: 53/0  |  Submitted: 2024/02/22
Multimodal sentiment analysis  Vision-language  Multimodal fusion  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 2241-2254
Authors:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
Views/Downloads: 196/0  |  Submitted: 2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech  
Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition (Journal article)
IEEE SIGNAL PROCESSING LETTERS, 2022, Pages: 762-766
Authors:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(934Kb)  |  Views/Downloads: 260/76  |  Submitted: 2022/06/14
FSR: Accelerating the Inference Process of Transducer-Based Models by Applying Fast-Skip Regularization (Conference paper)
Brno, Czechia, 30 August – 3 September
Authors:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(839Kb)  |  Views/Downloads: 181/41  |  Submitted: 2022/06/14
Self-Attention Transducers for End-to-End Speech Recognition (Conference paper)
Graz, Austria, September 15–19, 2019
Authors:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Ye Bai;  Zhengqi Wen
Adobe PDF(278Kb)  |  Views/Downloads: 90/36  |  Submitted: 2022/06/14
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition (Conference paper)
Shanghai, China, October 25–29, 2020
Authors:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Ye Bai;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(629Kb)  |  Views/Downloads: 146/46  |  Submitted: 2022/06/14
One In A Hundred: Selecting the Best Predicted Sequence from Numerous Candidates for Speech Recognition (Conference paper)
Tokyo, Japan, 14-17 December 2021
Authors:  Zhengkun Tian;  Jiangyan Yi;  Ye Bai;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(563Kb)  |  Views/Downloads: 165/39  |  Submitted: 2022/06/14
NeuralDPS: Neural Deterministic Plus Stochastic Model With Multiband Excitation for Noise-Controllable Waveform Generation (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, Volume: 30, Pages: 865-878
Authors:  Wang, Tao;  Fu, Ruibo;  Yi, Jiangyan;  Tao, Jianhua;  Wen, Zhengqi
Views/Downloads: 237/0  |  Submitted: 2022/06/06
Vocoders  Stochastic processes  Neural networks  Speech processing  Signal to noise ratio  Acoustics  Speech enhancement  Vocoder  speech synthesis  deterministic plus stochastic  multiband excitation  noise control  
Learn Spelling from Teachers: Transferring Knowledge from Language Models to Sequence-to-Sequence Speech Recognition (Conference paper)
Graz, 2019
Authors:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen
Adobe PDF(779Kb)  |  Views/Downloads: 57/8  |  Submitted: 2021/06/25
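Data Augmentation for DNN-HMM Acoustic Models Based on Random Time-Frequency Masking (基于随机时频掩蔽的 DNN-HMM 声学模型数据扩增) (Conference paper)
Xining, Qinghai, 2019
Authors:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengqi Wen
Adobe PDF(444Kb)  |  Views/Downloads: 150/26  |  Submitted: 2021/06/25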