CASIA OpenIR
(This search is based on user-claimed works)

Browse/search results: 24 items in total, showing items 1-10

Hybrid Autoregressive and Non-Autoregressive Transformer Models for Speech Recognition (Journal article)
IEEE SIGNAL PROCESSING LETTERS, 2022, Pages: 762-766
Authors:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(934Kb)  |  Views/Downloads: 259/76  |  Submitted: 2022/06/14
Deep Learning for Mobile Mental Health: Challenges and recent advances (Journal article)
IEEE SIGNAL PROCESSING MAGAZINE, 2021, Volume: 38, Issue: 6, Pages: 96-105
Authors:  Han, Jing;  Zhang, Zixing;  Mascolo, Cecilia;  Andre, Elisabeth;  Tao, Jianhua;  Zhao, Ziping;  Schuller, Bjoern W.
Views/Downloads: 118/0  |  Submitted: 2021/12/28
Continual Learning for Fake Audio Detection (Conference paper)
Online (Czech Republic), 2021-9
Authors:  Ma Haoxin;  Yi Jiangyan;  Tao Jianhua;  Bai Ye;  Tian Zhengkun;  Wang Chenglong
Adobe PDF(2113Kb)  |  Views/Downloads: 224/58  |  Submitted: 2022/06/20
fake audio detection  continual learning  detecting fake without forgetting  
A time-frequency channel attention and vectorization network for automatic depression level prediction (Journal article)
Neurocomputing, 2021, Issue: 450, Pages: 208-218
Authors:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  Views/Downloads: 164/44  |  Submitted: 2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, Issue: 29, Pages: 1897-1911
Authors:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  Views/Downloads: 177/49  |  Submitted: 2021/06/25
End-to-end speech recognition  transfer learning  knowledge distillation  teacher-student learning  BERT  non-autoregressive speech recognition  
Spike-Triggered Non-Autoregressive Transformer for End-to-End Speech Recognition (Conference paper)
Shanghai, China, October 25–29, 2020
Authors:  Zhengkun Tian;  Jiangyan Yi;  Jianhua Tao;  Ye Bai;  Shuai Zhang;  Zhengqi Wen
Adobe PDF(629Kb)  |  Views/Downloads: 146/46  |  Submitted: 2022/06/14
Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks (Conference paper)
Shanghai, China, 25-29 October, 2020
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zhanlei Yang;  Rongjun Li
Adobe PDF(509Kb)  |  Views/Downloads: 119/39  |  Submitted: 2021/06/16
Multi-modal Continuous Dimensional Emotion Recognition Using Recurrent Neural Network and Self-Attention Mechanism (Conference paper)
Seattle, United States, 12-16 October, 2020
Authors:  Licai Sun;  Zheng Lian;  Jianhua Tao;  Bin Liu;  Mingyue Niu
Adobe PDF(1041Kb)  |  Views/Downloads: 154/49  |  Submitted: 2021/06/16
Multimodal Spatiotemporal Representation for Automatic Depression Level Detection (Journal article)
IEEE Transactions on Affective Computing, 2020, Issue: 0, Pages: 0
Authors:  Mingyue Niu;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zheng Lian
Adobe PDF(2831Kb)  |  Views/Downloads: 170/49  |  Submitted: 2021/06/01
Multimodal depression detection  Spatio-Temporal Attention  Audio/Video Segment-Level Feature  Eigen Evolution Pooling  Audio/Video Level Feature  Multimodal Attention Feature Fusion  
Phoneme dependent speaker embedding and model factorization for multi-speaker speech synthesis and adaptation (Conference paper)
Brighton, UK, May 12-17, 2019
Authors:  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi;  Zheng, Yibin
Adobe PDF(429Kb)  |  Views/Downloads: 213/74  |  Submitted: 2020/06/24
speech synthesis  speaker adaptation  speaker embedding  phoneme representation