CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共17条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Self-supervised graph representation learning via bootstrapping 期刊论文
NEUROCOMPUTING, 2021, 卷号: 456, 页码: 88-96
作者:  Che, Feihu;  Yang, Guohua;  Zhang, Dawei;  Tao, Jianhua;  Liu, Tong
Adobe PDF(1379Kb)  |  收藏  |  浏览/下载:364/60  |  提交时间:2021/11/03
Graph representation learning  Self-supervised  Bootstrapping  Graph neural network  
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, 期号: 29, 页码: 1897 - 1911
作者:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  收藏  |  浏览/下载:188/55  |  提交时间:2021/06/25
端到端语音识别、迁移学习、知识蒸馏、老师-学生学习、BERT、非自回归语音识别  
DECN: Dialogical Emotion Correction Network for Conversational Emotion Recognition 期刊论文
Neurocomputing, 2021, 期号: 0, 页码: 0
作者:  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2238Kb)  |  收藏  |  浏览/下载:151/29  |  提交时间:2021/06/16
Emotion recognition in conversations (ERC)  Context-sensitive modeling  Dialogical Emotion Correction Network (DECN)  Interaction modeling  
A time-frequency channel attention and vectorization network for automatic depression level prediction 期刊论文
Neurocomputing, 2021, 期号: 450, 页码: 208-218
作者:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  收藏  |  浏览/下载:182/51  |  提交时间:2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
Multimodal Spatiotemporal Representation for Automatic Depression Level Detection 期刊论文
IEEE Transactions on Affective Computing, 2020, 期号: 0, 页码: 0
作者:  Mingyue Niu;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zheng Lian
Adobe PDF(2831Kb)  |  收藏  |  浏览/下载:188/51  |  提交时间:2021/06/01
Multimodal depression detection  Spatio-Temporal Attention  Audio/Video Segment-Level Feature  Eigen Evolution Pooling  Audio/Video Level Feature  Multimodal Attention Feature Fusion  
CTNet: Conversational Transformer Network for Emotion Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 985-1000
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  收藏  |  浏览/下载:354/58  |  提交时间:2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 期号: 29, 页码: 198-209
作者:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  收藏  |  浏览/下载:392/49  |  提交时间:2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
Expression Analysis Based on Face Regions in Real-world Conditions 期刊论文
International Journal of Automation and Computing, 2020, 卷号: 17, 期号: 1, 页码: 96-107
作者:  Zheng Lian;  Ya Li;  Jian-Hua Tao;  Jian Huang;  Ming-Yue Niu
浏览  |  Adobe PDF(1364Kb)  |  收藏  |  浏览/下载:221/42  |  提交时间:2021/02/22
Facial emotion analysis  face areas  class activation map  confusion matrix  concerned area.  
CTC Regularized Model Adaptation for Improving LSTM RNN Based Multi-Accent Mandarin Speech Recognition 期刊论文
JOURNAL OF SIGNAL PROCESSING SYSTEMS FOR SIGNAL IMAGE AND VIDEO TECHNOLOGY, 2018, 卷号: 90, 期号: 7, 页码: 985-997
作者:  Jiangyan Yi;  Zhengqi Wen;  Jianhua Tao;  Hao Ni;  Bin Liu
浏览  |  Adobe PDF(1416Kb)  |  收藏  |  浏览/下载:143/55  |  提交时间:2020/10/22
multi-accent, Mandarin speech recognition,LSTM-RNN-CTC, model adaptation, CTC regularization  
Deep Learning Based Speech Separation via NMF-style Reconstructions 期刊论文
IEEE/ACM TRANSACTIONS ON AUDIO, SPEECH, AND LANGUAGE PROCESSING, 2018, 卷号: 26, 期号: 11, 页码: 2043-2055
作者:  Shuai Nie;  Shan Liang;  Wenju Liu;  Xueliang Zhang;  Jianhua Tao
浏览  |  Adobe PDF(2922Kb)  |  收藏  |  浏览/下载:209/80  |  提交时间:2020/10/22
Speech separation  deep neural network (DNN)  nonnegative matrix factorization (NMF)  spectro-temporal structures