CASIA OpenIR
(This search is based on the user's claimed works)

Browse/Search results: 32 records in total, showing 1-10

Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 198-209
Authors:  Fan, Cunhang;  Yi, Jiangyan;  Tao, Jianhua;  Tian, Zhengkun;  Liu, Bin;  Wen, Zhengqi
Adobe PDF(2534Kb)  |  Views/Downloads: 350/47  |  Submitted: 2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
CTNet: Conversational Transformer Network for Emotion Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 985-1000
Authors:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2230Kb)  |  Views/Downloads: 317/58  |  Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
A time-frequency channel attention and vectorization network for automatic depression level prediction (Journal article)
Neurocomputing, 2021, Issue: 450, Pages: 208-218
Authors:  Mingyue Niu;  Bin Liu;  Jianhua Tao;  Qifei Li
Adobe PDF(2001Kb)  |  Views/Downloads: 159/42  |  Submitted: 2021/06/01
Sphere embedding normalization  DenseNet  Transition layer  Time-frequency channel attention block  Time-frequency vectorization block  Depression detection  
MULTI-SCALE AND MULTI-REGION FACIAL DISCRIMINATIVE REPRESENTATION FOR AUTOMATIC DEPRESSION LEVEL PREDICTION (Conference paper)
Toronto, Canada, 2021-6
Authors:  Mingyue Niu;  Jianhua Tao;  Bin Liu
Adobe PDF(1629Kb)  |  Views/Downloads: 164/48  |  Submitted: 2021/06/01
Conversational Emotion Recognition Using Self-Attention Mechanisms and Graph Neural Networks (Conference paper)
Shanghai, China, 25-29 October, 2020
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zhanlei Yang;  Rongjun Li
Adobe PDF(509Kb)  |  Views/Downloads: 116/38  |  Submitted: 2021/06/16
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, Volume: 28, Issue: 28, Pages: 1303-1314
Authors:  Fan, Cunhang;  Tao, Jianhua;  Liu, Bin;  Yi, Jiangyan;  Wen, Zhengqi;  Liu, Xuefei
Adobe PDF(1344Kb)  |  Views/Downloads: 262/57  |  Submitted: 2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training  
Multi-modal Continuous Dimensional Emotion Recognition Using Recurrent Neural Network and Self-Attention Mechanism (Conference paper)
Seattle, United States, 12-16 October, 2020
Authors:  Licai Sun;  Zheng Lian;  Jianhua Tao;  Bin Liu;  Mingyue Niu
Adobe PDF(1041Kb)  |  Views/Downloads: 153/48  |  Submitted: 2021/06/16
Listen Attentively, and Spell Once: Whole Sentence Generation via a Non-Autoregressive Architecture for Low-Latency Speech Recognition (Conference paper)
Shanghai, 2020
Authors:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(801Kb)  |  Views/Downloads: 107/28  |  Submitted: 2021/06/25
Multimodal Spatiotemporal Representation for Automatic Depression Level Detection (Journal article)
IEEE Transactions on Affective Computing, 2020, Issue: 0, Pages: 0
Authors:  Mingyue Niu;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zheng Lian
Adobe PDF(2831Kb)  |  Views/Downloads: 167/49  |  Submitted: 2021/06/01
Multimodal depression detection  Spatio-Temporal Attention  Audio/Video Segment-Level Feature  Eigen Evolution Pooling  Audio/Video Level Feature  Multimodal Attention Feature Fusion  
Context-Dependent Domain Adversarial Neural Network for Multimodal Emotion Recognition (Conference paper)
Shanghai, China, 25-29 October, 2020
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang;  Zhanlei Yang;  Rongjun Li
Adobe PDF(348Kb)  |  Views/Downloads: 136/42  |  Submitted: 2021/06/16