CASIA OpenIR

Browse/Search results: 6 items in total, showing items 1-6

SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition (Journal article)
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, Volume: 14, Issue: 3, Pages: 2415-2429
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Views/Downloads: 79/0  |  Submitted: 2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 7, Pages: 8419-8432
Authors: Lian, Zheng; Chen, Lan; Sun, Licai; Liu, Bin; Tao, Jianhua
Views/Downloads: 117/0  |  Submitted: 2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
Gated Recurrent Fusion With Joint Training Framework for Robust End-to-End Speech Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 198-209
Authors: Fan, Cunhang; Yi, Jiangyan; Tao, Jianhua; Tian, Zhengkun; Liu, Bin; Wen, Zhengqi
Adobe PDF (2534 Kb)  |  Views/Downloads: 371/48  |  Submitted: 2021/03/08
Speech enhancement  Speech recognition  Training  Noise measurement  Logic gates  Acoustic distortion  Task analysis  Gated recurrent fusion  robust end-to-end speech recognition  speech distortion  speech enhancement  speech transformer  
CTNet: Conversational Transformer Network for Emotion Recognition (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Issue: 29, Pages: 985-1000
Authors: Lian, Zheng; Liu, Bin; Tao, Jianhua
Adobe PDF (2230 Kb)  |  Views/Downloads: 331/58  |  Submitted: 2021/05/06
Emotion recognition  Context modeling  Feature extraction  Fuses  Speech processing  Data models  Bidirectional control  Context-sensitive modeling  conversational transformer network (CTNet)  conversational emotion recognition  multimodal fusion  speaker-sensitive modeling  
WAGNN: A Weighted Aggregation Graph Neural Network for robot skill learning (Journal article)
ROBOTICS AND AUTONOMOUS SYSTEMS, 2020, Volume: 130, Pages: 9
Authors: Zhang, Fengyi; Liu, Zhiyong; Xiong, Fangzhou; Su, Jianhua; Qiao, Hong
Adobe PDF (1550 Kb)  |  Views/Downloads: 317/44  |  Submitted: 2020/07/20
Skill transfer learning  Serial structures  Robot skill learning  Graph Neural Network  
End-to-End Post-Filter for Speech Separation With Deep Attention Fusion Features (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2020, Volume: 28, Issue: 28, Pages: 1303-1314
Authors: Fan, Cunhang; Tao, Jianhua; Liu, Bin; Yi, Jiangyan; Wen, Zhengqi; Liu, Xuefei
Adobe PDF (1344 Kb)  |  Views/Downloads: 275/58  |  Submitted: 2020/06/22
Feature extraction  Training  Interference  Speech enhancement  Clustering algorithms  Spectrogram  Speech separation  end-to-end post-filter  deep attention fusion features  deep clustering  permutation invariant training