CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共135条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
EmotionNAS: Two-stream Neural Architecture Search for Speech Emotion Recognition 会议论文
, Dublin, Ireland, 20-24 August 2023
作者:  Haiyang Sun;  Zheng Lian;  Bin Liu;  Ying Li;  Licai Sun;  Cong Cai;  Jianhua Tao;  Meng Wang;  Yuan Cheng
Adobe PDF(826Kb)  |  收藏  |  浏览/下载:7/4  |  提交时间:2024/05/31
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:11/3  |  提交时间:2024/05/31
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition 会议论文
, Ottawa, ON, Canada, October 29-November 3, 2023
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(1960Kb)  |  收藏  |  浏览/下载:6/2  |  提交时间:2024/05/31
MER 2023: Multi-label Learning, Modality Robustness, and Semi-Supervised Learning 会议论文
, Ottawa, ON, Canada, October 29-November 3, 2023
作者:  Zheng Lian;  Haiyang Sun;  Licai Sun;  Kang Chen;  Mingyu Xu;  Kexin Wang;  Ke Xu;  Yu He;  Ying Li;  Jinming Zhao;  Ye Liu;  Bin Liu;  Jiangyan Yi;  Meng Wang;  Erik Cambria;  Guoying Zhao;  Björn W. Schuller;  Jianhua Tao
Adobe PDF(993Kb)  |  收藏  |  浏览/下载:7/2  |  提交时间:2024/05/31
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:13/3  |  提交时间:2024/05/31
GPT-4V with Emotion: A Zero-shot Benchmark for Generalized Emotion Recognition 期刊论文
Information Fusion, 2024, 页码: 1-12
作者:  Zheng Lian;  Licai Sun;  Haiyang Sun;  Kang Chen;  Zhuofan Wen;  Hao Gu;  Bin Liu;  Jianhua Tao
Adobe PDF(6888Kb)  |  收藏  |  浏览/下载:16/1  |  提交时间:2024/05/31
Adversarial Multi-Task Learning for Mandarin Prosodic Boundary Prediction With Multi-Modal Embeddings 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2023, 卷号: 31, 页码: 2963-2973
作者:  Yi, Jiangyan;  Tao, Jianhua;  Fu, Ruibo;  Wang, Tao;  Zhang, Chu Yuan;  Wang, Chenglong
收藏  |  浏览/下载:54/0  |  提交时间:2023/11/17
Adversarial training  multi-task learning  prosodic boundaries  speech synthesis  multi-modal embeddings  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 7, 页码: 8419-8432
作者:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF(3959Kb)  |  收藏  |  浏览/下载:140/0  |  提交时间:2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
SMIN: Semi-Supervised Multi-Modal Interaction Network for Conversational Emotion Recognition 期刊论文
IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, 2023, 卷号: 14, 期号: 3, 页码: 2415-2429
作者:  Lian, Zheng;  Liu, Bin;  Tao, Jianhua
Adobe PDF(2103Kb)  |  收藏  |  浏览/下载:111/1  |  提交时间:2023/11/15
Emotion recognition  Feature extraction  Training  Acoustics  Semisupervised learning  Benchmark testing  Hidden Markov models  Semi-supervised multi-modal interaction network (SMIN)  conversational emotion recognition  semi-supervised learning  intra-modal interaction  cross-modal interaction  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:216/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech