CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 18 items in total, showing items 1-10

Distinguishing Neural Speech Synthesis Models Through Fingerprints in Speech Waveforms (conference paper)
Taiyuan, Shanxi, China, 2024-07-27
Authors:  Zhang, Chu Yuan;  Yi, Jiangyan;  Tao, Jianhua;  Wang, Chenglong;  Yan, Xinrui
Adobe PDF(2254Kb)  |  Views/Downloads: 25/12  |  Submitted: 2024/06/26
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis (journal article)
IEEE Transactions on Affective Computing, 2023, Volume: 15, Issue: 1, Pages: 1-17
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  Views/Downloads: 65/18  |  Submitted: 2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
MAE-DFER: Efficient Masked Autoencoder for Self-supervised Dynamic Facial Expression Recognition (conference paper)
Ottawa, ON, Canada, October 29-November 3, 2023
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(1960Kb)  |  Views/Downloads: 38/11  |  Submitted: 2024/05/31
VRA: Variational Rectified Activation for Out-of-distribution Detection (conference paper)
New Orleans, USA, December 10-16, 2023
Authors:  Mingyu Xu;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(1172Kb)  |  Views/Downloads: 37/12  |  Submitted: 2024/05/31
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition (journal article)
Information Fusion, 2024, Volume: 108, Pages: 1-20
Authors:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  Views/Downloads: 52/12  |  Submitted: 2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
GCNet: Graph Completion Network for Incomplete Multimodal Learning in Conversation (journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 7, Pages: 8419-8432
Authors:  Lian, Zheng;  Chen, Lan;  Sun, Licai;  Liu, Bin;  Tao, Jianhua
Adobe PDF(3959Kb)  |  Views/Downloads: 180/8  |  Submitted: 2023/11/17
Oral communication  Correlation  Data models  Task analysis  Feature extraction  Tensors  Benchmark testing  Conversational data  graph complete network (GCNet)  incomplete multimodal learning  speaker-sensitive modeling  temporal-sensitive modeling  
Multi-aspect self-supervised learning for heterogeneous information network (journal article)
KNOWLEDGE-BASED SYSTEMS, 2021, Volume: 233, Pages: 14
Authors:  Che, Feihu;  Tao, Jianhua;  Yang, Guohua;  Liu, Tong;  Zhang, Dawei
Adobe PDF(2661Kb)  |  Views/Downloads: 257/52  |  Submitted: 2021/12/28
Heterogeneous information network  Self-supervised  Contrastive learning  Graph neural network  
Fast End-to-End Speech Recognition via Non-Autoregressive Models and Cross-Modal Knowledge Transferring from BERT (journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2021, Issue: 29, Pages: 1897-1911
Authors:  Ye Bai;  Jiangyan Yi;  Jianhua Tao;  Zhengkun Tian;  Zhengqi Wen;  Shuai Zhang
Adobe PDF(1163Kb)  |  Views/Downloads: 213/66  |  Submitted: 2021/06/25
End-to-end speech recognition  Transfer learning  Knowledge distillation  Teacher-student learning  BERT  Non-autoregressive speech recognition  
Speech Emotion Recognition via Contrastive Loss under Siamese Networks (conference paper)
Seoul, Republic of Korea, 22-26 October, 2018
Authors:  Zheng Lian;  Ya Li;  Jianhua Tao;  Jian Huang
Adobe PDF(10778Kb)  |  Views/Downloads: 130/29  |  Submitted: 2021/06/16
Unsupervised Representation Learning with Future Observation Prediction for Speech Emotion Recognition (conference paper)
Graz, Austria, 15-19 September, 2019
Authors:  Zheng Lian;  Jianhua Tao;  Bin Liu;  Jian Huang
Adobe PDF(373Kb)  |  Views/Downloads: 110/38  |  Submitted: 2021/06/16