CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Memory-Adaptive Vision-and-Language Navigation 期刊论文
Pattern Recognition, 2024, 卷号: 153, 页码: 110511
作者:  Keji He;  Ya Jing;  Yan Huang;  Zhihe Lu;  Dong An;  Liang Wang
Adobe PDF(3831Kb)  |  收藏  |  浏览/下载:46/18  |  提交时间:2024/06/26
Vision-and-Language Navigation  Memory bank  History noises  Memory-Adaptive Model  
Efficient multimodal transformer with dual-level feature restoration for robust multimodal sentiment analysis 期刊论文
IEEE Transactions on Affective Computing, 2023, 卷号: 15, 期号: 1, 页码: 1-17
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2371Kb)  |  收藏  |  浏览/下载:66/18  |  提交时间:2024/05/31
Transformers  Robustness  Semantics  Data models  Computational modeling  Videos  Training  Multimodal sentiment analysis  unaligned and incomplete data  efficient multimodal Transformer  dual-level feature restoration  robustness  
HiCMAE: Hierarchical Contrastive Masked Autoencoder for self-supervised Audio-Visual Emotion Recognition 期刊论文
Information Fusion, 2024, 卷号: 108, 页码: 1-20
作者:  Licai Sun;  Zheng Lian;  Bin Liu;  Jianhua Tao
Adobe PDF(2281Kb)  |  收藏  |  浏览/下载:53/12  |  提交时间:2024/05/31
Audio-Visual Emotion Recognition  Self-supervised learning  Masked autoencoder  Contrastive learning  
Learning and Controlling Multiscale Dynamics in Spiking Neural Networks Using Recursive Least Square Modifications 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 14
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(8060Kb)  |  收藏  |  浏览/下载:93/20  |  提交时间:2024/03/27
Direct dynamic programming (DDP)  Lorenz system  multiscale dynamics  point-to-point control  recursive least square (RLS)  spiking neural network (SNN)  
Surveillance Face Anti-Spoofing 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2024, 卷号: 19, 页码: 1535-1546
作者:  Fang, Hao;  Liu, Ajian;  Wan, Jun;  Escalera, Sergio;  Zhao, Chenxu;  Zhang, Xu;  Li, Stan Z.;  Lei, Zhen
Adobe PDF(3072Kb)  |  收藏  |  浏览/下载:94/7  |  提交时间:2024/02/22
Face anti-spoofing  dataset  surveillance scenes  
FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, 卷号: 18, 页码: 4775-4786
作者:  Liu, Ajian;  Tan, Zichang;  Yu, Zitong;  Zhao, Chenxu;  Wan, Jun;  Liang, Yanyan;  Lei, Zhen;  Zhang, Du;  Li, Stan Z.;  Guo, Guodong
Adobe PDF(10966Kb)  |  收藏  |  浏览/下载:182/2  |  提交时间:2023/11/17
Face anti-spoofing  flexible-modal testing  vision transformer  mutual-attention  fusion-attention  
Constrained Maximum Cross-Domain Likelihood for Domain Generalization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 15
作者:  Lin, Jianxin;  Tang, Yongqiang;  Wang, Junping;  Zhang, Wensheng
Adobe PDF(2518Kb)  |  收藏  |  浏览/下载:167/7  |  提交时间:2023/11/17
Optimization  Feature extraction  Metalearning  Entropy  Training  Hospitals  Task analysis  Distribution shift  domain adaptation  domain generalization  domain-invariant representation  joint distribution alignment  
Everybody’s Talkin’: Let Me Talk as You Want 期刊论文
IEEE Transactions on Information Forensics and Security, 2022, 卷号: 17, 期号: 1, 页码: 585 - 598
作者:  宋林森;  吴文岩;  钱晨;  赫然;  Loy, Chen Change
Adobe PDF(15432Kb)  |  收藏  |  浏览/下载:105/15  |  提交时间:2023/06/29
Talking face generation  Video generation  GAN  Audio dubbing  
Topic-Oriented Dialogue Summarization 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2023, 卷号: 31, 页码: 1797 - 1810
作者:  Lin, Haitao;  Zhu, Junnan;  Xiang, Lu;  Zhai, Feifei;  Zhou, Yu;  Zhang, Jiajun;  Zong, Chengqing
Adobe PDF(3037Kb)  |  收藏  |  浏览/下载:245/90  |  提交时间:2023/06/13
dialogue summarization  abstractive summarization  controllable text generation  natural language processing  
CampNet: Context-Aware Mask Prediction for End-to-End Text-Based Speech Editing 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2022, 卷号: 30, 页码: 2241-2254
作者:  Wang, Tao;  Yi, Jiangyan;  Fu, Ruibo;  Tao, Jianhua;  Wen, Zhengqi
收藏  |  浏览/下载:246/0  |  提交时间:2022/09/19
Speech processing  Decoding  Predictive models  Acoustics  Transfer learning  Training  Task analysis  Coarse-to-fine decoding  mask prediction  one-shot learning  text-based speech editing  text-to-speech