CASIA OpenIR

Browse/Search results: 377 items in total, showing items 1-10

A novel transformer autoencoder for multi-modal emotion recognition with incomplete data (Journal article)
NEURAL NETWORKS, 2024, Volume: 172, Pages: 12
Authors: Cheng, Cheng; Liu, Wenzhe; Fan, Zhaoxin; Feng, Lin; Jia, Ziyu
Views/Downloads: 30/0  |  Submitted: 2024/03/27
Multi-modal signals  Emotion recognition  Incomplete data  Transformer autoencoder  Convolutional encoder  
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 234-244
Authors: Wang, Jinguang; Qian, Shengsheng; Hu, Jun; Hong, Richang
Views/Downloads: 8/0  |  Submitted: 2024/03/26
Fake news detection  multi-modal learning  social media  
A cross-modal clinical prediction system for intensive care unit patient outcome (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2024, Volume: 283, Pages: 16
Authors: Sun, Mengxuan; Yang, Xuebing; Niu, Jinghao; Gu, Yifan; Wang, Chutong; Zhang, Wensheng
Views/Downloads: 30/0  |  Submitted: 2024/02/21
Electronic health records  Clinical outcome prediction  Patient representation  Cross-modal contrastive learning  
Emotion-Aware Music Driven Movie Montage (Journal article)
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, Volume: 38, Issue: 3, Pages: 540-553
Authors: Liu, Wu-Qin; Lin, Min-Xuan; Huang, Hai-Bin; Ma, Chong-Yang; Song, Yu; Dong, Wei-Ming; Xu, Chang-Sheng
Views/Downloads: 101/0  |  Submitted: 2023/12/21
movie montage  emotion analysis  audio-visual modality  contrastive learning  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 2, Pages: 19
Authors: You, Sisi; Zuo, Yukun; Yao, Hantao; Xu, Changsheng
Views/Downloads: 59/0  |  Submitted: 2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
FM-ViT: Flexible Modal Vision Transformers for Face Anti-Spoofing (Journal article)
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, Volume: 18, Pages: 4775-4786
Authors: Liu, Ajian; Tan, Zichang; Yu, Zitong; Zhao, Chenxu; Wan, Jun; Liang, Yanyan; Lei, Zhen; Zhang, Du; Li, Stan Z.; Guo, Guodong
Views/Downloads: 120/0  |  Submitted: 2023/11/17
Face anti-spoofing  flexible-modal testing  vision transformer  mutual-attention  fusion-attention  
Medical visual question answering with symmetric interaction attention and cross-modal gating (Journal article)
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, Volume: 85, Pages: 10
Authors: Chen, Zhi; Zou, Beiji; Dai, Yulan; Zhu, Chengzhang; Kong, Guilan; Zhang, Wensheng
Views/Downloads: 76/0  |  Submitted: 2023/11/17
Medical visual question answering  Self-attention  Information interaction  Cross-modal gating  
Self-Supervised Modality-Aware Multiple Granularity Pre-Training for RGB-Infrared Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2023, Volume: 18, Pages: 3044-3057
Authors: Wan, Lin; Jing, Qianyan; Sun, Zongyuan; Zhang, Chuang; Li, Zhihang; Chen, Yehansen
Views/Downloads: 71/0  |  Submitted: 2023/11/17
Task analysis  Training  Feature extraction  Lighting  Cameras  Visualization  Self-supervised learning  Cross-modality person re-identification  self-supervised learning  multi-modality pre-training  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors: Ma, Chengcheng; Liu, Yang; Deng, Jiankang; Xie, Lingxi; Dong, Weiming; Xu, Changsheng
Adobe PDF(1644Kb)  |  Views/Downloads: 96/12  |  Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Pages: early-access
Authors: Mengqi Rong; Shuhan Shen
Adobe PDF(5811Kb)  |  Views/Downloads: 108/36  |  Submitted: 2023/09/25