CASIA OpenIR

Browse/Search Results: 362 items in total, showing 1-10

A novel transformer autoencoder for multi-modal emotion recognition with incomplete data (Journal article)
NEURAL NETWORKS, 2024, Volume: 172, Pages: 12
Authors: Cheng, Cheng; Liu, Wenzhe; Fan, Zhaoxin; Feng, Lin; Jia, Ziyu
Views/Downloads: 21/0 | Submitted: 2024/03/27
Multi-modal signals  Emotion recognition  Incomplete data  Transformer autoencoder  Convolutional encoder  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 2, Pages: 19
Authors: You, Sisi; Zuo, Yukun; Yao, Hantao; Xu, Changsheng
Views/Downloads: 45/0 | Submitted: 2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
A cross-modal clinical prediction system for intensive care unit patient outcome (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2024, Volume: 283, Pages: 16
Authors: Sun, Mengxuan; Yang, Xuebing; Niu, Jinghao; Gu, Yifan; Wang, Chutong; Zhang, Wensheng
Views/Downloads: 24/0 | Submitted: 2024/02/21
Electronic health records  Clinical outcome prediction  Patient representation  Cross-modal contrastive learning  
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, Pages: 1-15
Authors: Sun, Jianxin; Deng, Qiyao; Li, Qi; Sun, Muyi; Liu, Yunfan; Sun, Zhenan
Adobe PDF(16839Kb) | Views/Downloads: 35/7 | Submitted: 2024/02/23
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors: Ma, Chengcheng; Liu, Yang; Deng, Jiankang; Xie, Lingxi; Dong, Weiming; Xu, Changsheng
Adobe PDF(1644Kb) | Views/Downloads: 78/11 | Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
Medical visual question answering with symmetric interaction attention and cross-modal gating (Journal article)
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2023, Volume: 85, Pages: 10
Authors: Chen, Zhi; Zou, Beiji; Dai, Yulan; Zhu, Chengzhang; Kong, Guilan; Zhang, Wensheng
Views/Downloads: 60/0 | Submitted: 2023/11/17
Medical visual question answering  Self-attention  Information interaction  Cross-modal gating  
A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram (Conference paper)
Macau, China, 2023-7-19
Authors: Zhang Ming-Liang; Yin Fei; Liu Cheng-Lin
Adobe PDF(1110Kb) | Views/Downloads: 29/9 | Submitted: 2024/04/03
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering (Journal article)
IEEE Transactions on Multimedia, 2023, Pages: 1-15
Authors: Song, Yaguang; Yang, Xiaoshan; Wang, Yaowei; Xu, Changsheng
Adobe PDF(2397Kb) | Views/Downloads: 151/41 | Submitted: 2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
3D Semantic Segmentation of Aerial Photogrammetry Models Based on Orthographic Projection (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Pages: early-access
Authors: Mengqi Rong; Shuhan Shen
Adobe PDF(5811Kb) | Views/Downloads: 100/35 | Submitted: 2023/09/25
DCAT: Dual Cross-Attention-Based Transformer for Change Detection (Journal article)
Remote Sensing, 2023, Volume: 15, Issue: 9, Pages: 2395
Authors: Yuan Zhou; Chunlei Huo; Jiahang Zhu; Leigang Huo; Chunhong Pan
Adobe PDF(47919Kb) | Views/Downloads: 121/17 | Submitted: 2023/06/16
change detection  transformer  dual cross-attention  remote sensing