CASIA OpenIR

浏览/检索结果: 共479条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A novel transformer autoencoder for multi-modal emotion recognition with incomplete data 期刊论文
NEURAL NETWORKS, 2024, 卷号: 172, 页码: 12
作者:  Cheng, Cheng;  Liu, Wenzhe;  Fan, Zhaoxin;  Feng, Lin;  Jia, Ziyu
收藏  |  浏览/下载:27/0  |  提交时间:2024/03/27
Multi-modal signals  Emotion recognition  Incomplete data  Transformer autoencoder  Convolutional encoder  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:56/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:96/9  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Cross-Scenario Unknown-Aware Face Anti-Spoofing with Evidential Semantic Consistency Learning 期刊论文
IEEE Transactions on Information Forensics and Security, 2024, 页码: 3093 - 3108
作者:  Jiang, Fangling;  Liu, Yunfan;  Si, Haolin;  Meng, Jingjing;  Li, Qi
Adobe PDF(2675Kb)  |  收藏  |  浏览/下载:111/36  |  提交时间:2024/02/23
Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 页码: 14
作者:  Ding, Leqi;  Liu, Lei;  Huang, Yan;  Li, Chenglong;  Zhang, Cheng;  Wang, Wei;  Wang, Liang
收藏  |  浏览/下载:16/0  |  提交时间:2024/03/27
Task analysis  Feature extraction  Visualization  Training  Electronic mail  Benchmark testing  Trajectory  Text-to-image vehicle re-identification  cross-modal alignment  multi-scale multi-view analysis  benchmark dataset  
A cross-modal clinical prediction system for intensive care unit patient outcome 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 16
作者:  Sun, Mengxuan;  Yang, Xuebing;  Niu, Jinghao;  Gu, Yifan;  Wang, Chutong;  Zhang, Wensheng
收藏  |  浏览/下载:29/0  |  提交时间:2024/02/21
Electronic health records  Clinical outcome prediction  Patient representation  Cross-modal contrastive learning  
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, 页码: 1-15
作者:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  收藏  |  浏览/下载:40/8  |  提交时间:2024/02/23
What Does Sora Show: The Beginning of TAO to Imaginative Intelligence and Scenarios Engineering 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 4, 页码: 809-815
作者:  Fei-Yue Wang;  Qinghai Miao;  Lingxi Li;  Qinghua Ni;  Xuan Li;  Juanjuan Li;  Lili Fan;  Yonglin Tian;  Qing-Long Han
Adobe PDF(571Kb)  |  收藏  |  浏览/下载:15/4  |  提交时间:2024/03/18
Feature Matching via Topology-Aware Graph Interaction Model 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 113-130
作者:  Yifan Lu;  Jiayi Ma;  Xiaoguang Mei;  Jun Huang;  Xiao-Ping Zhang
Adobe PDF(26799Kb)  |  收藏  |  浏览/下载:242/160  |  提交时间:2024/01/02
Feature matching  graph cut  outlier filtering  topology preserving  
基于语言−视觉对比学习的多模态视频行为识别方法 期刊论文
自动化学报, 2024, 卷号: 50, 期号: 2, 页码: 417-430
作者:  张颖;  张冰冰;  董微;  安峰民;  张建新;  张强
Adobe PDF(6014Kb)  |  收藏  |  浏览/下载:15/5  |  提交时间:2024/04/12
视频行为识别  语言-视觉对比学习  多模态模型  时序建模  提示学习