CASIA OpenIR

浏览/检索结果: 共322条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A novel transformer autoencoder for multi-modal emotion recognition with incomplete data 期刊论文
NEURAL NETWORKS, 2024, 卷号: 172, 页码: 12
作者:  Cheng, Cheng;  Liu, Wenzhe;  Fan, Zhaoxin;  Feng, Lin;  Jia, Ziyu
收藏  |  浏览/下载:22/0  |  提交时间:2024/03/27
Multi-modal signals  Emotion recognition  Incomplete data  Transformer autoencoder  Convolutional encoder  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:45/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:84/7  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 页码: 14
作者:  Ding, Leqi;  Liu, Lei;  Huang, Yan;  Li, Chenglong;  Zhang, Cheng;  Wang, Wei;  Wang, Liang
收藏  |  浏览/下载:14/0  |  提交时间:2024/03/27
Task analysis  Feature extraction  Visualization  Training  Electronic mail  Benchmark testing  Trajectory  Text-to-image vehicle re-identification  cross-modal alignment  multi-scale multi-view analysis  benchmark dataset  
A cross-modal clinical prediction system for intensive care unit patient outcome 期刊论文
KNOWLEDGE-BASED SYSTEMS, 2024, 卷号: 283, 页码: 16
作者:  Sun, Mengxuan;  Yang, Xuebing;  Niu, Jinghao;  Gu, Yifan;  Wang, Chutong;  Zhang, Wensheng
收藏  |  浏览/下载:25/0  |  提交时间:2024/02/21
Electronic health records  Clinical outcome prediction  Patient representation  Cross-modal contrastive learning  
ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates 期刊论文
IEEE SIGNAL PROCESSING LETTERS, 2024, 卷号: 31, 页码: 241-245
作者:  Tao, Manli;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
收藏  |  浏览/下载:7/0  |  提交时间:2024/03/26
Three-dimensional displays  Proposals  Object detection  Feature extraction  Point cloud compression  Aggregates  Sun  3D object detection  image candidates  pseudo 3D proposal  target missing  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:29/9  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Multi-modal fusion for robust hand gesture recognition based on heterogeneous networks 期刊论文
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2023, 页码: 12
作者:  Zou, Yongxiang;  Cheng, Long;  Han, Lijun;  Li, Zhengwei
收藏  |  浏览/下载:58/0  |  提交时间:2023/11/16
leap motion  sEMG  multi-modal  graph neural network  hand gesture recognition  
A Unified Multimodal De- and Re-Coupling Framework for RGB-D Motion Recognition 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 10, 页码: 11428-11442
作者:  Zhou, Benjia;  Wang, Pichao;  Wan, Jun;  Liang, Yanyan;  Wang, Fan
收藏  |  浏览/下载:113/0  |  提交时间:2023/11/16
Spatiotemporal phenomena  Representation learning  Training  Optimization  Task analysis  Three-dimensional displays  Solid modeling  Complement feature  late fusion  motion recognition  RGB-D  video augmentation  
Protecting by attacking: A personal information protecting method with cross-modal adversarial examples 期刊论文
NEUROCOMPUTING, 2023, 卷号: 551, 页码: 11
作者:  Zhao, Mengnan;  Wang, Bo;  Guo, Weikuo;  Wang, Wei
收藏  |  浏览/下载:48/0  |  提交时间:2023/11/17
Security  Cross-modal  Image captioning  Adversarial attacks