CASIA OpenIR

Browse/Search results: 405 records in total, showing 1-10

A novel transformer autoencoder for multi-modal emotion recognition with incomplete data (Journal article)
NEURAL NETWORKS, 2024, Volume: 172, Pages: 12
Authors:  Cheng, Cheng;  Liu, Wenzhe;  Fan, Zhaoxin;  Feng, Lin;  Jia, Ziyu
Views/Downloads: 26/0  |  Submitted: 2024/03/27
Multi-modal signals  Emotion recognition  Incomplete data  Transformer autoencoder  Convolutional encoder  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 2, Pages: 19
Authors:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
Views/Downloads: 56/0  |  Submitted: 2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Reparameterizing and dynamically quantizing image features for image generation (Journal article)
PATTERN RECOGNITION, 2024, Volume: 146, Pages: 11
Authors:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  Views/Downloads: 96/9  |  Submitted: 2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, Pages: 14
Authors:  Ding, Leqi;  Liu, Lei;  Huang, Yan;  Li, Chenglong;  Zhang, Cheng;  Wang, Wei;  Wang, Liang
Views/Downloads: 16/0  |  Submitted: 2024/03/27
Task analysis  Feature extraction  Visualization  Training  Electronic mail  Benchmark testing  Trajectory  Text-to-image vehicle re-identification  cross-modal alignment  multi-scale multi-view analysis  benchmark dataset  
A cross-modal clinical prediction system for intensive care unit patient outcome (Journal article)
KNOWLEDGE-BASED SYSTEMS, 2024, Volume: 283, Pages: 16
Authors:  Sun, Mengxuan;  Yang, Xuebing;  Niu, Jinghao;  Gu, Yifan;  Wang, Chutong;  Zhang, Wensheng
Views/Downloads: 29/0  |  Submitted: 2024/02/21
Electronic health records  Clinical outcome prediction  Patient representation  Cross-modal contrastive learning  
AnyFace++: A Unified Framework for Free-style Text-to-Face Synthesis and Manipulation (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024, Pages: 1-15
Authors:  Sun, Jianxin;  Deng, Qiyao;  Li, Qi;  Sun, Muyi;  Liu, Yunfan;  Sun, Zhenan
Adobe PDF(16839Kb)  |  Views/Downloads: 39/7  |  Submitted: 2024/02/23
ImFusion: Boosting Two-Stage 3D Object Detection via Image Candidates (Journal article)
IEEE SIGNAL PROCESSING LETTERS, 2024, Volume: 31, Pages: 241-245
Authors:  Tao, Manli;  Zhao, Chaoyang;  Wang, Jinqiao;  Tang, Ming
Views/Downloads: 10/0  |  Submitted: 2024/03/26
Three-dimensional displays  Proposals  Object detection  Feature extraction  Point cloud compression  Aggregates  Sun  3D object detection  image candidates  pseudo 3D proposal  target missing  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  Views/Downloads: 34/10  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Multi-modal fusion for robust hand gesture recognition based on heterogeneous networks (Journal article)
SCIENCE CHINA-TECHNOLOGICAL SCIENCES, 2023, Pages: 12
Authors:  Zou, Yongxiang;  Cheng, Long;  Han, Lijun;  Li, Zhengwei
Views/Downloads: 70/0  |  Submitted: 2023/11/16
leap motion  sEMG  multi-modal  graph neural network  hand gesture recognition  
A Unified Multimodal De- and Re-Coupling Framework for RGB-D Motion Recognition (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 10, Pages: 11428-11442
Authors:  Zhou, Benjia;  Wang, Pichao;  Wan, Jun;  Liang, Yanyan;  Wang, Fan
Views/Downloads: 128/0  |  Submitted: 2023/11/16
Spatiotemporal phenomena  Representation learning  Training  Optimization  Task analysis  Three-dimensional displays  Solid modeling  Complement feature  late fusion  motion recognition  RGB-D  video augmentation