CASIA OpenIR

浏览/检索结果: 共92条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
A novel transformer autoencoder for multi-modal emotion recognition with incomplete data 期刊论文
NEURAL NETWORKS, 2024, 卷号: 172, 页码: 12
作者:  Cheng, Cheng;  Liu, Wenzhe;  Fan, Zhaoxin;  Feng, Lin;  Jia, Ziyu
收藏  |  浏览/下载:23/0  |  提交时间:2024/03/27
Multi-modal signals  Emotion recognition  Incomplete data  Transformer autoencoder  Convolutional encoder  
Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 19
作者:  You, Sisi;  Zuo, Yukun;  Yao, Hantao;  Xu, Changsheng
收藏  |  浏览/下载:45/0  |  提交时间:2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
Objformer: Boosting 3D object detection via instance-wise interaction 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 9
作者:  Tao, Manli;  Zhao, Chaoyang;  Tang, Ming;  Wang, Jinqiao
收藏  |  浏览/下载:55/0  |  提交时间:2024/02/22
3D object detection  Point clouds  Incompletion and occlusion  Instance-wise interaction  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:85/7  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
Text-to-Image Vehicle Re-Identification: Multi-Scale Multi-View Cross-Modal Alignment Network and a Unified Benchmark 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 页码: 14
作者:  Ding, Leqi;  Liu, Lei;  Huang, Yan;  Li, Chenglong;  Zhang, Cheng;  Wang, Wei;  Wang, Liang
收藏  |  浏览/下载:14/0  |  提交时间:2024/03/27
Task analysis  Feature extraction  Visualization  Training  Electronic mail  Benchmark testing  Trajectory  Text-to-image vehicle re-identification  cross-modal alignment  multi-scale multi-view analysis  benchmark dataset  
Multi-modal spatial relational attention networks for visual question answering 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 13
作者:  Yao, Haibo;  Wang, Lipeng;  Cai, Chengtao;  Sun, Yuxin;  Zhang, Zhi;  Luo, Yongkang
收藏  |  浏览/下载:38/0  |  提交时间:2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre -training strategy  
GAN-Based Facial Attribute Manipulation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 12, 页码: 14590-14610
作者:  Liu, Yunfan;  Li, Qi;  Deng, Qiyao;  Sun, Zhenan;  Yang, Ming-Hsuan
Adobe PDF(15297Kb)  |  收藏  |  浏览/下载:31/9  |  提交时间:2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
Contrastive Multi-Modal Knowledge Graph Representation Learning 期刊论文
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, 卷号: 35, 期号: 9, 页码: 8983-8996
作者:  Fang, Quan;  Zhang, Xiaowei;  Hu, Jun;  Wu, Xian;  Xu, Changsheng
收藏  |  浏览/下载:49/0  |  提交时间:2023/11/17
Knowledge graph  multimedia  graph neural network  contrastive learning  
Multi-Source Knowledge Reasoning Graph Network for Multi-Modal Commonsense Inference 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, 卷号: 19, 期号: 4, 页码: 17
作者:  Ma, Xuan;  Yang, Xiaoshan;  Xu, Changsheng
收藏  |  浏览/下载:51/0  |  提交时间:2023/11/17
Knowledge reasoning  multi-modal commonsense inference  graph neural network  
Visual enhanced hierarchical network for sentence-based video thumbnail generation 期刊论文
APPLIED INTELLIGENCE, 2023, 页码: 17
作者:  Wu, Junxian;  Zhang, Yujia;  Zhao, Xiaoguang
收藏  |  浏览/下载:36/0  |  提交时间:2023/11/17
Video thumbnail  DVTG task  Multi-modal fusion  Visual information  Hierarchical multi-layer perceptions