CASIA OpenIR

Browse/Search Results: 137 items in total, showing items 1-10

Incremental Audio-Visual Fusion for Person Recognition in Earthquake Scene (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 2, Pages: 19
Authors: You, Sisi; Zuo, Yukun; Yao, Hantao; Xu, Changsheng
Views/Downloads: 56/0  |  Submitted: 2023/12/21
Cross-modal audio-visual fusion  incremental learning  person recognition  elastic weight consolidation  feature replay  
IterDepth: Iterative Residual Refinement for Outdoor Self-Supervised Multi-Frame Monocular Depth Estimation (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 1, Pages: 329-341
Authors: Feng, Cheng; Chen, Zhen; Zhang, Congxuan; Hu, Weiming; Li, Bing; Lu, Feng
Views/Downloads: 13/0  |  Submitted: 2024/03/26
Estimation  Iterative methods  Cameras  Task analysis  Feature extraction  Decoding  Training  Monocular depth estimation  iterative refinement  self-supervised learning  deep learning  
Multi-modal spatial relational attention networks for visual question answering (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 13
Authors: Yao, Haibo; Wang, Lipeng; Cai, Chengtao; Sun, Yuxin; Zhang, Zhi; Luo, Yongkang
Views/Downloads: 47/0  |  Submitted: 2024/02/22
Visual question answering  Spatial relation  Attention mechanism  Pre-training strategy  
GAN-Based Facial Attribute Manipulation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 12, Pages: 14590-14610
Authors: Liu, Yunfan; Li, Qi; Deng, Qiyao; Sun, Zhenan; Yang, Ming-Hsuan
Adobe PDF (15297 KB)  |  Views/Downloads: 36/11  |  Submitted: 2024/02/22
Generative adversarial networks  image translation  facial attribute manipulation  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 12
Authors: Zheng, Wenbo; Yan, Lan; Wang, Fei-Yue
Views/Downloads: 74/0  |  Submitted: 2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Knowledge-Embedded Mutual Guidance for Visual Reasoning (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2023, Pages: 13
Authors: Zheng, Wenbo; Yan, Lan; Chen, Long; Li, Qiang; Wang, Fei-Yue
Views/Downloads: 93/0  |  Submitted: 2023/11/16
Attention model  joint learning  knowledge embedding  visual reasoning  
PSAQ-ViT V2: Toward Accurate and General Data-Free Quantization for Vision Transformers (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 12
Authors: Li, Zhikai; Chen, Mengjuan; Xiao, Junrui; Gu, Qingyi
Views/Downloads: 48/0  |  Submitted: 2023/11/17
Data-free quantization  model compression  patch similarity  quantized vision transformers (ViTs)  
Hierarchical Curriculum Learning for No-Reference Image Quality Assessment (Journal article)
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, Pages: 20
Authors: Wang, Juan; Chen, Zewen; Yuan, Chunfeng; Li, Bing; Ma, Wentao; Hu, Weiming
Views/Downloads: 81/0  |  Submitted: 2023/11/17
No-reference image quality assessment  Hierarchical curriculum learning  Prior knowledge  Cross-dataset quality assessment correlation  
Towards Fine-Grained Optimal 3D Face Dense Registration: An Iterative Dividing and Diffusing Method (Journal article)
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, Pages: 21
Authors: Fan, Zhenfeng; Peng, Silong; Xia, Shihong
Views/Downloads: 106/0  |  Submitted: 2023/11/17
3D face  Dense correspondence  Non-rigid registration  3D morphable model  
ParallelEye Pipeline: An Effective Method to Synthesize Images for Improving the Visual Intelligence of Intelligent Vehicles (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Pages: 12
Authors: Li, Xuan; Wang, Kunfeng; Gu, Xianfeng; Deng, Fang; Wang, Fei-Yue
Views/Downloads: 41/0  |  Submitted: 2023/11/17
Annotations  Pipelines  Autonomous vehicles  Generative adversarial networks  Task analysis  Semantics  Visualization  Generative adversarial network (GAN)  intelligent vehicles  object detection  simulated scene  synthetic image