CASIA OpenIR

浏览/检索结果: 共59条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 3256-3270
作者:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
收藏  |  浏览/下载:9/0  |  提交时间:2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
Correntropy-Induced Wasserstein GCN: Learning Graph Embedding via Domain Adaptation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 页码: 3980-3993
作者:  Wei Wang;  Gaowei Zhang;  Hongyong Han;  Chi Zhang
Adobe PDF(8686Kb)  |  收藏  |  浏览/下载:21/8  |  提交时间:2024/06/27
Grammar-Induced Wavelet Network for Human Parsing 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 期号: 31, 页码: 4502-4514
作者:  Xiaomei Zhang;  Yingying Chen;  Ming Tang;  Zhen Lei;  Jinqiao Wang
Adobe PDF(3308Kb)  |  收藏  |  浏览/下载:47/18  |  提交时间:2024/06/03
General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds With One Stone 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 6155-6167
作者:  Bao, Zenghao;  Tan, Zichang;  Li, Jun;  Wan, Jun;  Ma, Xibo;  Lei, Zhen
Adobe PDF(1634Kb)  |  收藏  |  浏览/下载:60/2  |  提交时间:2024/02/22
General age estimation  long-tailed age estimation  class-wise mean absolute error  
Coarse Mask Guided Interactive Object Segmentation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 5808-5822
作者:  Li, Jing;  Fan, Junsong;  Wang, Yuxi;  Yang, Yuran;  Zhang, Zhaoxiang
Adobe PDF(4323Kb)  |  收藏  |  浏览/下载:73/5  |  提交时间:2024/02/22
Segmentation  Interactive  Transformer  Annotation Tool  
MCSfM: Multi-Camera-Based Incremental Structure-From-Motion 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 6441-6456
作者:  Cui, Hainan;  Gao, Xiang;  Shen, Shuhan
收藏  |  浏览/下载:60/0  |  提交时间:2024/02/22
Structure-from-motion  multi-camera based reconstruction  multi-camera calibration  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:95/8  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:146/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training  
An Efficient Sampling-Based Attention Network for Semantic Segmentation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2850-2863
作者:  He, Xingjian;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(3252Kb)  |  收藏  |  浏览/下载:399/80  |  提交时间:2022/06/10
Stochastic processes  Sampling methods  Semantics  Image segmentation  Computational complexity  Pattern recognition  Convolution  Semantic segmentation  stochastic sampling-based attention  deterministic sampling-based attention  
Geometry Sensitive Cross-Modal Reasoning for Composed Query Based Image Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 1000-1011
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:270/0  |  提交时间:2022/02/16
Visualization  Image retrieval  Semantics  Cognition  Geometry  Task analysis  Electronic mail  Composed query based image retrieval  semantic gap  spatial structure  inter-modal attention  text-guided visual reasoning