CASIA OpenIR

浏览/检索结果: 共55条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 3256-3270
作者:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
收藏  |  浏览/下载:9/0  |  提交时间:2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds With One Stone 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 6155-6167
作者:  Bao, Zenghao;  Tan, Zichang;  Li, Jun;  Wan, Jun;  Ma, Xibo;  Lei, Zhen
Adobe PDF(1634Kb)  |  收藏  |  浏览/下载:60/2  |  提交时间:2024/02/22
General age estimation  long-tailed age estimation  class-wise mean absolute error  
Coarse Mask Guided Interactive Object Segmentation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 5808-5822
作者:  Li, Jing;  Fan, Junsong;  Wang, Yuxi;  Yang, Yuran;  Zhang, Zhaoxiang
Adobe PDF(4323Kb)  |  收藏  |  浏览/下载:73/5  |  提交时间:2024/02/22
Segmentation  Interactive  Transformer  Annotation Tool  
MCSfM: Multi-Camera-Based Incremental Structure-From-Motion 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 6441-6456
作者:  Cui, Hainan;  Gao, Xiang;  Shen, Shuhan
收藏  |  浏览/下载:60/0  |  提交时间:2024/02/22
Structure-from-motion  multi-camera based reconstruction  multi-camera calibration  
Reducing Vision-Answer Biases for Multiple-Choice VQA 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4621-4634
作者:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF(2684Kb)  |  收藏  |  浏览/下载:95/8  |  提交时间:2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3622-3633
作者:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
收藏  |  浏览/下载:147/0  |  提交时间:2023/11/17
Index Terms-Image-text retrieval  multimodal transformer  multimodal contrastive training  
An Efficient Sampling-Based Attention Network for Semantic Segmentation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2850-2863
作者:  He, Xingjian;  Liu, Jing;  Wang, Weining;  Lu, Hanqing
Adobe PDF(3252Kb)  |  收藏  |  浏览/下载:399/80  |  提交时间:2022/06/10
Stochastic processes  Sampling methods  Semantics  Image segmentation  Computational complexity  Pattern recognition  Convolution  Semantic segmentation  stochastic sampling-based attention  deterministic sampling-based attention  
Geometry Sensitive Cross-Modal Reasoning for Composed Query Based Image Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 1000-1011
作者:  Zhang, Feifei;  Xu, Mingliang;  Xu, Changsheng
收藏  |  浏览/下载:271/0  |  提交时间:2022/02/16
Visualization  Image retrieval  Semantics  Cognition  Geometry  Task analysis  Electronic mail  Composed query based image retrieval  semantic gap  spatial structure  inter-modal attention  text-guided visual reasoning  
Urban Scene LOD Vectorized Modeling From Photogrammetry Meshes 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 卷号: 30, 页码: 7458-7471
作者:  Han, Jiali;  Zhu, Lingjie;  Gao, Xiang;  Hu, Zhanyi;  Zhou, Liyang;  Liu, Hongmin;  Shen, Shuhan
Adobe PDF(8168Kb)  |  收藏  |  浏览/下载:326/43  |  提交时间:2021/11/03
Urban reconstruction  building modeling  Markov random field  segment based modeling  
Efficient Center Voting for Object Detection and 6D Pose Estimation in 3D Point Cloud 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 卷号: 30, 期号: 1, 页码: 5072-5084
作者:  Guo, Jianwei;  Xing, Xuejun;  Quan, Weize;  Yan, Dong-Ming;  Gu, Qingyi;  Liu, Yang;  Zhang, Xiaopeng
Adobe PDF(8313Kb)  |  收藏  |  浏览/下载:232/9  |  提交时间:2021/08/15
Three-dimensional displays  Pose estimation  Shape  Object detection  Feature extraction  Object recognition  Transmission line matrix methods  6D pose estimation  3D object recognition  point pair features  3D point cloud