CASIA OpenIR

Browse/search results: 9 items in total, showing items 1-9

Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 3256-3270
Authors: Zhang, Yujia; Li, Qianzhong; Pan, Yi; Zhao, Xiaoguang; Tan, Min
Views/Downloads: 7/0 | Submitted: 2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
Exploring Intrinsic Discrimination and Consistency for Weakly Supervised Object Localization (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1045-1058
Authors: Wang, Changwei; Xu, Rongtao; Xu, Shibiao; Meng, Weiliang; Wang, Ruisheng; Zhang, Xiaopeng
Adobe PDF (3269Kb) | Views/Downloads: 63/22 | Submitted: 2024/05/29
Weakly supervised object localization  intrinsic discrimination and consistency  deep metric learning  geometric transformation consistency  
Reducing Vision-Answer Biases for Multiple-Choice VQA (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 4621-4634
Authors: Zhang, Xi; Zhang, Feifei; Xu, Changsheng
Adobe PDF (2684Kb) | Views/Downloads: 87/3 | Submitted: 2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 3622-3633
Authors: Liu, Chong; Zhang, Yuqi; Wang, Hongsong; Chen, Weihua; Wang, Fan; Huang, Yan; Shen, Yi-Dong; Wang, Liang
Views/Downloads: 142/0 | Submitted: 2023/11/17
Image-text retrieval  multimodal transformer  multimodal contrastive training  
Cycle-Consistent Weakly Supervised Visual Grounding With Individual and Contextual Representations (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 5167-5180
Authors: Zhang, Ruisong; Wang, Chuang; Liu, Cheng-Lin
Views/Downloads: 137/0 | Submitted: 2023/11/16
Visualization  Grounding  Task analysis  Sports equipment  Image reconstruction  Transformers  Training  Weakly supervised learning  visual grounding  cycle consistency  individual and contextual representations  
Cross-Batch Hard Example Mining With Pseudo Large Batch for ID vs. Spot Face Recognition (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 3224-3235
Authors: Tan, Zichang; Liu, Ajian; Wan, Jun; Liu, Hao; Lei, Zhen; Guo, Guodong; Li, Stan Z.
Adobe PDF (10124Kb) | Views/Downloads: 289/5 | Submitted: 2022/07/25
Face recognition  Training  Measurement  Graphics processing units  Deep learning  Logic gates  Feature extraction  Face recognition  ID vs spot  deep learning  cross-batch hard example mining  pseudo large batch  
An Iterative Co-Training Transductive Framework for Zero Shot Learning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, Volume: 30, Pages: 6943-6956
Authors: Liu, Bo; Hu, Lihua; Dong, Qiulei; Hu, Zhanyi
Adobe PDF (2452Kb) | Views/Downloads: 294/65 | Submitted: 2021/11/02
Visualization  Semantics  Training  Feature extraction  Testing  Detectors  Predictive models  Zero-shot learning  transductive learning  co-training  
Efficient Center Voting for Object Detection and 6D Pose Estimation in 3D Point Cloud (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, Volume: 30, Issue: 1, Pages: 5072-5084
Authors: Guo, Jianwei; Xing, Xuejun; Quan, Weize; Yan, Dong-Ming; Gu, Qingyi; Liu, Yang; Zhang, Xiaopeng
Adobe PDF (8313Kb) | Views/Downloads: 230/9 | Submitted: 2021/08/15
Three-dimensional displays  Pose estimation  Shape  Object detection  Feature extraction  Object recognition  Transmission line matrix methods  6D pose estimation  3D object recognition  point pair features  3D point cloud  
General Subspace Learning With Corrupted Training Data Via Graph Embedding (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2013, Volume: 22, Issue: 11, Pages: 4380-4393
Authors: Bao, Bing-Kun; Liu, Guangcan; Hong, Richang; Yan, Shuicheng; Xu, Changsheng
Adobe PDF (3715Kb) | Views/Downloads: 362/73 | Submitted: 2015/08/12
Subspace Learning  Corrupted Training Data  Discriminant Analysis  Graph Embedding