CASIA OpenIR

Browse/Search results: 4 items in total, showing items 1-4

Reducing Vision-Answer Biases for Multiple-Choice VQA (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 4621-4634
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Views/Downloads: 37/0  |  Submitted: 2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Efficient Token-Guided Image-Text Retrieval With Consistent Multimodal Contrastive Training (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 3622-3633
Authors:  Liu, Chong;  Zhang, Yuqi;  Wang, Hongsong;  Chen, Weihua;  Wang, Fan;  Huang, Yan;  Shen, Yi-Dong;  Wang, Liang
Views/Downloads: 99/0  |  Submitted: 2023/11/17
Image-text retrieval  multimodal transformer  multimodal contrastive training  
General vs. Long-Tailed Age Estimation: An Approach to Kill Two Birds With One Stone (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 6155-6167
Authors:  Bao, Zenghao;  Tan, Zichang;  Li, Jun;  Wan, Jun;  Ma, Xibo;  Lei, Zhen
Views/Downloads: 17/0  |  Submitted: 2024/02/22
General age estimation  long-tailed age estimation  class-wise mean absolute error  
Cycle-Consistent Weakly Supervised Visual Grounding With Individual and Contextual Representations (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 5167-5180
Authors:  Zhang, Ruisong;  Wang, Chuang;  Liu, Cheng-Lin
Views/Downloads: 107/0  |  Submitted: 2023/11/16
Visualization  Grounding  Task analysis  Sports equipment  Image reconstruction  Transformers  Training  Weakly supervised learning  visual grounding  cycle consistency  individual and contextual representations