CASIA OpenIR

Browse/Search Results: 25 records in total, showing records 1-10

NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Pages: 1913-1931
Authors:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF (4719 Kb)  |  Views/Downloads: 32/8  |  Submitted: 2024/07/08
VQACL: A Novel Visual Question Answering Continual Learning Setting (Conference paper)
, Canada, 2023
Authors:  Zhang X(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF (1199 Kb)  |  Views/Downloads: 30/7  |  Submitted: 2024/07/08
On the Effects of Structural Modeling for Neural Semantic Parsing (Conference paper)
Proceedings of the 27th Conference on Computational Natural Language Learning (CoNLL), Singapore, Singapore, 2023-12
Authors:  Zhang X(张翔);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF (730 Kb)  |  Views/Downloads: 37/21  |  Submitted: 2024/06/27
Modal Contrastive Learning Based End-to-End Text Image Machine Translation (Journal article)
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, Volume: 32, Issue: 32, Pages: 2153-2165
Authors:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF (6551 Kb)  |  Views/Downloads: 39/19  |  Submitted: 2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
Pro-tuning: Unified Prompt Tuning for Vision Tasks (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Volume: 34, Issue: 6, Pages: 4653-4667
Authors:  Xing Nie;  Bolin Ni;  Jianlong Chang;  Gaofeng Meng;  Chunlei Huo;  Shiming Xiang;  Qi Tian
Adobe PDF (2224 Kb)  |  Views/Downloads: 31/9  |  Submitted: 2024/06/21
Health and Senior Care Video Moment Localization With Procedure Knowledge Distillation (Conference paper)
, Istanbul, Turkiye, Dec 5-8
Authors:  Chaochen Wu;  Meiyun Zuo;  Guan Luo;  Yuna Jiang
Adobe PDF (3140 Kb)  |  Views/Downloads: 48/19  |  Submitted: 2024/06/05
A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram (Conference paper)
, Macau, China, 2023-7-19
Authors:  Zhang Ming-Liang;  Yin Fei;  Liu Cheng-Lin
Adobe PDF (1110 Kb)  |  Views/Downloads: 108/34  |  Submitted: 2024/04/03
VQAPT: A New visual question answering model for personality traits in social media images (Journal article)
PATTERN RECOGNITION LETTERS, 2023, Volume: 175, Pages: 66-73
Authors:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
Views/Downloads: 74/0  |  Submitted: 2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
Reducing Vision-Answer Biases for Multiple-Choice VQA (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 4621-4634
Authors:  Zhang, Xi;  Zhang, Feifei;  Xu, Changsheng
Adobe PDF (2684 Kb)  |  Views/Downloads: 98/9  |  Submitted: 2023/11/17
Multiple-choice VQA  vision-answer bias  causal intervention  counterfactual interaction learning  
Latent Structure Mining With Contrastive Modality Fusion for Multimedia Recommendation (Journal article)
IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING, 2023, Volume: 35, Issue: 9, Pages: 9154-9167
Authors:  Zhang, Jinghao;  Zhu, Yanqiao;  Liu, Qiang;  Zhang, Mengqi;  Wu, Shu;  Wang, Liang
Adobe PDF (1134 Kb)  |  Views/Downloads: 172/15  |  Submitted: 2023/11/17
Multimedia recommendation  graph structure learning  contrastive learning