| Unbiased Visual Question Answering by Leveraging Instrumental Variable. Journal article. IEEE TRANSACTIONS ON MULTIMEDIA, 2024, vol. 26, pp. 6648-6662. Authors: Pan, Yonghua; Liu, Jing; Jin, Lu; Li, Zechao. Views/downloads: 7/0. Submitted: 2024/07/22. Keywords: Visualization; Correlation; Instruments; Training; Predictive models; Color; Generators; Visual question answering; instrumental variable; causal inference; out of distribution |
| Toward Human-centered XAI in Practice: A survey. Journal article. Machine Intelligence Research, 2024, vol. 21, no. 4, pp. 740-770. Authors: Xiangwei Kong; Shujie Liu; Luhao Zhu. Adobe PDF (1550Kb). Views/downloads: 25/4. Submitted: 2024/07/18. Keywords: Artificial intelligence (AI) application; explainable AI (XAI); human-centered design; visual computing; medical diagnosis |
| TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision. Journal article. Machine Intelligence Research, 2024, vol. 21, no. 4, pp. 704-717. Authors: Yukun Zhai; Xiaoqiang Zhang; Xiameng Qin; Sanyuan Zhao; Xingping Dong; Jianbing Shen. Adobe PDF (2312Kb). Views/downloads: 26/8. Submitted: 2024/07/18. Keywords: End-to-end text spotting; arbitrarily-shaped texts; transformer; mixed supervision; multitask modeling |
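| Research on Cross-Modal Pre-training and Matching Methods for Vision-Language (面向视觉-语言的跨模态预训练与匹配方法研究). Thesis, 2024. Author: Chen Yuxin. Adobe PDF (46981Kb). Views/downloads: 35/2. Submitted: 2024/07/11. Keywords: vision-language matching; image-text pre-training; knowledge distillation; bidirectional matching evaluation; token merging |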
| NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases. Journal article. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, pp. 1913-1931. Authors: Zhang Xi (张熙); Feifei Zhang; Changsheng Xu. Adobe PDF (4719Kb). Views/downloads: 41/10. Submitted: 2024/07/08 |
| Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning. Conference paper. Chengdu, China, 2021-10. Authors: Zhang X (张熙); Feifei Zhang; Changsheng Xu. Adobe PDF (5740Kb). Views/downloads: 40/9. Submitted: 2024/07/08 |
| VQACL: A Novel Visual Question Answering Continual Learning Setting. Conference paper. Canada, 2023. Authors: Zhang X (张熙); Feifei Zhang; Changsheng Xu. Adobe PDF (1199Kb). Views/downloads: 37/8. Submitted: 2024/07/08 |
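| Research on Visual Question Answering for Multimodal Semantic Understanding and Reasoning (面向多模态语义理解与推理的视觉问答研究). Thesis, 2024. Author: Zhang Xi (张熙). Adobe PDF (39126Kb). Views/downloads: 54/2. Submitted: 2024/07/08. Keywords: multimodal; visual question answering; semantic mining; reliable association; reasoning generalization |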
| CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition. Conference paper. China, 2023-06-08. Authors: Jinzhi Zheng; Ruyi Ji; Libo Zhang; Yanjun Wu; Chen Zhao. Adobe PDF (1516Kb). Views/downloads: 35/12. Submitted: 2024/07/08 |
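| Driving Behavior Prediction Based on Multimodal Collaboration (基于多模态协同的驾驶行为预测). Thesis, 2024. Author: Dong Qinghui (董清辉). Adobe PDF (5017Kb). Views/downloads: 38/1. Submitted: 2024/07/08. Keywords: human-vehicle co-driving; driving behavior prediction; multimodal collaboration; trajectory prediction; multi-task learning |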