CASIA OpenIR

Browse/Search Results:  1-10 of 507 Help

Selected(0)Clear Items/Page:    Sort:
Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 782-800
Authors:  Jingqing Ruan;   Kaishen Wang;   Qingyang Zhang;   Dengpeng Xing;   Bo Xu
Adobe PDF(4577Kb)  |  Favorite  |  View/Download:11/4  |  Submit date:2024/07/18
Reinforcement learning  representation learning  subtask planning  task decomposition  pretraining.  
ReChoreoNet: Repertoire-based Dance Re-choreography with Music-conditioned Temporal and Style Clues 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 771-781
Authors:  Ho Yin Au;  Jie Chen;  Junkun Jiang;  Yike Guo
Adobe PDF(2161Kb)  |  Favorite  |  View/Download:10/3  |  Submit date:2024/07/18
Generative model  cross-modality learning  normalizing flow  tempo synchronization  style transfer  
Toward Human-centered XAI in Practice: A survey 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 740-770
Authors:  Xiangwei Kong;  Shujie Liu;  Luhao Zhu
Adobe PDF(1550Kb)  |  Favorite  |  View/Download:10/1  |  Submit date:2024/07/18
Artificial intelligence (AI) application  explainable AI (XAI)  human-centered design  visual computing  medical diagnosis  
TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 704-717
Authors:  Yukun Zhai;   Xiaoqiang Zhang;   Xiameng Qin;   Sanyuan Zhao;  Xingping Dong;   Jianbing Shen
Adobe PDF(2312Kb)  |  Favorite  |  View/Download:14/5  |  Submit date:2024/07/18
End-to-end text spotting  arbitrarily-shaped texts  transformer  mixed supervision  multitask modeling  
面向视觉-语言的跨模态预训练与匹配方法研究 学位论文
, 2024
Authors:  chen yuxin
Adobe PDF(46981Kb)  |  Favorite  |  View/Download:26/1  |  Submit date:2024/07/11
视觉语言匹配  图像文本预训练  知识蒸馏  双向匹配评估  令牌合并  
NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1913-1931
Authors:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(4719Kb)  |  Favorite  |  View/Download:25/6  |  Submit date:2024/07/08
Multi-Level Counterfactual Contrast for Visual Commonsense Reasoning 会议论文
, Chengdu, China, 2021-10
Authors:  Zhang X(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(5740Kb)  |  Favorite  |  View/Download:31/7  |  Submit date:2024/07/08
VQACL: A Novel Visual Question Answering Continual Learning Setting 会议论文
, Canada, 2023
Authors:  Zhang X(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(1199Kb)  |  Favorite  |  View/Download:27/6  |  Submit date:2024/07/08
面向多模态语义理解与推理的视觉问答研究 学位论文
, 2024
Authors:  张熙
Adobe PDF(39126Kb)  |  Favorite  |  View/Download:35/2  |  Submit date:2024/07/08
多模态  视觉问答  语义挖掘  可靠关联  推理泛化  
CMFN: Cross-Modal Fusion Network for Irregular Scene Text Recognition 会议论文
, 中国, 2023.06.08
Authors:  Jinzhi Zheng;  Ruyi Ji;  Libo Zhang;  Yanjun Wu;  Chen Zhao
Adobe PDF(1516Kb)  |  Favorite  |  View/Download:23/10  |  Submit date:2024/07/08