CASIA OpenIR

Browse/Search Results: 412 items in total, showing items 1-10

NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, Pages: 1913-1931
Authors:  Zhang Xi(张熙);  Feifei Zhang;  Changsheng Xu
Adobe PDF(4719Kb)  |  Views/Downloads: 19/5  |  Submitted: 2024/07/08
Learning State-Specific Action Masks for Reinforcement Learning (Journal article)
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  Views/Downloads: 17/7  |  Submitted: 2024/07/05
reinforcement learning  exploration efficiency  space reduction  
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks (Conference paper)
Queensland, Australia, 2023-6
Authors:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(2785Kb)  |  Views/Downloads: 27/7  |  Submitted: 2024/07/04
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination (Conference paper)
Japan, 2024-6
Authors:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  Views/Downloads: 15/5  |  Submitted: 2024/06/25
reinforcement learning  hierarchical reinforcement learning  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 14/7  |  Submitted: 2024/06/25
User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13-17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 21/8  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation (Conference paper)
Singapore, 2023-12
Authors:  Zhitao He;  Pengfei Cao;  Yubo Chen;  Kang Liu;  Jun Zhao
Adobe PDF(1153Kb)  |  Views/Downloads: 11/3  |  Submitted: 2024/06/25
Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning (Conference paper)
Vancouver, British Columbia, Canada, 2019-08-22 to 2019-08-26
Authors:  Xingyuan Gao;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1500Kb)  |  Views/Downloads: 27/13  |  Submitted: 2024/06/21
Memory-based Error Label Suppression for Embodied Self-Improving Object Detection (Conference paper)
Bari, Italy, 2024-8-28
Authors:  Deng JR(邓杰仁);  Zhang HJ(张好剑);  Hu JH(胡建华);  Wang YK(王云宽)
Adobe PDF(2603Kb)  |  Views/Downloads: 34/12  |  Submitted: 2024/06/20
Exploiting Curriculum Learning in Unsupervised Neural Machine Translation (Conference paper)
Online, November 7–11, 2021
Authors:  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(866Kb)  |  Views/Downloads: 43/12  |  Submitted: 2024/06/13