已选(0)清除
条数/页: 排序方式: |
| NExT-OOD: Overcoming Dual Multiple-Choice VQA Biases 期刊论文 IEEE Transactions on Pattern Analysis and Machine Intelligence, 2023, 页码: 1913-1931 作者: Zhang Xi(张熙) ; Feifei Zhang; Changsheng Xu![](/image/person.jpg)
Adobe PDF(4719Kb)  |   收藏  |  浏览/下载:19/5  |  提交时间:2024/07/08 |
| Learning State-Specific Action Masks for Reinforcement Learning 期刊论文 Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60 作者: Wang ZY(王梓薏) ; Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF(2976Kb)  |   收藏  |  浏览/下载:17/7  |  提交时间:2024/07/05 reinforcement learning exploration efficiency space reduction |
| NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文 , Queensland, Australia, 2023-6 作者: Hu GZ(胡光政) ; Li HR(李浩然) ; Liu SS(刘莎莎); Zhu YH(朱圆恒) ; Zhao DB(赵冬斌)![](/image/person.jpg)
Adobe PDF(2785Kb)  |   收藏  |  浏览/下载:27/7  |  提交时间:2024/07/04 |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文 , 日本, 2024-6 作者: Zhang Qingyang ; Xu Bo![](/image/person.jpg)
Adobe PDF(8862Kb)  |   收藏  |  浏览/下载:15/5  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang ; Zhang Hongming; Xing Dengpeng ; Bo Xu![](/image/person.jpg)
Adobe PDF(9639Kb)  |   收藏  |  浏览/下载:14/7  |  提交时间:2024/06/25 |
| User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文 , 新加坡, May 13 - 17, 2024 作者: Zhang, Zhiyuan ; Zhang, Qichao ; Wu, Xiaoxu; Shi, Xiaowen; Liao, Guogang; Wang, Yongkong; Wang, xingxing; Zhao, Dongbin![](/image/person.jpg)
Adobe PDF(2077Kb)  |   收藏  |  浏览/下载:21/8  |  提交时间:2024/06/25 Ads Allocation Reinforcement Learning User Response Modeling |
| LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文 , Singapore, 2023-12 作者: Zhitao He ; Pengfei Cao ; Yubo Chen ; Kang Liu; Jun Zhao![](/image/person.jpg)
Adobe PDF(1153Kb)  |   收藏  |  浏览/下载:11/3  |  提交时间:2024/06/25 |
| Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning 会议论文 , Vancouver, British Columbia, Canada, 2019-08-22至2019-08-26 作者: Xingyuan Gao ; Xiaoguang Zhao ; Min Tan![](/image/person.jpg)
Adobe PDF(1500Kb)  |   收藏  |  浏览/下载:27/13  |  提交时间:2024/06/21 |
| Memory-based Error Label Suppression for Embodied Self-Improving Object Detection 会议论文 , 意大利巴里, 2024-8-28 作者: Deng JR(邓杰仁) ; Zhang HJ(张好剑) ; Hu JH(胡建华) ; Wang YK(王云宽)![](/image/person.jpg)
Adobe PDF(2603Kb)  |   收藏  |  浏览/下载:34/12  |  提交时间:2024/06/20 |
| Exploiting Curriculum Learning in Unsupervised Neural Machine Translation 会议论文 , Online, November 7–11, 2021 作者: Lu JL(陆金梁) ; Zhang JJ(张家俊)![](/image/person.jpg)
Adobe PDF(866Kb)  |   收藏  |  浏览/下载:43/12  |  提交时间:2024/06/13 |