已选(0)清除
条数/页: 排序方式: |
| Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework 期刊论文 IEEE Transactions on Games, 2022, 页码: 12 作者: Liu BY(刘博寅) Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:14/4  |  提交时间:2024/07/12 |
| QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12 作者: Liu BY(刘博寅) Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:11/1  |  提交时间:2024/07/12 |
| NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文 , Queensland, Australia, 2023-6 作者: Hu GZ(胡光政); Li HR(李浩然); Liu SS(刘莎莎); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/07/04 |
| An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game 会议论文 , 线上, 2020-5 作者: Liu MS(刘民颂); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(727Kb)  |  收藏  |  浏览/下载:15/7  |  提交时间:2024/07/04 |
| 面向多机器人博弈的深度强化学习方法 学位论文 , 2024 作者: 胡光政 Adobe PDF(17740Kb)  |  收藏  |  浏览/下载:21/0  |  提交时间:2024/07/04 多智能体深度强化学习 多机器人博弈 极小极大Q学习 值分解 最大熵 |
| Humor Detection System for MuSE 2023: Contextual Modeling, Pseudo Labelling, and Post-smoothing 会议论文 , 加拿大多伦多, 2023-11 作者: Xu MY(徐名宇); Chen S(陈顺); Lian Z(连政); Liu B(刘斌) Adobe PDF(557Kb)  |  收藏  |  浏览/下载:16/8  |  提交时间:2024/06/27 |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文 , 日本, 2024-6 作者: Zhang Qingyang; Xu Bo Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:15/5  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog 会议论文 , Beijing, China, 2018-8-2 作者: Yang Minghao; Zhang Ke; NaShengRuoYang; Tao Jianhua Adobe PDF(540Kb)  |  收藏  |  浏览/下载:35/6  |  提交时间:2024/06/24 |
| Learning in bi-level markov games 会议论文 , Padua, Italy, 2022.7.18-2022.7.23 作者: Meng Linghui; Ruan Jingqing; Xing Dengpeng; Xu Bo Adobe PDF(1450Kb)  |  收藏  |  浏览/下载:35/12  |  提交时间:2024/06/11 |
| M3: Modularization for Multi-task and Multi-agent Offline Pre-training 会议论文 , London, United Kingdom, 2023.5.29-2023.6.2 作者: Meng Linghui; Ruan Jingqing; Xiong Xuantang; Li Xiyun; Zhang Xi; Xing Dengpeng; Xu Bo Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:23/5  |  提交时间:2024/06/11 |