已选(0)清除
条数/页: 排序方式: |
| OpenHoldem: A Benchmark for Large-Scale Imperfect-Information Game Research 期刊论文 IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 15 作者: Li, Kai; Xu, Hang; Zhao, Enmin; Wu, Zhe; Xing, Junliang 收藏  |  浏览/下载:82/0  |  提交时间:2023/11/17 Artificial intelligence (AI) benchmark imperfect-information game Nash equilibrium no-limit Texas hold'em (NLTH) |
| A Survey on Reinforcement Learning Methods in Bionic Underwater Robots 期刊论文 BIOMIMETICS, 2023, 卷号: 8, 期号: 2, 页码: 29 作者: Tong, Ru; Feng, Yukai; Wang, Jian; Wu, Zhengxing; Tan, Min; Yu, Junzhi 收藏  |  浏览/下载:58/0  |  提交时间:2023/11/17 bionic underwater robot reinforcement learning robotic fish intelligent control |
| An Efficient and Robust Complex Weld Seam Feature Point Extraction Method for Seam Tracking and Posture Adjustment 期刊论文 IEEE Transactions on Industrial Informatics, 2023, 卷号: 19, 期号: 11, 页码: 10704 - 10715 作者: Yunkai Ma; Junfeng Fan; Huizhen Yang; Hongliang Wang; Shiyu Xing; Fengshui Jing; Min Tan Adobe PDF(9580Kb)  |  收藏  |  浏览/下载:167/43  |  提交时间:2023/10/08 |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文 , 线上, 2020-4 作者: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮) Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:66/22  |  提交时间:2023/06/29 |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文 , 线上, 2022-02-22 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮) Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:95/36  |  提交时间:2023/06/29 |
| Pseudo Value Network Distillation for High-Performance Exploration 会议论文 , 澳大利亚, 2023-06 作者: Zhao EM(赵恩民); Xing JL(兴军亮); Li K(李凯); Kang YX(康永欣); Tao P(陶品) Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:120/37  |  提交时间:2023/06/28 |
| Learning to Play Hard Exploration Games Using Graph-guided Self-navigation 会议论文 , 线上, 2021-02 作者: Zhao EM(赵恩民); Yan RY(闫仁业); Li K(李凯); Li LJ(李丽娟); Xing JL(兴军亮) Adobe PDF(413Kb)  |  收藏  |  浏览/下载:109/49  |  提交时间:2023/06/28 |
| 一种用于两人零和博弈对手适应的元策略演化学习算法 期刊论文 自动化学报, 2022, 页码: 0 作者: 吴哲; 李凯; 徐航; 兴军亮 Adobe PDF(15953Kb)  |  收藏  |  浏览/下载:181/42  |  提交时间:2022/06/17 |
| L2E: Learning to Exploit Your Opponent 会议论文 , 意大利 帕多瓦, 2022.07.18-2022.07.23 作者: Wu Zhe; Li Kai; Xu Hang; Zang Yifan; An Bo; Xing Junliang Adobe PDF(5676Kb)  |  收藏  |  浏览/下载:180/34  |  提交时间:2022/06/17 |
| 一种针对德州扑克AI的对手建模与策略集成框架 期刊论文 自动化学报, 2021, 期号: 0, 页码: 0 作者: 张蒙; 李凯; 吴哲; 臧一凡; 徐航; 兴军亮 Adobe PDF(1354Kb)  |  收藏  |  浏览/下载:342/91  |  提交时间:2021/06/21 不完美信息博弈 德州扑克 演化学习 在线对手建模 种群策略集成 |