| Power Control Based on Deep Reinforcement Learning for Spectrum Sharing. Journal article. IEEE Transactions on Wireless Communications, 2024, vol. 19, no. 6, pp. 4209-4219. Authors: Zhang, Haijun; Yang, Ning; Huangfu, Wei; Long, Keping; Leung, Victor C. M.
Adobe PDF(1925Kb)  |  Views/Downloads: 13/7  |  Submitted: 2024/06/12 |
| Filtered Observations for Model-Based Multi-agent Reinforcement Learning. Conference paper. Turin, Italy, 2023.9.18-2023.9.22. Authors: Meng Linghui; Xiong Xuantang; Zang Yifan; Zhang Xi; Li Guoqi; Xing Dengpeng; Xu Bo
Adobe PDF(841Kb)  |  Views/Downloads: 13/6  |  Submitted: 2024/06/11 |
| PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction. Conference paper. Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07-2023.02.14. Authors: Bai FS(白丰硕); Zhang HM(张鸿铭); Tao TY(陶天阳); Wu ZH(武志亨); Wang YN(王燕娜); Xu B(徐博)
Adobe PDF(1663Kb)  |  Views/Downloads: 185/41  |  Submitted: 2023/07/05  |  Keywords: Reinforcement Learning Algorithms Transfer Domain Adaptation Multi-Task Learning |
| Potential Driven Reinforcement Learning for Hard Exploration Tasks. Conference paper. Online, 2020-4. Authors: Zhao EM(赵恩民); Deng SH(邓诗弘); Zang YF(臧一凡); Kang YX(康永欣); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  Views/Downloads: 102/39  |  Submitted: 2023/06/29 |
| AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning. Conference paper. Online, 2022-02-22. Authors: Zhao EM(赵恩民); Yan RY(闫仁业); Li JQ(李金秋); Li K(李凯); Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  Views/Downloads: 175/68  |  Submitted: 2023/06/29 |
| L2E: Learning to Exploit Your Opponent. Conference paper. Padua, Italy, 2022.07.18-2022.07.23. Authors: Wu Zhe; Li Kai; Xu Hang; Zang Yifan; An Bo; Xing Junliang
Adobe PDF(5676Kb)  |  Views/Downloads: 212/43  |  Submitted: 2022/06/17 |
| Learning to Navigate in Human Environments via Deep Reinforcement Learning. Conference paper. Sydney, Australia, 2019-12-12 to 2019-12-15. Authors: Xingyuan Gao; Shiying Sun; Xiaoguang Zhao; Min Tan
Adobe PDF(1298Kb)  |  Views/Downloads: 225/56  |  Submitted: 2022/03/31 |
| Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems. Journal article. IEEE Transactions on Neural Networks and Learning Systems, 2021, pages: 11. Authors: Wei, Qinglai; Han, Liyuan; Zhang, Tielin
Adobe PDF(2904Kb)  |  Views/Downloads: 208/6  |  Submitted: 2022/01/27  |  Keywords: Maximum likelihood estimation (MLE); Nonlinear systems; Optimal control; Poisson process; Spike train; Spiking Adaptive dynamic programming (SADP) |
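| Research on Deep Reinforcement Learning Methods Based on Mixed-Update Q-Values. Thesis, Master of Engineering. Institute of Automation, Chinese Academy of Sciences: University of Chinese Academy of Sciences, 2020. Author: Li Zhunan(李主南)
Adobe PDF(3839Kb)  |  Views/Downloads: 200/5  |  Submitted: 2020/06/10  |  Keywords: Deep reinforcement learning; Q-learning algorithm; Overestimation; Underestimation; Actor-Critic; Convex combination; Mixed update |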
| An Incidental Delivery Based Method for Resolving Multirobot Pairwised Transportation Problems. Journal article. IEEE Transactions on Intelligent Transportation Systems, 2016, vol. 17, no. 7, pp. 1852-1866. Authors: Liu, Zhe; Wang, Hesheng; Chen, Weidong; Yu, Junzhi; Chen, Jian
Adobe PDF(3291Kb)  |  Views/Downloads: 322/84  |  Submitted: 2016/10/20  |  Keywords: Incidental Delivery; Multirobot Pairwised Transportation (MRPWT); Simulated Annealing |