CASIA OpenIR

浏览/检索结果: 共10条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:13/7  |  提交时间:2024/06/12
Filtered Observations for Model-Based Multi-agent Reinforcement Learning 会议论文
, Turin, Italy, 2023.9.18-2023.9.22
作者:  Meng Linghui;  Xiong Xuantang;  Zang Yifan;  Zhang Xi;  Li Guoqi;  Xing Dengpeng;  Xu Bo
Adobe PDF(841Kb)  |  收藏  |  浏览/下载:13/6  |  提交时间:2024/06/11
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:185/41  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Potential Driven Reinforcement Learning for Hard Exploration Tasks 会议论文
, 线上, 2020-4
作者:  Zhao EM(赵恩民);  Deng SH(邓诗弘);  Zang YF(臧一凡);  Kang YX(康永欣);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(1999Kb)  |  收藏  |  浏览/下载:102/39  |  提交时间:2023/06/29
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:175/68  |  提交时间:2023/06/29
L2E: Learning to Exploit Your Opponent 会议论文
, 意大利 帕多瓦, 2022.07.18-2022.07.23
作者:  Wu Zhe;  Li Kai;  Xu Hang;  Zang Yifan;  An Bo;  Xing Junliang
Adobe PDF(5676Kb)  |  收藏  |  浏览/下载:212/43  |  提交时间:2022/06/17
Learning to Navigate in Human Environments via Deep Reinforcement Learning 会议论文
, Sydney, Australia, 2019-12-12至2019-12-15
作者:  Xingyuan Gao;  Shiying Sun;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1298Kb)  |  收藏  |  浏览/下载:225/56  |  提交时间:2022/03/31
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(2904Kb)  |  收藏  |  浏览/下载:208/6  |  提交时间:2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
基于混合更新Q值的深度强化学习方法研究 学位论文
工程硕士, 中国科学院自动化研究所: 中国科学院大学, 2020
作者:  李主南
Adobe PDF(3839Kb)  |  收藏  |  浏览/下载:200/5  |  提交时间:2020/06/10
深度强化学习  Q 学习算法  过估计  欠估计  Actor-Critic  凸组合  混合更新  
An Incidental Delivery Based Method for Resolving Multirobot Pairwised Transportation Problems 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2016, 卷号: 17, 期号: 7, 页码: 1852-1866
作者:  Liu, Zhe;  Wang, Hesheng;  Chen, Weidong;  Yu, Junzhi;  Chen, Jian
Adobe PDF(3291Kb)  |  收藏  |  浏览/下载:322/84  |  提交时间:2016/10/20
Incidental Delivery  Multirobot Pairwised Transportation (Mrpwt)  Simulated Annealing