CASIA OpenIR

浏览/检索结果: 共23条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:20/9  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:16/4  |  提交时间:2024/05/30
Position Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Liuzhou, China, 20-22 November 2020
作者:  Ma, Ruichen;  Wang, Yu;  Gao, Zisen;  Zhao, Tianzi;  Wang, Rui;  Wang, Shuo;  Zhou, Chao
Adobe PDF(927Kb)  |  收藏  |  浏览/下载:77/35  |  提交时间:2023/08/03
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:208/70  |  提交时间:2023/07/06
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:114/32  |  提交时间:2023/06/27
Multi-UAV Cooperative Short-Range Combat via Attention-Based Reinforcement Learning using Individual Reward Shaping 会议论文
, Kyoto, Japan, October 23-27, 2022
作者:  Zhang TL(张天乐);  Qiu TH(丘腾海);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(896Kb)  |  收藏  |  浏览/下载:144/46  |  提交时间:2023/06/12
Motion optimization for a robotic fish based on adversarial structured control 会议论文
, Dali, China, 2019年12月6日-2019年12月8日
作者:  Yan, Shuaizheng;  Wang, Jian;  Wu, Zhengxing;  Yu, Junzhi;  Tan, Min
Adobe PDF(1051Kb)  |  收藏  |  浏览/下载:70/27  |  提交时间:2023/06/12
Adaptive Remote Sensing Image Attribute Learning for Active Object Detection 会议论文
, Milan, Italy, 2021.1.10-1.15
作者:  Xu, Nuo;  Huo, Chunlei;  Guo, Jiacheng;  Liu, Yiwei;  Wang, Jian;  Pan, Chunhong
Adobe PDF(481Kb)  |  收藏  |  浏览/下载:158/42  |  提交时间:2022/12/20
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:212/40  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:234/41  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance