CASIA OpenIR

浏览/检索结果: 共20条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning 会议论文
, Indianapolis, IN, USA, 19-22 September 2021
作者:  Jinhao Xi;  Fenghua Zhu;  Yuanyuan Chen;  Yisheng Lv;  Chang Tan;  Feiyue Wang
Adobe PDF(1652Kb)  |  收藏  |  浏览/下载:140/25  |  提交时间:2023/06/26
Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning 会议论文
, Xining, China, 15-19 July 2021
作者:  Zhang Tiandong;  Wang Rui;  Wang Yu;  Wang Shuo
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:90/31  |  提交时间:2023/06/14
Benchmarking lane-changing decision-making for deep reinforcement learning 会议论文
, Guangzhou, China, 2021-11
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(1117Kb)  |  收藏  |  浏览/下载:162/55  |  提交时间:2023/05/30
Traffic Signal Control Using Offline Reinforcement Learning 会议论文
, Beijing, 2021-10
作者:  Dai, Xingyuan;  Zhao, Chen;  Li, Xiaoshuang;  Wang, Xiao;  Wang, Fei-Yue
Adobe PDF(1130Kb)  |  收藏  |  浏览/下载:211/66  |  提交时间:2022/10/11
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:252/45  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
A Collaborative Communication-Qmix Approach for Large-scale Networked Traffic Signal Control 会议论文
, Indianapolis, IN, United States, 2021-9-19
作者:  Chen, Xiaoyu;  Xiong, Gang;  Lv, Yisheng;  Chen, yuanyuan;  Song, bing;  Wang, Feiyue
Adobe PDF(1208Kb)  |  收藏  |  浏览/下载:284/74  |  提交时间:2022/06/16
ADEL: Autonomous Developmental Evolutionary Learning for Robotic Manipulation 会议论文
, 北京, 2021-8
作者:  Li YM(李一鸣)
Adobe PDF(9586Kb)  |  收藏  |  浏览/下载:161/27  |  提交时间:2022/06/16
A Multi-Task MRC Framework for Chinese Emotion Cause and Experiencer Extraction 会议论文
, Bratislava, Slovakia, 2021-09
作者:  Haoda Qian;  Qiudan Li;  Zaichuan Tang
Adobe PDF(79001Kb)  |  收藏  |  浏览/下载:368/128  |  提交时间:2022/06/14
DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise 会议论文
, 西安, 2021.5.30-2021.6.5
作者:  Li, Jiayi;  Li, Boyao;  Lu, Tao;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(5599Kb)  |  收藏  |  浏览/下载:211/41  |  提交时间:2022/06/14
Trajectory-based Split Hindsight Reverse Curriculum Learning 会议论文
, Prague, Czech Republic, 2021-9
作者:  Wu, Jiaxi;  Zhang, Dianmin;  Zhong, Shanlin;  Qiao, Hong
Adobe PDF(5094Kb)  |  收藏  |  浏览/下载:244/61  |  提交时间:2022/06/14
Reinforcement Learning  Curriculum Learning