CASIA OpenIR

浏览/检索结果: 共30条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Learning to Manipulate Tools Using Deep Reinforcement Learning and Anchor Information 会议论文
, Jinghong, China, 05-09 December 2022
作者:  Junhang Wei;  Shaowei Cui;  Peng Hao;  Shuo Wang
Adobe PDF(933Kb)  |  收藏  |  浏览/下载:143/51  |  提交时间:2023/10/25
POPO: Pessimistic Offline Policy Optimization 会议论文
, Singapore, Singapore, 23-27 May 2022
作者:  He Q(何强);  Hou XW(侯新文);  Liu Y(刘禹)
Adobe PDF(1200Kb)  |  收藏  |  浏览/下载:176/36  |  提交时间:2022/06/27
reinforcement learning  offline optimization  out-of-distribution  
MTLDesc: Looking Wider to Describe Better 会议论文
, Virtual, February 22-28, 2022
作者:  Changwei Wang;  Rongtao Xu;  Yuyang Zhang;  Shibiao Xu;  Weiliang Meng;  Xiaopeng Zhang
Adobe PDF(7473Kb)  |  收藏  |  浏览/下载:182/39  |  提交时间:2022/04/06
DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning 会议论文
, Indianapolis, IN, USA, 19-22 September 2021
作者:  Jinhao Xi;  Fenghua Zhu;  Yuanyuan Chen;  Yisheng Lv;  Chang Tan;  Feiyue Wang
Adobe PDF(1652Kb)  |  收藏  |  浏览/下载:104/21  |  提交时间:2023/06/26
DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise 会议论文
, 西安, 2021.5.30-2021.6.5
作者:  Li, Jiayi;  Li, Boyao;  Lu, Tao;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(5599Kb)  |  收藏  |  浏览/下载:179/34  |  提交时间:2022/06/14
A Collaborative Communication-Qmix Approach for Large-scale Networked Traffic Signal Control 会议论文
, Indianapolis, IN, United States, 2021-9-19
作者:  Chen, Xiaoyu;  Xiong, Gang;  Lv, Yisheng;  Chen, yuanyuan;  Song, bing;  Wang, Feiyue
Adobe PDF(1208Kb)  |  收藏  |  浏览/下载:232/59  |  提交时间:2022/06/16
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文
, Shenzhen, China, 05-09 July 2021
作者:  Gong C(龚晨);  He Q(何强);  Bai YP(白云鹏);  Hou XW(侯新文);  Fan GL(范国梁);  Liu Y(刘禹)
Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:216/39  |  提交时间:2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance  
Locomotion Control of a Hybrid Propulsion Biomimetic Underwater Vehicle via Deep Reinforcement Learning 会议论文
, Xining, China, 15-19 July 2021
作者:  Zhang Tiandong;  Wang Rui;  Wang Yu;  Wang Shuo
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:48/18  |  提交时间:2023/06/14
LearnDA: Learnable Knowledge-Guided Data Augmentation for Event Causality Identification 会议论文
, Bangkok, Thailand (Online), August 1-6, 2021
作者:  Xinyu Zuo;  Pengfei Cao;  Yubo Chen;  Kang Liu;  Jun Zhao;  Weihua Peng;  Yuguang Chen
Adobe PDF(770Kb)  |  收藏  |  浏览/下载:189/36  |  提交时间:2021/06/18
Wd3: Taming the estimation bias in deep reinforcement learning 会议论文
, Baltimore, MD, USA, 2020-12
作者:  He Q(何强);  Hou XW(侯新文)
Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:190/37  |  提交时间:2022/06/27
deep reinforcement learning  estimation bias  neural networks