CASIA OpenIR

Browse/Search Results: 40 items in total, showing items 1-10

Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning (conference paper)
Orlando, FL, USA, 2023-11
Authors: Wang, Yuxiao; Dai, Xingyuan; Wang, Kara; Ali, Hub; Zhu, Fenghua
Adobe PDF(1410Kb)  |  Views/Downloads: 47/11  |  Submitted: 2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Explainable Reinforcement Learning via a Causal World Model (conference paper)
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, Macao, China, 2023-08-22
Authors: Yu ZY(余忠蔚); Ruan JQ(阮景晴); Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  Views/Downloads: 52/23  |  Submitted: 2024/05/28
Reinforcement Learning  Explainable Artificial Intelligence  Causal Inference  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery (conference paper)
Changsha, 2023-11
Authors: Kaishen Wang; Jingqing Ruan; Qingyang Zhang; Dengpeng Xing
Adobe PDF(2044Kb)  |  Views/Downloads: 40/22  |  Submitted: 2024/05/28
A Novel Geometric Calibration Method for Active Stereovision System (conference paper)
Lyon (France), 2021-8
Authors: Jierui Liu; Xilong Liu; Zhiqiang Cao; Zhonghui Li; Junzhi Yu
Adobe PDF(1418Kb)  |  Views/Downloads: 33/16  |  Submitted: 2024/05/28
Knowledge Mining and Transferring for Domain Adaptive Object Detection (conference paper)
Virtual Conference, 2021-10
Authors: Tian Kun; Zhang Chenghao; Wang Ying; Xiang Shiming; Pan Chunhong
Adobe PDF(1462Kb)  |  Views/Downloads: 30/12  |  Submitted: 2024/05/28
Towards Better Word Importance Ranking in Textual Adversarial Attacks (conference paper)
Gold Coast, Australia, June 18-23, 2023
Authors: Shi, Jiahui; Li, Linjing; Zeng, Daniel Dajun
Adobe PDF(932Kb)  |  Views/Downloads: 268/111  |  Submitted: 2023/09/27
Stable Training of Bellman Error in Reinforcement Learning (conference paper)
Thailand, November 18–22
Authors: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  Views/Downloads: 131/37  |  Submitted: 2023/06/27
Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning (conference paper)
Guilin, China, 2022-7-9
Authors: Shan QF(单钦锋); Wang WJ(王伟杰); Guo DF(郭丁飞); Sun XR(孙向荣); Jia LH(贾立好)
Adobe PDF(494Kb)  |  Views/Downloads: 160/51  |  Submitted: 2023/06/05
Deep learning  Mechatronics  Navigation  Reinforcement learning  Cost function  Real-time systems  Trajectory  
Wd3: Taming the estimation bias in deep reinforcement learning (conference paper)
Baltimore, MD, USA, 2020-12
Authors: He Q(何强); Hou XW(侯新文)
Adobe PDF(2006Kb)  |  Views/Downloads: 235/50  |  Submitted: 2022/06/27
deep reinforcement learning  estimation bias  neural networks  
Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games (conference paper)
Shenzhen, China, 05-09 July 2021
Authors: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹)
Adobe PDF(2780Kb)  |  Views/Downloads: 251/45  |  Submitted: 2022/06/27
Video Game  Reinforcement Learning  Quantile Regression  Bellman residual  Wasserstein Distance