已选(0)清除
条数/页: 排序方式: |
| Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文 /, Orlando, FL, USA, 2023-11 作者: Wang, Yuxiao; Dai, Xingyuan; Wang, Kara; Ali, Hub; Zhu, Fenghua Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:47/11  |  提交时间:2024/06/11 Imitation Learning Trajectory Planning Deep Reinforcement Learning Autonomous Driving |
| Explainable Reinforcement Learning via a Causal World Model 会议论文 Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22 作者: Yu ZY(余忠蔚); Ruan JQ(阮景晴); Xing DP(邢登鹏) Adobe PDF(850Kb)  |  收藏  |  浏览/下载:52/23  |  提交时间:2024/05/28 强化学习 可解释人工智能 因果推理 |
| Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文 , 长沙, 2023-11 作者: Kaishen Wang; Jingqing Ruan; Qingyang Zhang; Dengpeng Xing Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:40/22  |  提交时间:2024/05/28 |
| A Novel Geometric Calibration Method for Active Stereovision System 会议论文 , Lyon (France), 2021-8 作者: Jierui Liu; Xilong Liu; Zhiqiang Cao; Zhonghui Li; Junzhi Yu Adobe PDF(1418Kb)  |  收藏  |  浏览/下载:33/16  |  提交时间:2024/05/28 |
| Knowledge Mining and Transferring for Domain Adaptive Object Detection 会议论文 , Virtual Conference, 2021-10 作者: Tian Kun; Zhang Chenghao; Wang Ying; Xiang Shiming; Pan Chunhong Adobe PDF(1462Kb)  |  收藏  |  浏览/下载:30/12  |  提交时间:2024/05/28 |
| Towards Better Word Importance Ranking in Textual Adversarial Attacks 会议论文 , Gold Coast, Australia, June 18-23, 2023 作者: Shi, Jiahui; Li, Linjing; Zeng, Daniel Dajun Adobe PDF(932Kb)  |  收藏  |  浏览/下载:268/111  |  提交时间:2023/09/27 |
| Stable Training of Bellman Error in Reinforcement Learning 会议论文 , Thailand, November 18–22 作者: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧) Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:131/37  |  提交时间:2023/06/27 |
| Improving the Ability of Robots to Navigate Through Crowded Environments Safely using Deep Reinforcement Learning 会议论文 , 中国桂林, 2022-7-9 作者: Shan QF(单钦锋); Wang WJ(王伟杰); Guo DF(郭丁飞); Sun XR(孙向荣); Jia LH(贾立好) Adobe PDF(494Kb)  |  收藏  |  浏览/下载:160/51  |  提交时间:2023/06/05 Deep learning Mechatronics Navigation Reinforcement learning Cost function Real-time systems Trajectory |
| Wd3: Taming the estimation bias in deep reinforcement learning 会议论文 , Baltimore, MD, USA, 2020-12 作者: He Q(何强); Hou XW(侯新文) Adobe PDF(2006Kb)  |  收藏  |  浏览/下载:235/50  |  提交时间:2022/06/27 deep reinforcement learning estimation bias neural networks |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文 , Shenzhen, China, 05-09 July 2021 作者: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹) Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:251/45  |  提交时间:2022/06/27 Video Game Reinforcement Learning Quantile Regression Bellman residual Wasserstein Distance |