已选(0)清除
条数/页: 排序方式: |
| Learning State-Specific Action Masks for Reinforcement Learning 期刊论文 Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60 作者: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:36/15  |  提交时间:2024/07/05 reinforcement learning exploration efficiency space reduction |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:21/9  |  提交时间:2024/06/25 |
| Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking 会议论文 , Virtual, United States, 2020-06-14至2020-06-19 作者: Gao, Jin; Hu, Weiming; Lu, Yan Adobe PDF(468Kb)  |  收藏  |  浏览/下载:53/20  |  提交时间:2024/06/21 |
| Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文 Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104 作者: Yang YM(杨依明); Wang ZC(王泽昌); Xing DP(邢登鹏); Wang P(王鹏) Adobe PDF(3500Kb)  |  收藏  |  浏览/下载:36/16  |  提交时间:2024/05/30 |
| Cooperative Object Transportation for Second-order Multi-robot Systems in Dynamic Environment 会议论文 Proceedings of the 42nd Chinese Control Conference, 天津, 2023-7-24 作者: Cai, Qiang; Ai, Xiaolin; Liu, Tianqi; Pu, zhiqiang Adobe PDF(3418Kb)  |  收藏  |  浏览/下载:51/21  |  提交时间:2024/05/28 |
| A Robotized Soft Endoscope with Stereo Vision for Upper Gastrointestinal Endoscopic Submucosal Dissection (ESD) 会议论文 , Sydney, Australia, 24-27 July 2023 作者: Chen, Jian; Wang, Shuai; Zhao, Qingxiang; Chen, Mingcong; Liu, Hongbin Adobe PDF(5311Kb)  |  收藏  |  浏览/下载:44/13  |  提交时间:2024/05/28 |
| Second-Order Global Attention Networks for Graph Classification and Regression 会议论文 , Beijing, China, August 27-28, 2022 作者: Hu Fenyu; Cui Zeyu; Wu Shu; Liu Qiang; Wu Jinlin; Wang Liang; Tan Tieniu Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:227/73  |  提交时间:2023/07/06 |
| Consensus Control of Multi-Agent Systems With Two-Way Switching Directed Topology 会议论文 , 北京, 2020-12-5 作者: Wang Xin; Wei Qinglai; Song Ruizhuo Adobe PDF(898Kb)  |  收藏  |  浏览/下载:108/43  |  提交时间:2023/06/28 |
| Stable Training of Bellman Error in Reinforcement Learning 会议论文 , Thailand, November 18–22 作者: Gong C(龚晨); Bai YP(白云鹏); Hou XW(侯新文); Ji XH(季晓慧) Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:132/37  |  提交时间:2023/06/27 |
| Wide-Sense Stationary Policy Optimization with Bellman Residual on Video Games 会议论文 , Shenzhen, China, 05-09 July 2021 作者: Gong C(龚晨); He Q(何强); Bai YP(白云鹏); Hou XW(侯新文); Fan GL(范国梁); Liu Y(刘禹) Adobe PDF(2780Kb)  |  收藏  |  浏览/下载:252/45  |  提交时间:2022/06/27 Video Game Reinforcement Learning Quantile Regression Bellman residual Wasserstein Distance |