已选(0)清除
条数/页: 排序方式: |
| Advantage Constrained Proximal Policy Optimization in Multi-Agent Reinforcement Learning 会议论文 , 昆士兰, 2023-6 作者: Li WF(李伟凡); Zhu YH(朱圆恒); Zhao DB(赵冬斌) Adobe PDF(4104Kb)  |  收藏  |  浏览/下载:185/64  |  提交时间:2023/06/29 multi-agent reinforcement learning policy gradient |
| Curiosity-Driven and Victim-Aware Adversarial Policies 会议论文 , Austin TX, USA, December 5-9, 2022 作者: Gong C(龚晨); Yang Z(杨洲); Bai YP(白云鹏); Shi JK(史杰克); Sinha Arunesh; Xu BW(徐博文); Lo David; Hou XW(侯新文); Fan GL(范国梁) Adobe PDF(4090Kb)  |  收藏  |  浏览/下载:97/42  |  提交时间:2023/06/27 |
| Integrating Relational Knowledge With Text Sequences for Script Event Prediction 期刊论文 IEEE Transactions on Neural Networks and Learning Systems, 2023, 页码: early access 作者: Zikang Wang; Linjing Li; Daniel Zeng Adobe PDF(3215Kb)  |  收藏  |  浏览/下载:229/77  |  提交时间:2023/03/20 |
| Markovian Policy Network for Efficient Robot Learning 期刊 创刊日期: 2022, 主办者: Zhang FY(张丰一), Yurou Chen, Zhiyong Liu Adobe PDF(2453Kb)  |  收藏  |  浏览/下载:183/64  |  提交时间:2023/01/12 Efficient robot learning Structural prior knowledge Reinforcement learning Graph neural network |
| Improving the Data Quality for Credit Card Fraud Detection 会议论文 , Arlington, VA, USA, 2022-11 作者: Rongrong Jing; Hu Tian; Yidi Li; Xingwei Zhang; Xiaolong Zheng; Zhu Zhang; Daniel Dajun Zeng Adobe PDF(472Kb)  |  收藏  |  浏览/下载:313/64  |  提交时间:2022/06/17 |
| Dynamic-horizon model-based value estimation with latent imagination 期刊论文 IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14 作者: Wang JJ(王俊杰); Zhang QC(张启超); Zhao DB(赵冬斌) Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:113/48  |  提交时间:2023/05/30 Latent world model model-based value expansion (MVE) reinforcement learning reinforcement learning |
| HMDRL: Hierarchical Mixed Deep Reinforcement Learning to Balance Vehicle Supply and Demand 期刊论文 IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, 页码: 12 作者: Xi, Jinhao; Zhu, Fenghua; Ye, Peijun; Lv, Yisheng; Tang, Haina; Wang, Fei-Yue Adobe PDF(3316Kb)  |  收藏  |  浏览/下载:197/27  |  提交时间:2022/09/19 deep reinforcement learning online ride-hailing system hierarchical repositioning framework parallel coordination mechanism mixed state |
| SURRL: Structural Unsupervised Representations for Robot Learning 期刊 创刊日期: 2022, 主办者: Zhang FY(张丰一), Yurou Chen, Hong Qiao, Zhiyong Liu Adobe PDF(7817Kb)  |  收藏  |  浏览/下载:210/74  |  提交时间:2023/01/12 Reinforcement learning structural representations learning multi-task learning robotics |
| 融合自适应神经网络的机器人模型预测控制方法研究 学位论文 工学博士, 中国科学院自动化研究所: 中国科学院自动化研究所, 2022 作者: 康二龙 Adobe PDF(21541Kb)  |  收藏  |  浏览/下载:266/15  |  提交时间:2022/06/19 机器人控制 模型预测控制 自适应神经网络 机械臂 最优控制理论 |
| 白内障显微手术场中的手术操作识别方法研究 学位论文 工学硕士, 中国科学院自动化研究所: 中国科学院自动化研究所, 2022 作者: 陈华斌 Adobe PDF(19218Kb)  |  收藏  |  浏览/下载:188/4  |  提交时间:2022/06/13 白内障显微手术 机器人辅助手术 手术操作阶段识别 手术器械识别 二元组识别 |