CASIA OpenIR

浏览/检索结果: 共34条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:58/22  |  提交时间:2024/05/29
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文
, 长沙, 2023-11
作者:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:38/21  |  提交时间:2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:51/15  |  提交时间:2024/05/28
Social relation and physical lane aggregator: integrating social and physical features for multimodal motion prediction 期刊论文
Journal of Intelligent and Connected Vehicles, 2022, 卷号: 5, 期号: 3, 页码: 302-308
作者:  Chen QY(陈启元);  Wei ZB(魏泽兵);  Wang X(王晓);  Li LX(李灵犀);  Lv YS(吕宜生)
Adobe PDF(1118Kb)  |  收藏  |  浏览/下载:136/36  |  提交时间:2023/06/26
Multi-Target Encirclement with Collision Avoidance via Deep Reinforcement Learning using Relational Graphs 会议论文
, Philadelphia, PA, USA, May 23-27, 2022
作者:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(4277Kb)  |  收藏  |  浏览/下载:170/39  |  提交时间:2023/06/12
Conditional Generative Neural Decoding with Structured CNN Feature Prediction 会议论文
, 美国, 2020-4
作者:  Du CD(杜长德);  Du CY(杜长营);  He HG(何晖光)
Adobe PDF(1813Kb)  |  收藏  |  浏览/下载:118/32  |  提交时间:2023/05/05
Simultaneous neural spike encoding and decoding based on cross-modal dual deep generative model 会议论文
, Glasgow, United Kingdom, 2020/7/19
作者:  Qiongyi Zhou;  Changde Du;  Dan Li;  Haibao Wang;  Jian K. Liu;  Huiguang He
Adobe PDF(4135Kb)  |  收藏  |  浏览/下载:154/56  |  提交时间:2023/05/05
A Flexible and Efficient Loop Closure Detection Based on Motion Knowledge 会议论文
, 西安, 2021年5月31日-2021年6月4日
作者:  Liu BX(刘秉熙);  Tang FL(唐付林);  Fu YJ(傅禹杰);  Yang YQ(杨彦群);  Wu YH(吴毅红)
Adobe PDF(4279Kb)  |  收藏  |  浏览/下载:162/54  |  提交时间:2023/04/26