CASIA OpenIR

Browse/Search Results: 51 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs Conference paper
Australia, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  Views/Downloads: 46/17  |  Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning Journal article
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 29/13  |  Submitted: 2024/06/25
MST: Masked self-supervised transformer for visual representation Conference paper
Beijing (virtual conference), 2021
Authors:  Li, Zhaowen;  Chen, Zhiyang;  Yang, Fan;  Li, Wei;  Zhu, Yousong;  Zhao, Chaoyang;  Zhao, Rui;  Deng, Rui;  Tang, Ming;  Wang, Jinqiao
Adobe PDF(1021Kb)  |  Views/Downloads: 66/18  |  Submitted: 2024/05/30
Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand Journal article
Cyborg and Bionic Systems, 2024, Volume: 5, Pages: 0104
Authors:  Yang YM(杨依明);  Wang ZC(王泽昌);  Xing DP(邢登鹏);  Wang P(王鹏)
Adobe PDF(3500Kb)  |  Views/Downloads: 44/17  |  Submitted: 2024/05/30
Efficient Spatiotemporal Transformer for Robotic Reinforcement Learning Journal article
IEEE Robotics and Automation Letters, 2022, Volume: 7, Issue: 3, Pages: 7982-7989
Authors:  Yang YM(杨依明);  Xing DP(邢登鹏);  Xu B(徐波)
Adobe PDF(2469Kb)  |  Views/Downloads: 52/20  |  Submitted: 2024/05/29
Learning to Match Features with Geometry-aware Pooling Conference paper
Changsha, Hunan Province, 2023-11
Authors:  Deng, Jiaxin;  Yang, Xu;  Zheng, Suiwu
Adobe PDF(414Kb)  |  Views/Downloads: 66/23  |  Submitted: 2024/05/29
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery Conference paper
Changsha, 2023-11
Authors:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  Views/Downloads: 53/28  |  Submitted: 2024/05/28
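Research Progress on Controllability Robustness of Complex Networks (复杂网络能控性鲁棒性研究进展) Journal article
Acta Automatica Sinica (自动化学报), 2022, Volume: 48, Issue: 10, Pages: 2374-2391
Authors:  Lou Yang(楼洋);  Li Junli(李均利);  Li Sheng(李升);  Deng Hao(邓浩)
Adobe PDF(2169Kb)  |  Views/Downloads: 58/20  |  Submitted: 2024/05/20
Complex networks, controllability robustness, attack, optimization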
Enhancing Multi-agent Coordination via Dual-channel Consensus Journal article
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 349-368
Authors:  Qingyang Zhang;  Kaishen Wang;  Jingqing Ruan;  Yiming Yang;  Dengpeng Xing;  Bo Xu
Adobe PDF(4997Kb)  |  Views/Downloads: 84/29  |  Submitted: 2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency
Offline Pre-trained Multi-agent Decision Transformer Journal article
Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 233-248
Authors:  Linghui Meng;  Muning Wen;  Chenyang Le;  Xiyun Li;  Dengpeng Xing;  Weinan Zhang;  Ying Wen;  Haifeng Zhang;  Jun Wang;  Yaodong Yang;  Bo Xu
Adobe PDF(2121Kb)  |  Views/Downloads: 73/17  |  Submitted: 2024/04/23
Pre-training model, multi-agent reinforcement learning (MARL), decision making, transformer, offline reinforcement learning