CASIA OpenIR

浏览/检索结果: 共116条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Centralized Cooperative Exploration Policy for Continuous Control Tasks 会议论文
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/05/30
continuous control tasks  cooperative exploration  
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/05/30
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:4/0  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning 会议论文
, Queensland, Australia, 2023-6
作者:  Yang, Chen;  Yang, Guangkai;  Chen, Hao;  Zhang, Junge
Adobe PDF(3027Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/05/29
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:5/2  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文
, 长沙, 2023-11
作者:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/05/28
A Wire-driven Elastic Robotic Fish and its Design and CPG-Based Control 期刊论文
Journal of Intelligent & Robotic Systems, 2023, 卷号: 107, 期号: 1, 页码: 4
作者:  Liao Xiaocun;  Chao Zhou;  Jian Wang;  Junfeng Fan;  Zhuoliang Zhang
Adobe PDF(1749Kb)  |  收藏  |  浏览/下载:6/0  |  提交时间:2024/05/28
Robotic fish  Wire-driven mode  Elastic component  Kinematics model  Body wave  
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/05/28
Parallel Learning Based Foundation Model for Networked Traffic Signal Control 会议论文
, Bilbao, Bizkaia, Spain, 2022-9-24
作者:  Zhao, Chen;  Dai, Xingyuan;  Chen, Yuanyuan;  Yilun, Lin;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1112Kb)  |  收藏  |  浏览/下载:0/0  |  提交时间:2024/05/28
复杂工业过程非串级双速率组合分散运行优化控制 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 172-184
作者:  赵建国;  杨春雨
Adobe PDF(1648Kb)  |  收藏  |  浏览/下载:34/11  |  提交时间:2024/05/09
复杂工业过程  运行优化控制  奇异摄动理论  Q-学习  双速率