CASIA OpenIR

浏览/检索结果: 共16条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Centralized Cooperative Exploration Policy for Continuous Control Tasks 会议论文
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/05/30
continuous control tasks  cooperative exploration  
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:3/0  |  提交时间:2024/05/30
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:4/0  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文
, 长沙, 2023-11
作者:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:2/0  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:4/0  |  提交时间:2024/05/28
Machine Learning Methods in Solving the Boolean Satisfiability Problem 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 5, 页码: 640-655
作者:  Wenxuan Guo;  Hui-Ling Zhen;  Xijun Li;  Wanqian Luo;  Mingxuan Yuan;  Yaohui Jin;  Junchi Yan
Adobe PDF(1518Kb)  |  收藏  |  浏览/下载:24/7  |  提交时间:2024/04/23
Machine learning (ML), Boolean satisfiability (SAT), deep learning, graph neural networks (GNNs), combinatorial optimization  
RTDOD: A large-scale RGB-thermal domain-incremental object detection dataset for UAVs 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 9
作者:  Feng, Hangtao;  Zhang, Lu;  Zhang, Siqi;  Wang, Dong;  Yang, Xu;  Liu, Zhiyong
Adobe PDF(3013Kb)  |  收藏  |  浏览/下载:73/0  |  提交时间:2024/02/22
Domain -incremental object detection  Dataset  RGB-T dataset  Object detection dataset  UAVs dataset  Object detection  
Synergetic learning for unknown nonlinear H. control using neural networks 期刊论文
NEURAL NETWORKS, 2023, 卷号: 168, 页码: 287-299
作者:  Zhu, Liao;  Guo, Ping;  Wei, Qinglai
收藏  |  浏览/下载:82/0  |  提交时间:2023/12/21
H. control  Nonlinear systems  Adaptive dynamic programming  Temporal difference  Neural network  Data-driven  
Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 卷号: 70, 期号: 12, 页码: 12670-12679
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(1587Kb)  |  收藏  |  浏览/下载:127/0  |  提交时间:2023/11/17
Multiagent reinforcement learning (MARL)  path tracking control  reaction wheel  robotic fish  underwater robot  
Hierarchical Policy Learning With Demonstration Learning for Robotic Multiple Peg-in-Hole Assembly Tasks 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 卷号: 19, 期号: 10, 页码: 10254-10264
作者:  Yan, Shaohua;  Xu, De;  Tao, Xian
Adobe PDF(4845Kb)  |  收藏  |  浏览/下载:84/0  |  提交时间:2023/11/17
Assembly model  demonstration learning (DL)  force-based control algorithm  hierarchical reinforcement learning (HRL)  peg-in-hole assembly