CASIA OpenIR

Browse/Search results: 14 items in total, showing items 1-10

Parallel Population and Parallel Human: A Cyber-Physical Social Approach [Monograph]
Hoboken, NJ, USA:IEEE Press and John Wiley & Sons, Inc., 2023
Authors:  Peijun Ye;  Fei-Yue Wang
Adobe PDF(544Kb)  |  Views/Downloads: 12/3  |  Submitted: 2024/05/31
Centralized Cooperative Exploration Policy for Continuous Control Tasks [Conference paper]
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
Authors:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  Views/Downloads: 9/2  |  Submitted: 2024/05/30
continuous control tasks  cooperative exploration  
Explainable Reinforcement Learning via a Causal World Model [Conference paper]
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, Macao, China, 2023-08-22
Authors:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  Views/Downloads: 10/4  |  Submitted: 2024/05/28
Reinforcement learning  Explainable artificial intelligence  Causal inference  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery [Conference paper]
Changsha, 2023-11
Authors:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  Views/Downloads: 6/2  |  Submitted: 2024/05/28
Cooperative Task Scheduling and Planning Considering Resource Conflicts and Precedence Constraints [Journal article]
International Journal of Precision Engineering and Manufacturing, 2023, Pages: 1503-1516
Authors:  Li, Donghui;  Su, Hu;  Xu, Xinyi;  Wang, Qingbin;  Qin, Jie;  Zou, Wei
Adobe PDF(2513Kb)  |  Views/Downloads: 7/3  |  Submitted: 2024/05/28
Parallel Learning Based Foundation Model for Networked Traffic Signal Control [Conference paper]
Bilbao, Bizkaia, Spain, 2022-9-24
Authors:  Zhao, Chen;  Dai, Xingyuan;  Chen, Yuanyuan;  Yilun, Lin;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1112Kb)  |  Views/Downloads: 4/2  |  Submitted: 2024/05/28
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints [Journal article]
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Volume: 54, Issue: 3, Pages: 1414-1426
Authors:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  Views/Downloads: 60/1  |  Submitted: 2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Network-Wide Traffic Signal Control Based on MARL With Hierarchical Nash-Stackelberg Game Model [Journal article]
IEEE ACCESS, 2023, Volume: 11, Pages: 145085-145100
Authors:  Shen, Hui;  Zhao, Hongxia;  Zhang, Zundong;  Yang, Xun;  Song, Yutong;  Liu, Xiaoming
Views/Downloads: 25/0  |  Submitted: 2024/02/22
Games  Roads  Approximation algorithms  Q-learning  Multi-agent systems  Process control  Optimization  Reinforcement learning  Traffic control  Network-wide traffic signal control  hierarchical game model  multi-agent reinforcement learning  
IDO: Instance dual-optimization for weakly supervised object detection [Journal article]
APPLIED INTELLIGENCE, 2023, Pages: 18
Authors:  Ren, Zhida;  Tang, Yongqiang;  Zhang, Wensheng
Adobe PDF(3668Kb)  |  Views/Downloads: 46/1  |  Submitted: 2023/11/17
Deep learning  Weakly supervised learning  Object detection  Multiple instance learning  
Data-Driven Optimal Output Cluster Synchronization Control of Heterogeneous Multi-Agent Systems [Journal article]
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2023, Pages: 11
Authors:  Li, Hongyang;  Wei, Qinglai
Views/Downloads: 61/0  |  Submitted: 2023/11/17
Output cluster synchronization control  data-driven control  adaptive dynamic programming  policy iteration  heterogeneous multi-agent systems  optimal control