CASIA OpenIR

Browse/search results: 31 items in total, showing items 1-10

Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF (996 KB)  |  Views/Downloads: 6/1  |  Submitted: 2024/06/05
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning (Journal article)
Connection Science, 2023, vol. 35, no. 1, article 2174078
Authors:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF (831 KB)  |  Views/Downloads: 10/2  |  Submitted: 2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Cooperative Task Scheduling and Planning Considering Resource Conflicts and Precedence Constraints (Journal article)
International Journal of Precision Engineering and Manufacturing, 2023, pp. 1503-1516
Authors:  Li, Donghui;  Su, Hu;  Xu, Xinyi;  Wang, Qingbin;  Qin, Jie;  Zou, Wei
Adobe PDF (2513 KB)  |  Views/Downloads: 9/4  |  Submitted: 2024/05/28
Machine Learning Methods in Solving the Boolean Satisfiability Problem (Journal article)
Machine Intelligence Research, 2023, vol. 20, no. 5, pp. 640-655
Authors:  Wenxuan Guo;  Hui-Ling Zhen;  Xijun Li;  Wanqian Luo;  Mingxuan Yuan;  Yaohui Jin;  Junchi Yan
Adobe PDF (1518 KB)  |  Views/Downloads: 31/8  |  Submitted: 2024/04/23
Machine learning (ML), Boolean satisfiability (SAT), deep learning, graph neural networks (GNNs), combinatorial optimization  
A Review and Outlook on Predictive Cruise Control of Vehicles and Typical Applications Under Cloud Control System (Journal article)
Machine Intelligence Research, 2023, vol. 20, no. 5, pp. 614-639
Authors:  Bolin Gao;  Keke Wan;  Qien Chen;  Zhou Wang;  Rui Li;  Yu Jiang;  Run Mei;  Yinghui Luo;  Keqiang Li
Adobe PDF (12630 KB)  |  Views/Downloads: 34/6  |  Submitted: 2024/04/23
Predictive cruise control (PCC), cloud control system (CCS), cooperative control, efficient operation, intelligent connected vehicle  
The Road Ahead: DAO-Secured V2X Infrastructures for Safe and Smart Vehicular Management (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT VEHICLES, 2023, vol. 8, no. 12, pp. 4674-4677
Authors:  Dai, Xingyuan;  Vallati, Mauro;  Guo, Rongge;  Wang, Yutong;  Han, Shuangshuang;  Lin, Yilun
Adobe PDF (351 KB)  |  Views/Downloads: 34/1  |  Submitted: 2024/03/26
Decentralized autonomous organization and operation (DAO)  vehicle-to-everything (V2X)  infrastructures  vehicular management  artificial intelligence  
A Streamlined 3-D Magnetic Particle Imaging System With a Two-Stage Excitation Feed-Through Compensation Strategy (Journal article)
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2023, vol. 72, pp. 1-10
Authors:  Yin L(尹琳);  Li W(李玮);  Bian ZW(卞忠伟);  Chen ZW(陈梓威);  Liu YJ(刘晏君);  Zhong J(钟景);  Zhang SX(张水兴);  Du Y(杜洋);  Hui H(惠辉);  Tian J(田捷)
Adobe PDF (3893 KB)  |  Views/Downloads: 50/22  |  Submitted: 2024/03/26
3-D imaging  compensation strategy  magnetic particle imaging (MPI)  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, vol. 54, no. 3, pp. 1414-1426
Authors:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF (784 KB)  |  Views/Downloads: 69/4  |  Submitted: 2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Multi-task safe reinforcement learning for navigating intersections in dense traffic (Journal article)
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, vol. 360, no. 17, pp. 13737-13760
Authors:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF (3095 KB)  |  Views/Downloads: 47/2  |  Submitted: 2024/02/22
Network-Wide Traffic Signal Control Based on MARL With Hierarchical Nash-Stackelberg Game Model (Journal article)
IEEE ACCESS, 2023, vol. 11, pp. 145085-145100
Authors:  Shen, Hui;  Zhao, Hongxia;  Zhang, Zundong;  Yang, Xun;  Song, Yutong;  Liu, Xiaoming
Views/Downloads: 29/0  |  Submitted: 2024/02/22
Games  Roads  Approximation algorithms  Q-learning  Multi-agent systems  Process control  Optimization  Reinforcement learning  Traffic control  Network-wide traffic signal control  hierarchical game model  multi-agent reinforcement learning