CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:63/22  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 卷号: 35, 期号: 3, 页码: 3251 - 3264
作者:  Wei, Qinglai;  Li, Tao
Adobe PDF(8471Kb)  |  收藏  |  浏览/下载:69/25  |  提交时间:2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:130/26  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
A Parallel Control Method For Zero-Sum Games With Unknown Time-varying System 期刊论文
The International Journal of Intelligent Control and Systems, 2023, 页码: 5页
作者:  Qinglai Wei;  Zhenhua Zhu;  Jie Zhang;  Feiyue Wang
Adobe PDF(470Kb)  |  收藏  |  浏览/下载:170/68  |  提交时间:2023/12/15
二人零和动态博弈的自学习平行控制方法研究 学位论文
, 2023
作者:  朱振华
Adobe PDF(1737Kb)  |  收藏  |  浏览/下载:185/6  |  提交时间:2023/12/15
自适应动态规划  平行控制  零和博弈  
Large sequence models for sequential decision-making: a survey 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18
作者:  Wen, Muning;  Lin, Runji;  Wang, Hanjing;  Yang, Yaodong;  Wen, Ying;  Mai, Luo;  Wang, Jun;  Zhang, Haifeng;  Zhang, Weinan
Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:163/9  |  提交时间:2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
A Survey on Reinforcement Learning Methods in Bionic Underwater Robots 期刊论文
BIOMIMETICS, 2023, 卷号: 8, 期号: 2, 页码: 29
作者:  Tong, Ru;  Feng, Yukai;  Wang, Jian;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Adobe PDF(1260Kb)  |  收藏  |  浏览/下载:158/21  |  提交时间:2023/11/17
bionic underwater robot  reinforcement learning  robotic fish  intelligent control  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:66/5  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Human-Like Decision-Making of Autonomous Vehicles in Dynamic Traffic Scenarios 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 10, 页码: 1905-1917
作者:  Tangyike Zhang;  Junxiang Zhan;  Jiamin Shi;  Jingmin Xin;  Nanning Zheng
Adobe PDF(2815Kb)  |  收藏  |  浏览/下载:193/71  |  提交时间:2023/09/07
Autonomous vehicles  decision-making  driving behavior  human-like driving  
Position Errors and Interference Prediction-Based Trajectory Tracking for Snake Robots 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 9, 页码: 1810-1821
作者:  Dongfang Li;  Yilong Zhang;  Ping Li;  Rob Law;  Zhengrong Xiang;  Xin Xu;  Limin Zhu;  Edmond Q. Wu
Adobe PDF(19961Kb)  |  收藏  |  浏览/下载:166/35  |  提交时间:2023/08/10
Anti-sideslip  compensation  snake robot  trajectory tracking