CASIA OpenIR

浏览/检索结果: 共58条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model 期刊论文
IEEE Transactions on Intelligent Transportation Systems, 2024, 页码: 10.1109/TITS.2024.3400227
作者:  Zeyu Gao;  Yao Mu;  Chen Chen;  Jingliang Duan;  Ping Luo;  Yanfeng Lu;  Shengbo Eben Li
Adobe PDF(3954Kb)  |  收藏  |  浏览/下载:32/10  |  提交时间:2024/06/06
End-to-end autonomous driving  deep reinforcement learning  world model  
A cooperation and decision-making framework in dynamic confrontation for multi-agent systems 期刊论文
Computers and Electrical Engineering, 2024, 页码: 118
作者:  Lexing Wang;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi
Adobe PDF(1302Kb)  |  收藏  |  浏览/下载:32/8  |  提交时间:2024/06/06
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:30/4  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1 - 16
作者:  Liu, Yuqi;  Zhang, Qichao;  Gao, Yinfeng;  Zhao, Dongbin
Adobe PDF(22863Kb)  |  收藏  |  浏览/下载:28/8  |  提交时间:2024/06/03
Reinforcement Learning  Autonomous Driving  Intersection Navigating  
A Local Obstacle Avoidance and Global Planning Method for the Follow-the-Leader Motion of Coiled Hyper-Redundant Manipulators 期刊论文
IEEE Transactions on Industrial Informatics, 2024, 卷号: 20, 期号: 4, 页码: 6591 - 6602
作者:  Mingrui, Luo;  Yunong, Tian;  En, Li;  Minghao, Chen;  Min, Tan
Adobe PDF(16892Kb)  |  收藏  |  浏览/下载:34/11  |  提交时间:2024/05/31
Cable-driven redundant manipulators  intelligent robot system  obstacle avoidance  path planning  
Toward the Intelligent, Safe Exploration of a Biomimetic Underwater Robot: Modeling, Planning, and Control 期刊论文
Biomimetics, 2024, 期号: 9, 页码: 126
作者:  Wang, Yu;  Wang, Jian;  Yu Lianyi;  Kong Shihan;  Yu Junzhi
Adobe PDF(1171Kb)  |  收藏  |  浏览/下载:40/12  |  提交时间:2024/05/30
Integrated Tracking Control of an Underwater Bionic Robot Based on Multimodal Motions 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 卷号: 54, 期号: 3, 页码: 1599-1610
作者:  Wang, Jian;  Wu, Zhengxing;  Zhang, Yang;  Kong, Shihan;  Tan, Min;  Yu, Junzhi
Adobe PDF(5090Kb)  |  收藏  |  浏览/下载:81/15  |  提交时间:2024/03/27
Disturbance observer (DOB)  fuzzy system  model predictive control (MPC)  tracking control  underwater bionic robot  
Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning 期刊论文
BIOMIMETICS, 2024, 卷号: 9, 期号: 1, 页码: 19
作者:  Wang, Yu;  Wang, Jian;  Kang, Song;  Yu, Junzhi
Adobe PDF(1553Kb)  |  收藏  |  浏览/下载:69/12  |  提交时间:2024/03/26
biomimetic motion  biomimetic autonomous system  target following  deep reinforcement learning  predictive control  
Computational Experiments for Complex Social Systems: Experiment Design and Generative Explanation 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 4, 页码: 1022-1038
作者:  Xiao Xue;  Deyu Zhou;  Xiangning Yu;  Gang Wang;  Juanjuan Li;  Xia Xie;  Lizhen Cui;  Fei-Yue Wang
Adobe PDF(7239Kb)  |  收藏  |  浏览/下载:64/15  |  提交时间:2024/03/18
Agent-based modeling  computational experiments  cyber-physical-social systems (CPSS)  generative deduction  generative experiments  meta model  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:110/17  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration