CASIA OpenIR

Browse/Search results: 167 items in total, showing items 1-10

Learn to flap: foil non-parametric path planning via deep reinforcement learning [Journal article]
Journal of Fluid Mechanics, 2024, Volume: 984, Pages: A9
Authors:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang, Zhicheng;  Fan, Dixia
Adobe PDF (1892Kb)  |  Views/Downloads: 53/13  |  Submitted: 2024/06/07
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model [Journal article]
IEEE Transactions on Intelligent Transportation Systems, 2024, Pages: 10.1109/TITS.2024.3400227
Authors:  Zeyu Gao;  Yao Mu;  Chen Chen;  Jingliang Duan;  Ping Luo;  Yanfeng Lu;  Shengbo Eben Li
Adobe PDF (3954Kb)  |  Views/Downloads: 45/19  |  Submitted: 2024/06/06
End-to-end autonomous driving  deep reinforcement learning  world model  
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game [Journal article]
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, Pages: 1-13
Authors:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF (2144Kb)  |  Views/Downloads: 47/9  |  Submitted: 2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks [Journal article]
IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 1-16
Authors:  Liu, Yuqi;  Zhang, Qichao;  Gao, Yinfeng;  Zhao, Dongbin
Adobe PDF (22863Kb)  |  Views/Downloads: 48/16  |  Submitted: 2024/06/03
Reinforcement Learning  Autonomous Driving  Intersection Navigating  
Cooperative Task Scheduling and Planning Considering Resource Conflicts and Precedence Constraints [Journal article]
International Journal of Precision Engineering and Manufacturing, 2023, Pages: 1503-1516
Authors:  Li, Donghui;  Su, Hu;  Xu, Xinyi;  Wang, Qingbin;  Qin, Jie;  Zou, Wei
Adobe PDF (2513Kb)  |  Views/Downloads: 53/20  |  Submitted: 2024/05/28
Graph-guided deep hashing networks for similar patient retrieval [Journal article]
Computers in Biology and Medicine, 2024, Volume: 169, Pages: 107865
Authors:  Gu, Yifan;  Yang, Xuebing;  Sun, Mengxuan;  Wang, Chutong;  Yang, Hongyu;  Yang, Chao;  Wang, Jinwei;  Kong, Guilan;  Lv, Jicheng;  Zhang, Wensheng
Adobe PDF (1325Kb)  |  Views/Downloads: 50/20  |  Submitted: 2024/05/28
Similar patient retrieval  Deep hashing  Graph neural networks  Patient representation learning  Electronic health records  
Learning and Controlling Multiscale Dynamics in Spiking Neural Networks Using Recursive Least Square Modifications [Journal article]
IEEE TRANSACTIONS ON CYBERNETICS, 2024, Pages: 14
Authors:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF (8060Kb)  |  Views/Downloads: 99/22  |  Submitted: 2024/03/27
Direct dynamic programming (DDP)  Lorenz system  multiscale dynamics  point-to-point control  recursive least square (RLS)  spiking neural network (SNN)  
Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning [Journal article]
BIOMIMETICS, 2024, Volume: 9, Issue: 1, Pages: 19
Authors:  Wang, Yu;  Wang, Jian;  Kang, Song;  Yu, Junzhi
Adobe PDF (1553Kb)  |  Views/Downloads: 89/20  |  Submitted: 2024/03/26
biomimetic motion  biomimetic autonomous system  target following  deep reinforcement learning  predictive control  
Computational Experiments for Complex Social Systems: Experiment Design and Generative Explanation [Journal article]
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 4, Pages: 1022-1038
Authors:  Xiao Xue;  Deyu Zhou;  Xiangning Yu;  Gang Wang;  Juanjuan Li;  Xia Xie;  Lizhen Cui;  Fei-Yue Wang
Adobe PDF (7239Kb)  |  Views/Downloads: 74/17  |  Submitted: 2024/03/18
Agent-based modeling  computational experiments  cyber-physical-social systems (CPSS)  generative deduction  generative experiments  meta model  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints [Journal article]
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Volume: 54, Issue: 3, Pages: 1414-1426
Authors:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF (784Kb)  |  Views/Downloads: 125/24  |  Submitted: 2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration