CASIA OpenIR

Browse/search results: 80 items in total, showing items 1-10

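Research on Self-Learning Parallel Control Methods for Two-Player Zero-Sum Dynamic Games (二人零和动态博弈的自学习平行控制方法研究) [Thesis]
2023
Author:  Zhu Zhenhua (朱振华)
Adobe PDF(1737Kb)  |  Views/Downloads: 129/5  |  Submitted: 2023/12/15
Adaptive dynamic programming  Parallel control  Zero-sum game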
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle [Journal article]
IEEE Transactions on Automation Science and Engineering, 2023, Pages: 1-10
Authors:  Ma, Ruichen;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Ming
Adobe PDF(2902Kb)  |  Views/Downloads: 165/53  |  Submitted: 2023/08/03
Autonomous Skill Learning of Water Polo Ball Heading for a Robotic Fish: Curriculum and Verification [Journal article]
IEEE Transactions on Cognitive and Developmental Systems, 2023, Volume: 15, Issue: 2, Pages: 865-876
Authors:  Zhang Tiandong;  Wang Rui;  Wang Shuo;  Wang Yu;  Cheng Long;  Zheng Gang;  Tan Min
Adobe PDF(3052Kb)  |  Views/Downloads: 112/38  |  Submitted: 2023/06/14
Dynamic-horizon model-based value estimation with latent imagination [Journal article]
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14
Authors:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  Views/Downloads: 151/58  |  Submitted: 2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  
HMDRL: Hierarchical Mixed Deep Reinforcement Learning to Balance Vehicle Supply and Demand [Journal article]
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, Pages: 12
Authors:  Xi, Jinhao;  Zhu, Fenghua;  Ye, Peijun;  Lv, Yisheng;  Tang, Haina;  Wang, Fei-Yue
Adobe PDF(3316Kb)  |  Views/Downloads: 252/30  |  Submitted: 2022/09/19
deep reinforcement learning  online ride-hailing system  hierarchical repositioning framework  parallel coordination mechanism  mixed state  
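Research on Distributed Iterative Control Methods Based on Adaptive Dynamic Programming (基于自适应动态规划的分布式迭代控制方法研究) [Thesis]
Doctor of Engineering, School of Artificial Intelligence: University of Chinese Academy of Sciences, 2022
Author:  Li Hongyang (李洪阳)
Adobe PDF(3786Kb)  |  Views/Downloads: 268/26  |  Submitted: 2022/06/14
Adaptive dynamic programming, optimal control, distributed control, intelligent control, reinforcement learning  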
Meta-Residual Policy Learning: Zero-Trial Robot Skill Adaptation via Knowledge Fusion [Journal article]
IEEE Robotics and Automation Letters, 2022, Volume: 7, Issue: 7, Pages: 3656-3663
Authors:  Peng Hao;  Tao Lu;  Shaowei Cui;  Junhang Wei;  Yinghao Cai;  Shuo Wang
Adobe PDF(1750Kb)  |  Views/Downloads: 206/38  |  Submitted: 2022/04/08
meta-learning  residual learning  
Model-Free Adaptive Optimal Control for Unknown Nonlinear Multiplayer Nonzero-Sum Game [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 2, Pages: 879-892
Authors:  Wei, Qinglai;  Zhu, Liao;  Song, Ruizhuo;  Zhang, Pinjia;  Liu, Derong;  Xiao, Jun
Views/Downloads: 241/0  |  Submitted: 2022/03/17
Heuristic algorithms  Nonlinear systems  Optimal control  Mathematical model  Dynamic programming  Games  Adaptive systems  Adaptive dynamic programming (ADP)  globalized dual-heuristic dynamic programming (GDHP)  multiplayer nonzero-sum game (MP-NZSG)  neural network (NN)  
Hyperparameter Configuration Learning for Ship Detection From Synthetic Aperture Radar Images [Journal article]
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, Volume: 19, Pages: 5
Authors:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Cao, Yong;  Pan, Chunhong
Adobe PDF(4808Kb)  |  Views/Downloads: 293/72  |  Submitted: 2022/02/16
Radar polarimetry  Synthetic aperture radar  Marine vehicles  Training  Feature extraction  Optimization  Optical sensors  Hyperparameter configuration learning (HCL)  object detection  reinforcement learning (RL)  synthetic aperture radar (SAR)  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  Views/Downloads: 231/24  |  Submitted: 2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II