CASIA OpenIR

Browse/Search results: 17 items in total, showing items 1-10

Learning and Controlling Multiscale Dynamics in Spiking Neural Networks Using Recursive Least Square Modifications (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2024, Pages: 14
Authors: Wei, Qinglai; Han, Liyuan; Zhang, Tielin
Adobe PDF (8060 Kb) | Views/Downloads: 76/15 | Submitted: 2024/03/27
Direct dynamic programming (DDP)  Lorenz system  multiscale dynamics  point-to-point control  recursive least square (RLS)  spiking neural network (SNN)  
Computational Experiments for Complex Social Systems: Experiment Design and Generative Explanation (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 4, Pages: 1022-1038
Authors: Xiao Xue; Deyu Zhou; Xiangning Yu; Gang Wang; Juanjuan Li; Xia Xie; Lizhen Cui; Fei-Yue Wang
Adobe PDF (7239 Kb) | Views/Downloads: 64/15 | Submitted: 2024/03/18
Agent-based modeling  computational experiments  cyber-physical-social systems (CPSS)  generative deduction  generative experiments  meta model  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, Volume: 54, Issue: 3, Pages: 1414-1426
Authors: Li, Tao; Wei, Qinglai; Wang, Fei-Yue
Adobe PDF (784 Kb) | Views/Downloads: 110/17 | Submitted: 2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors: Chai, Jiajun; Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF (2469 Kb) | Views/Downloads: 61/3 | Submitted: 2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Hierarchical Multihop Reasoning on Knowledge Graphs (Journal article)
IEEE INTELLIGENT SYSTEMS, 2022, Volume: 37, Issue: 1, Pages: 71-78
Authors: Wang, Zikang; Li, Linjing; Zeng, Daniel Dajun
Adobe PDF (1656 Kb) | Views/Downloads: 369/87 | Submitted: 2022/07/25
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors: Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF (2838 Kb) | Views/Downloads: 239/8 | Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Adaptive Fault-tolerant Control for Trajectory Tracking and Rectification of Directional Drilling (Journal article)
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, Volume: 20, Issue: 1, Pages: 334-348
Authors: Zhang, Chi; Zou, Wei; Cheng, Ningbo; Gao, Junshan
Adobe PDF (3031 Kb) | Views/Downloads: 258/35 | Submitted: 2022/03/17
Fault-tolerant control (FTC)  integral sliding mode control (ISMC)  neural network (NN)  nonlinear control system  reinforcement learning (RL)  
Trip Purposes Mining From Mobile Signaling Data (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, Volume: 99, Issue: 99, Pages: 13
Authors: Li, Zhishuai; Xiong, Gang; Wei, Zebing; Zhang, Yu; Zheng, Meng; Liu, Xiaoli; Tarkoma, Sasu; Huang, Min; Lv, Yisheng; Wu, Chuheng
Adobe PDF (3962 Kb) | Views/Downloads: 454/78 | Submitted: 2022/01/27
Cellular networks  Trajectory  Semantics  Unsupervised learning  Supervised learning  Resource management  Public transportation  Trip purpose inference  cellular network data  latent Dirichlet allocation  travel behavior  big data  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors: Hu, Guangzheng; Zhu, Yuanheng; Zhao, Dongbin; Zhao, Mengchen; Hao, Jianye
Adobe PDF (4187 Kb) | Views/Downloads: 255/10 | Submitted: 2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 11
Authors: Wei, Qinglai; Han, Liyuan; Zhang, Tielin
Adobe PDF (2904 Kb) | Views/Downloads: 228/11 | Submitted: 2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming (SADP)