CASIA OpenIR

浏览/检索结果: 共30条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Learning and Controlling Multiscale Dynamics in Spiking Neural Networks Using Recursive Least Square Modifications 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 14
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(8060Kb)  |  收藏  |  浏览/下载:54/8  |  提交时间:2024/03/27
Direct dynamic programming (DDP)  Lorenz system  multiscale dynamics  point-to-point control  recursive least square (RLS)  spiking neural network (SNN)  
Hedonic Coalition Formation for Distributed Task Allocation in Heterogeneous Multi-agent System 期刊论文
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2024, 页码: 13
作者:  Wang, Lexing;  Qiu, Tenghai;  Pu, Zhiqiang;  Yi, Jianqiang;  Zhu, Jinying;  Yuan, Wanmai
Adobe PDF(2578Kb)  |  收藏  |  浏览/下载:96/8  |  提交时间:2024/03/13
Coalition formation  hedonic games  heterogeneous agents  Nash stable  task allocation  
Multi-Agent Reinforcement Learning for Extended Flexible Job Shop Scheduling 期刊论文
MACHINES, 2024, 卷号: 12, 期号: 1, 页码: 25
作者:  Peng, Shaoming;  Xiong, Gang;  Yang, Jing;  Shen, Zhen;  Tamir, Tariku Sinshaw;  Tao, Zhikun;  Han, Yunjun;  Wang, Fei-Yue
Adobe PDF(199Kb)  |  收藏  |  浏览/下载:73/4  |  提交时间:2024/03/13
production planning and scheduling  multi-agent reinforcement learning  flexible job shop  path flexibility  technological flexibility  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:83/9  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
IDO: Instance dual-optimization for weakly supervised object detection 期刊论文
APPLIED INTELLIGENCE, 2023, 页码: 18
作者:  Ren, Zhida;  Tang, Yongqiang;  Zhang, Wensheng
Adobe PDF(3668Kb)  |  收藏  |  浏览/下载:60/4  |  提交时间:2023/11/17
Deep learning  Weakly supervised learning  Object detection  Multiple instance learning  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  收藏  |  浏览/下载:52/0  |  提交时间:2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Decentralized Autonomous Operations and Organizations in TransVerse: Federated Intelligence for Smart Mobility 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, 卷号: 53, 期号: 4, 页码: 2062-2072
作者:  Zhao, Chen;  Dai, Xingyuan;  Lv, Yisheng;  Niu, Jinglong;  Lin, Yilun
Adobe PDF(1921Kb)  |  收藏  |  浏览/下载:245/4  |  提交时间:2023/02/22
Intelligent Transportation Systems (ITS)  Artificial Systems, Computational Experiments, Parallel Execution (ACP)  Cyber–Physical–Social Systems (CPSS)  
AHDet: A dynamic coarse-to-fine gaze strategy for active object detection 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 522-532
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Pan, Chunhong
Adobe PDF(2664Kb)  |  收藏  |  浏览/下载:320/62  |  提交时间:2022/09/19
Object detection  Active object detection  Deep reinforcement learning  Convolutional neural networks  
Hierarchical Multihop Reasoning on Knowledge Graphs 期刊论文
IEEE INTELLIGENT SYSTEMS, 2022, 卷号: 37, 期号: 1, 页码: 71-78
作者:  Wang, Zikang;  Li, Linjing;  Zeng, Daniel Dajun
Adobe PDF(1656Kb)  |  收藏  |  浏览/下载:336/83  |  提交时间:2022/07/25
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:223/3  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum