CASIA OpenIR

浏览/检索结果: 共32条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multi-Agent Reinforcement Learning for Extended Flexible Job Shop Scheduling 期刊论文
MACHINES, 2024, 卷号: 12, 期号: 1, 页码: 25
作者:  Peng, Shaoming;  Xiong, Gang;  Yang, Jing;  Shen, Zhen;  Tamir, Tariku Sinshaw;  Tao, Zhikun;  Han, Yunjun;  Wang, Fei-Yue
收藏  |  浏览/下载:50/0  |  提交时间:2024/03/13
production planning and scheduling  multi-agent reinforcement learning  flexible job shop  path flexibility  technological flexibility  
A Self-Attention-Based Deep Reinforcement Learning Approach for AGV Dispatching Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 12
作者:  Wei, Qinglai;  Yan, Yutian;  Zhang, Jie;  Xiao, Jun;  Wang, Cong
收藏  |  浏览/下载:215/0  |  提交时间:2023/01/09
Automated guided vehicle (AGV) dispatching  deep learning  reinforcement learning (RL)  self-attention  
Hierarchical Multihop Reasoning on Knowledge Graphs 期刊论文
IEEE INTELLIGENT SYSTEMS, 2022, 卷号: 37, 期号: 1, 页码: 71-78
作者:  Wang, Zikang;  Li, Linjing;  Zeng, Daniel Dajun
Adobe PDF(1656Kb)  |  收藏  |  浏览/下载:299/80  |  提交时间:2022/07/25
Finite-Time Asynchronous Event-Triggered Formation of UAVs with Semi-Markov-Type Topologies 期刊论文
SENSORS, 2022, 卷号: 22, 期号: 12, 页码: 19
作者:  Ma, Chao;  Zheng, Suiwu;  Xu, Tao;  Ji, Yidao
收藏  |  浏览/下载:173/0  |  提交时间:2022/07/25
finite-time formation  event-triggered formation  UAVs  semi-Markov topologies  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:207/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Attention Enhanced Reinforcement Learning for Multi agent Cooperation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF(2967Kb)  |  收藏  |  浏览/下载:313/43  |  提交时间:2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems  
Adaptive Fault-tolerant Control for Trajectory Tracking and Rectification of Directional Drilling 期刊论文
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2022, 卷号: 20, 期号: 1, 页码: 334-348
作者:  Zhang, Chi;  Zou, Wei;  Cheng, Ningbo;  Gao, Junshan
Adobe PDF(3031Kb)  |  收藏  |  浏览/下载:219/30  |  提交时间:2022/03/17
Fault-tolerant control (FTC)  integral sliding mode control (ISMC)  neural network (NN)  nonlinear control system  reinforcement learning (RL)  
Hyperparameter Configuration Learning for Ship Detection From Synthetic Aperture Radar Images 期刊论文
IEEE GEOSCIENCE AND REMOTE SENSING LETTERS, 2022, 卷号: 19, 页码: 5
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Cao, Yong;  Pan, Chunhong
Adobe PDF(4808Kb)  |  收藏  |  浏览/下载:304/75  |  提交时间:2022/02/16
Radar polarimetry  Synthetic aperture radar  Marine vehicles  Training  Feature extraction  Optimization  Optical sensors  Hyperparameter configuration learning (HCL)  object detection  reinforcement learning (RL)  synthetic aperture radar (SAR)  
Trip Purposes Mining From Mobile Signaling Data 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 卷号: 99, 期号: 99, 页码: 13
作者:  Li, Zhishuai;  Xiong, Gang;  Wei, Zebing;  Zhang, Yu;  Zheng, Meng;  Liu, Xiaoli;  Tarkoma, Sasu;  Huang, Min;  Lv, Yisheng;  Wu, Chuheng
Adobe PDF(3962Kb)  |  收藏  |  浏览/下载:386/72  |  提交时间:2022/01/27
Cellular networks  Trajectory  Semantics  Unsupervised learning  Supervised learning  Resource management  Public transportation  Trip purpose inference  cellular network data  latent Dirichlet allocation  travel behavior  big data  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:243/26  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II