CASIA OpenIR

浏览/检索结果: 共14条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:74/6  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Optimization-Based Post-Training Quantization With Bit-Split and Stitching 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 2, 页码: 2119-2135
作者:  Wang, Peisong;  Chen, Weihan;  He, Xiangyu;  Chen, Qiang;  Liu, Qingshan;  Cheng, Jian
Adobe PDF(921Kb)  |  收藏  |  浏览/下载:191/57  |  提交时间:2023/03/20
Deep neural networks  compression  quantization  post-training quantization  
Multi-View Multi-Label Fine-Grained Emotion Decoding From Human Brain Activity 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-15
作者:  Fu, Kaicheng;  Du, Changde;  Wang, Shengpei;  He, Huiguang
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:295/70  |  提交时间:2022/12/27
Fine-grained Emotion Decoding  Multi-view Learning  Multi-label Learning  Variational Autoencoder  Product of Experts  
Deep Learning for Unsupervised Anomaly Localization in Industrial Images: A Survey 期刊论文
IEEE TRANSACTIONS ON INSTRUMENTATION AND MEASUREMENT, 2022, 卷号: 71, 页码: 21
作者:  Tao, Xian;  Gong, Xinyi;  Zhang, Xin;  Yan, Shaohua;  Adak, Chandranath
Adobe PDF(7056Kb)  |  收藏  |  浏览/下载:247/0  |  提交时间:2022/09/19
Anomaly localization (AL)  deep learning  industrial inspection  literature survey  unsupervised learning  
Meta Graph Transformer: A Novel Framework for Spatial-Temporal Traffic Prediction 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 544-563
作者:  Ye, Xue;  Fang, Shen;  Sun, Fang;  Zhang, Chunxia;  Xiang, Shiming
Adobe PDF(3491Kb)  |  收藏  |  浏览/下载:246/32  |  提交时间:2022/09/19
Traffic prediction  Spatial-temporal modeling  Meta-learning  Attention mechanism  Deep learning  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:220/2  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Attention Enhanced Reinforcement Learning for Multi agent Cooperation 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 15
作者:  Pu, Zhiqiang;  Wang, Huimu;  Liu, Zhen;  Yi, Jianqiang;  Wu, Shiguang
Adobe PDF(2967Kb)  |  收藏  |  浏览/下载:331/48  |  提交时间:2022/06/06
Training  Reinforcement learning  Games  Scalability  Task analysis  Standards  Optimization  Attention mechanism  deep reinforcement learning (DRL)  graph convolutional networks  multi agent systems  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:228/4  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition 期刊论文
IEEE TRANSACTIONS ON GAMES, 2023, 卷号: 5, 期号: 1, 页码: 5 - 15
作者:  Zhentao Tang;  Yuanheng Zhu;  Dongbin Zhao;  Simon M. Lucas
Adobe PDF(7686Kb)  |  收藏  |  浏览/下载:284/66  |  提交时间:2021/07/05
Rolling horizon evolution  opponent model  reinforcement learning  supervised learning  fighting game  
Learning Control for Air Conditioning Systems via Human Expressions 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, 卷号: 68, 期号: 8, 页码: 7662-7671
作者:  Wei, Qinglai;  Li, Tao;  Liu, Derong
Adobe PDF(1354Kb)  |  收藏  |  浏览/下载:233/5  |  提交时间:2021/06/15
Adaptive dynamic programming  air conditioning control  deep learning (DL)  deep Q-network (DQN)  human expressions  optimal control  reinforcement learning (RL)  Q-learning