CASIA OpenIR

Browse/Search results: 15 items in total, showing items 1-10

UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF (3402Kb)  |  Views/Downloads: 249/26  |  Submitted: 2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 11
Authors:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF (2904Kb)  |  Views/Downloads: 193/1  |  Submitted: 2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
A Novel Parallel Control Method for Continuous-Time Linear Output Regulation With Disturbances (journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2021, Pages: 11
Authors:  Wei, Qinglai;  Li, Hongyang;  Wang, Fei-Yue
Adobe PDF (1665Kb)  |  Views/Downloads: 248/40  |  Submitted: 2022/01/27
Regulators  Regulation  Control systems  Stability analysis  Asymptotic stability  Eigenvalues and eigenfunctions  Control theory  Continuous-time linear systems  output regulation  parallel control theory  parallel regulators  parallel systems  
Data Augmented Deep Behavioral Cloning for Urban Traffic Control Operations Under a Parallel Learning Framework (journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, Volume: 23, Issue: 6, Pages: 5128-5137
Authors:  Li, Xiaoshuang;  Ye, Peijun;  Jin, Junchen;  Zhu, Fenghua;  Wang, Fei-Yue
Adobe PDF (2319Kb)  |  Views/Downloads: 308/61  |  Submitted: 2022/01/27
Generative adversarial networks  Data models  Gallium nitride  Task analysis  Complex systems  Intelligent traffic signal operations  deep behavioral cloning  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target (journal article)
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (1431Kb)  |  Views/Downloads: 288/51  |  Submitted: 2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Structure transforming for constructing constraint force field in musculoskeletal robot (journal article)
ASSEMBLY AUTOMATION, 2021, Pages: 12
Authors:  Zhong, Shanlin;  Chen, Ziyu;  Zhou, Junjie
Adobe PDF (2152Kb)  |  Views/Downloads: 319/60  |  Submitted: 2021/12/28
High precision  Attractive region in environment  Constraint force field  Musculoskeletal robot  
A New Integral Critic Learning for Optimal Tracking Control with Applications to Boiler-Turbine Systems (journal article)
OPTIMAL CONTROL APPLICATIONS & METHODS, 2021, Pages: 16
Authors:  Wei, Qinglai;  Liu, Yujia;  Lu, Jingwei;  Ling, Jun;  Luan, Zhenhua;  Chen, Mingliang
Views/Downloads: 163/0  |  Submitted: 2021/12/28
adaptive dynamic programming  boiler-turbine system  integral reinforcement learning  neural network  policy iteration  
Improving Domain-adaptive Person Re-identification by Dual-alignment Learning with Camera-aware Image Generation (journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2021, Volume: 31, Issue: 11, Pages: 4334-4346
Authors:  Chenyang Zhang;  Yongqiang Tang;  Zhizhong Zhang;  Ding Li;  Xuebing Yang;  Wensheng Zhang
Adobe PDF (2538Kb)  |  Views/Downloads: 341/83  |  Submitted: 2021/06/23
Person Re-identification  Convolutional Neural Networks  Generative Adversarial Networks  Mutual Information  
MLRNN: Taxi Demand Prediction Based on Multi-Level Deep Learning and Regional Heterogeneity Analysis (journal article)
IEEE Transactions on Intelligent Transportation Systems, 2021, Volume: 0, Issue: 0, Pages: 0
Authors:  Chizhan Zhang;  Fenghua Zhu;  Yisheng Lv;  Peijun Ye;  Feiyue Wang
Adobe PDF (4431Kb)  |  Views/Downloads: 229/53  |  Submitted: 2021/06/16
Taxi demand prediction  taxi zone clustering  heterogeneity analysis  deep learning  
Learning Control for Air Conditioning Systems via Human Expressions (journal article)
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2021, Volume: 68, Issue: 8, Pages: 7662-7671
Authors:  Wei, Qinglai;  Li, Tao;  Liu, Derong
Adobe PDF (1354Kb)  |  Views/Downloads: 222/1  |  Submitted: 2021/06/15
Adaptive dynamic programming  air conditioning control  deep learning (DL)  deep Q-network (DQN)  human expressions  optimal control  reinforcement learning (RL)  Q-learning