CASIA OpenIR

浏览/检索结果: 共19条,第1-10条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
SOZIL: Self-Optimal Zero-shot Imitation Learning 期刊论文
IEEE Trans on Cognitive and Developmental System, 2021, 卷号: 15, 期号: 1, 页码: 1
作者:  Peng Hao;  Tao Lu;  Shaowei Cui;  Junhang Wei;  Yinghao Cai;  Shuo Wang
Adobe PDF(13840Kb)  |  收藏  |  浏览/下载:182/32  |  提交时间:2022/04/08
imitation learning  learning from observation  keyframe demonstration  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 12
作者:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  收藏  |  浏览/下载:290/38  |  提交时间:2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:264/12  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Spiking Adaptive Dynamic Programming Based on Poisson Process for Discrete-Time Nonlinear Systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 11
作者:  Wei, Qinglai;  Han, Liyuan;  Zhang, Tielin
Adobe PDF(2904Kb)  |  收藏  |  浏览/下载:238/16  |  提交时间:2022/01/27
Maximum likelihood estimation (MLE)  Nonlinear systems  Optimal control  Poisson process  Spike train  Spiking Adaptive dynamic programming(SADP)  
A Novel Parallel Control Method for Continuous-Time Linear Output Regulation With Disturbances 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2021, 页码: 11
作者:  Wei, Qinglai;  Li, Hongyang;  Wang, Fei-Yue
Adobe PDF(1665Kb)  |  收藏  |  浏览/下载:281/46  |  提交时间:2022/01/27
Regulators  Regulation  Control systems  Stability analysis  Asymptotic stability  Eigenvalues and eigenfunctions  Control theory  Continuous-time linear systems  output regulation  parallel control theory  parallel regulators  parallel systems  
Data Augmented Deep Behavioral Cloning for Urban Traffic Control Operations Under a Parallel Learning Framework 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2021, 卷号: 23, 期号: 6, 页码: 5128-5137
作者:  Li, Xiaoshuang;  Ye, Peijun;  Jin, Junchen;  Zhu, Fenghua;  Wang, Fei-Yue
Adobe PDF(2319Kb)  |  收藏  |  浏览/下载:348/68  |  提交时间:2022/01/27
Generative adversarial networks  Data models  Gallium nitride  Task analysis  Complex systems  Intelligent traffic signal operations  deep behavioral cloning  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 12
作者:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:320/60  |  提交时间:2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Robust Cross-lingual Task-oriented Dialogue 期刊论文
ACM TRANSACTIONS ON ASIAN AND LOW-RESOURCE LANGUAGE INFORMATION PROCESSING, 2021, 卷号: 20, 期号: 6, 页码: 24
作者:  Xiang, Lu;  Zhu, Junnan;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(1935Kb)  |  收藏  |  浏览/下载:306/71  |  提交时间:2021/12/28
Cross-lingual  dialogue system  adversarial learning  knowledge  robustness  
Dynamic camera configuration learning for high-confidence active object detection 期刊论文
NEUROCOMPUTING, 2021, 卷号: 466, 页码: 113-127
作者:  Xu, Nuo;  Huo, Chunlei;  Zhang, Xin;  Cao, Yong;  Meng, Gaofeng;  Pan, Chunhong
Adobe PDF(4412Kb)  |  收藏  |  浏览/下载:338/68  |  提交时间:2021/12/28
Object detection  Active object detection  Deep reinforcement learning  Camera control  
A New Integral Critic Learning for Optimal Tracking Control with Applications to Boiler-Turbine Systems 期刊论文
OPTIMAL CONTROL APPLICATIONS & METHODS, 2021, 页码: 16
作者:  Wei, Qinglai;  Liu, Yujia;  Lu, Jingwei;  Ling, Jun;  Luan, Zhenhua;  Chen, Mingliang
收藏  |  浏览/下载:182/0  |  提交时间:2021/12/28
adaptive dynamic programming  boiler-turbine system  integral reinforcement learning  neural network  policy iteration