CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Parallel reinforcement learning: a framework and case study 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2018, 卷号: 5, 期号: 4, 页码: 827-835
作者:  Teng Liu;  Bin TIan;  Yunfeng Ai;  Li Li;  Dongpu Cao;  Feiyue Wang
Adobe PDF(7005Kb)  |  收藏  |  浏览/下载:54/15  |  提交时间:2023/05/05
Stochastic Multiple Choice Learning for Acoustic Modeling 会议论文
, Rio de Janeiro, 巴西, 2018-07-08
作者:  Liu, Bin;  Nie, Shuai;  Liang, Shan;  Yang, Zhanlei;  Liu, Wenju
浏览  |  Adobe PDF(529Kb)  |  收藏  |  浏览/下载:191/68  |  提交时间:2020/06/08
Fine-level semantic labeling of large-scale 3d model by active learning 会议论文
, Verona, Italy, 2018-9-5~8
作者:  Zhou Y(周洋);  Shen SH(申抒含);  Hu ZY(胡占义)
Adobe PDF(5390Kb)  |  收藏  |  浏览/下载:330/123  |  提交时间:2019/04/30
Semantic Labeling  Active Learning  Large Scale  
Decentralized control for large-scale nonlinear systems with unknown mismatched interconnections via policy iteration 期刊论文
IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2018, 卷号: 48, 期号: 10, 页码: 1725-1735
作者:  Zhao B(赵博);  Ding Wang
Adobe PDF(768Kb)  |  收藏  |  浏览/下载:365/100  |  提交时间:2018/10/14
Adaptive Dynamic Programming  Decentralized Control  Large-scale Systems  Neural Networks  
Adaptive Constrained Optimal Control Design for Data-Based Nonlinear Discrete-Time Systems With Critic-Only Structure 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2018, 卷号: 29, 期号: 6, 页码: 2099-2111
作者:  Luo, Biao;  Liu, Derong;  Wu, Huai-Ning
Adobe PDF(1045Kb)  |  收藏  |  浏览/下载:382/117  |  提交时间:2018/10/10
Adaptive Control  Adaptive Dynamic Programming  Constraints  Critic-only  Data-based  Optimal Control  Q-learning  
Model-free Adaptive Dynamic Programming Based Near-optimal Decentralized Tracking Control of Reconfigurable Manipulators 期刊论文
INTERNATIONAL JOURNAL OF CONTROL AUTOMATION AND SYSTEMS, 2018, 卷号: 16, 期号: 2, 页码: 478-490
作者:  Zhao, Bo;  Li, Yuanchun
Adobe PDF(974Kb)  |  收藏  |  浏览/下载:322/107  |  提交时间:2018/10/10
Adaptive Dynamic Programming  Decentralized Tracking Control  Model-free  Near-optimal  Neural Networks  Reconfigurable Manipulators  
Adaptive dynamic programming-based stabilization of nonlinear systems with unknown actuator saturation 期刊论文
NONLINEAR DYNAMICS, 2018, 卷号: 93, 期号: 4, 页码: 2089-2103
作者:  Zhao, Bo;  Jia, Lihao;  Xia, Hongbing;  Li, Yuanchun
Adobe PDF(1069Kb)  |  收藏  |  浏览/下载:384/136  |  提交时间:2018/10/10
Adaptive Dynamic Programming  Unknown Actuator Saturation  Continuous-time Nonlinear Systems  Stabilizing Control  Neural Networks  
A hierarchical framework for ad inventory allocation in programmatic advertising markets 期刊论文
ELECTRONIC COMMERCE RESEARCH AND APPLICATIONS, 2018, 卷号: 31, 页码: 40-51
作者:  Juanjuan Li;  Xiaochun Ni;  Yong Yuan;  Fei-Yue Wang
Adobe PDF(1018Kb)  |  收藏  |  浏览/下载:452/171  |  提交时间:2018/09/20
Programmatic advertising  Ad inventory  Real-time bidding  Private marketplace  Header bidding  
Comprehensive comparison of online ADP algorithms for continuous-time optimal control 期刊论文
ARTIFICIAL INTELLIGENCE REVIEW, 2018, 卷号: 49, 期号: 4, 页码: 531-547
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(766Kb)  |  收藏  |  浏览/下载:413/183  |  提交时间:2017/09/13
Adaptive Dynamic Programming  Policy Iteration  Integral Reinforcement Learning  Experience Replay  Off-policy  
Data-Based Optimal Control for Weakly Coupled Nonlinear Systems Using Policy Iteration 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2018, 卷号: 48, 期号: 4, 页码: 511-521
作者:  Li, Chao;  Liu, Derong;  Wang, Ding
Adobe PDF(892Kb)  |  收藏  |  浏览/下载:893/577  |  提交时间:2017/05/03
Adaptive Dynamic Programming (Adp)  Neural Networks (Nns)  Optimal Control  Policy Iteration (Pi)  Unknown Dynamics  Weakly Coupled Systems