CASIA OpenIR

浏览/检索结果: 共23条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning to Navigate in Human Environments via Deep Reinforcement Learning 会议论文
, Sydney, Australia, 2019-12-12至2019-12-15
作者:  Xingyuan Gao;  Shiying Sun;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1298Kb)  |  收藏  |  浏览/下载:149/44  |  提交时间:2022/03/31
Feature Aggregation With Reinforcement Learning for Video-Based Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2019, 卷号: 30, 期号: 12, 页码: 3847-3852
作者:  Zhang, Wei;  He, Xuanyu;  Lu, Weizhi;  Qiao, Hong;  Li, Yibin
收藏  |  浏览/下载:287/0  |  提交时间:2020/03/30
Feature extraction  Task analysis  Cameras  Noise measurement  Learning systems  Reinforcement learning  Feature aggregation  reinforcement learning (RL)  sequential decision making  video-based person re-identification (re-id)  
DetNAS: Backbone Search for Object Detection 会议论文
, 加拿大温哥华, 2019-12-8
作者:  Chen, Yukang;  Yang, Tong;  Zhang, Xiangyu;  Meng, Gaofeng;  Xiao, Xinyu;  Sun, Jian
浏览  |  Adobe PDF(1366Kb)  |  收藏  |  浏览/下载:265/73  |  提交时间:2020/06/09
Parallel Adaptive Critic Designs of Optimal Control for Ice-Storage Air Conditioning Systems 会议论文
, Xiamen, China, 2019-12
作者:  Liao, Zehua;  Wei, Qinglai;  Song, Ruizhuo
浏览  |  Adobe PDF(199Kb)  |  收藏  |  浏览/下载:274/75  |  提交时间:2020/06/26
Parallel adaptive critic design  Adaptive dynamic programming  Particle swarm optimization  Ice-storage air conditioning  
Curiosity-Driven Exploration for Off-Policy Reinforcement Learning Methods 会议论文
, Dali, China, 2019.12.06-2019.12.08
作者:  Li, Boyao;  Lu, Tao;  Li, Jiayi;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
浏览  |  Adobe PDF(2877Kb)  |  收藏  |  浏览/下载:191/68  |  提交时间:2020/08/27
Conservative Policy Gradient in Multi-critic Setting 会议论文
, Hangzhou, China, 2019.11.22-24
作者:  Xi, Bao;  Wang, Rui;  Wang, Shuo;  Lu, Tao;  Cai, Yinghao
浏览  |  Adobe PDF(379Kb)  |  收藏  |  浏览/下载:185/63  |  提交时间:2021/02/02
inconsistancy  stablility  Q learning  policy gradient  
Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 卷号: 28, 期号: 10, 页码: 5105-5120
作者:  Li, Debang;  Wu, Huikai;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(6588Kb)  |  收藏  |  浏览/下载:367/41  |  提交时间:2019/12/16
Reinforcement learning  adversarial learning  image cropping  
Adaptive Tracking Control of Surface Vessel Using Optimized Backstepping Technique 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2019, 卷号: 49, 期号: 9, 页码: 3420-3431
作者:  Wen, Guoxing;  Ge, Shuzhi Sam;  Chen, C. L. Philip;  Tu, Fangwen;  Wang, Shengnan
收藏  |  浏览/下载:171/0  |  提交时间:2019/12/16
Actor-critic architecture  Lyapunov stability  optimized backstepping (OB)  reinforcement learning (RL)  surface vessel  
Optimized Adaptive Nonlinear Tracking Control Using Actor-Critic Reinforcement Learning Strategy 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2019, 卷号: 15, 期号: 9, 页码: 4969-4977
作者:  Wen, Guoxing;  Chen, C. L. Philip;  Ge, Shuzhi Sam;  Yang, Hongli;  Liu, Xiaoguang
收藏  |  浏览/下载:208/0  |  提交时间:2019/12/16
Lyapunov function  neural networks (NNs)  nonlinear systems  optimized tracking control  reinforcement learning (RL) of actor-critic architecture  
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming 会议论文
, Budapest, Hungary, 2019-7-14
作者:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF(679Kb)  |  收藏  |  浏览/下载:55/28  |  提交时间:2023/05/22