CASIA OpenIR

Browse/Search Results: 7 items in total, showing items 1-7

A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios [Conference paper]
Orlando, FL, USA, 2022-1-24
Authors:  Liu, Yuqi;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1537Kb)  |  Views/Downloads: 19/11  |  Submitted: 2024/06/03
Dynamic-horizon model-based value estimation with latent imagination [Journal article]
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14
Authors:  Wang JJ (王俊杰);  Zhang QC (张启超);  Zhao DB (赵冬斌)
Adobe PDF(2305Kb)  |  Views/Downloads: 176/64  |  Submitted: 2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 5513-5528
Authors:  Zhao, Mengbiao;  Feng, Wei;  Yin, Fei;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(5999Kb)  |  Views/Downloads: 408/39  |  Submitted: 2022/09/19
Costs  Annotations  Training  Labeling  Detectors  Data models  Benchmark testing  Mixed-supervised learning  scene text detection  weak supervision forms  expectation-maximization algorithm  
A New Approach to Finite-Horizon Optimal Control for Discrete-Time Affine Nonlinear Systems via a Pseudolinear Method [Journal article]
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, Volume: 67, Issue: 5, Pages: 2610-2617
Authors:  Wei, Qinglai;  Zhu, Liao;  Li, Tao;  Liu, Derong
Adobe PDF(984Kb)  |  Views/Downloads: 240/7  |  Submitted: 2022/07/25
Time-varying systems  Nonlinear systems  Optimal control  Heuristic algorithms  Dynamic programming  Neural networks  Linear systems  Adaptive dynamic programming  approximate dynamic programming  finite horizon  nonlinear systems  optimal control  pseudolinear approximation  
Towards Corruption-Agnostic Robust Domain Adaptation [Journal article]
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, Volume: 18, Issue: 4, Pages: 16
Authors:  Xu, Yifan;  Sheng, Kekai;  Dong, Weiming;  Wu, Baoyuan;  Xu, Changsheng;  Hu, Bao-Gang
Adobe PDF(2116Kb)  |  Views/Downloads: 447/97  |  Submitted: 2022/06/10
Domain adaptation  corruption robustness  transfer learning  
Supervised assisted deep reinforcement learning for emergency voltage control of power systems [Journal article]
NEUROCOMPUTING, 2022, Volume: 475, Pages: 69-79
Authors:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Dai, Yuxin;  Yu, Zhihong;  Zhang, Jun Jason;  Bu, Guangquan;  Wang, Fei-Yue
Adobe PDF(2551Kb)  |  Views/Downloads: 339/69  |  Submitted: 2022/06/06
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Emergency control  
SADRL: Merging human experience with machine intelligence via supervised assisted deep reinforcement learning [Journal article]
NEUROCOMPUTING, 2022, Volume: 467, Pages: 300-309
Authors:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Jin, Junchen;  Huang, Yanhao;  Zhang, Jun Jason;  Wang, Fei-Yue
Adobe PDF(1244Kb)  |  Views/Downloads: 318/72  |  Submitted: 2021/12/28
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Double DQN