CASIA OpenIR

Browse/Search results: 28 items in total, showing items 1-10

Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 1-16
Authors:  Liu, Yuqi;  Zhang, Qichao;  Gao, Yinfeng;  Zhao, Dongbin
Adobe PDF(22863Kb)  |  Views/Downloads: 11/4  |  Submitted: 2024/06/03
Reinforcement Learning  Autonomous Driving  Intersection Navigating  
A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios (Conference paper)
, Orlando, FL, USA, 2022-1-24
Authors:  Liu, Yuqi;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1537Kb)  |  Views/Downloads: 13/9  |  Submitted: 2024/06/03
Multi-task safe reinforcement learning for navigating intersections in dense traffic (Journal article)
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, Volume: 360, Issue: 17, Pages: 13737-13760
Authors:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  Views/Downloads: 55/6  |  Submitted: 2024/02/22
Decision-Making Method for Overtaking and Lane Changing Based on Deep Reinforcement Learning [基于深度强化学习的超车换道决策方法] (Thesis)
, 2023
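Author:  Wang, Junjie (王俊杰)
Adobe PDF(17475Kb)  |  Views/Downloads: 178/3  |  Submitted: 2023/06/26
Deep reinforcement learning  Autonomous driving  Lane-change decision-making  Model-based value expansion  Dynamics generalization  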
Dynamic-horizon model-based value estimation with latent imagination (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2022, Pages: 1-14
Authors:  Wang JJ (王俊杰);  Zhang QC (张启超);  Zhao DB (赵冬斌)
Adobe PDF(2305Kb)  |  Views/Downloads: 174/63  |  Submitted: 2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  
Mixed-Supervised Scene Text Detection With Expectation-Maximization Algorithm (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 5513-5528
Authors:  Zhao, Mengbiao;  Feng, Wei;  Yin, Fei;  Zhang, Xu-Yao;  Liu, Cheng-Lin
Adobe PDF(5999Kb)  |  Views/Downloads: 393/38  |  Submitted: 2022/09/19
Costs  Annotations  Training  Labeling  Detectors  Data models  Benchmark testing  Mixed-supervised learning  scene text detection  weak supervision forms  expectation-maximization algorithm  
A New Approach to Finite-Horizon Optimal Control for Discrete-Time Affine Nonlinear Systems via a Pseudolinear Method (Journal article)
IEEE TRANSACTIONS ON AUTOMATIC CONTROL, 2022, Volume: 67, Issue: 5, Pages: 2610-2617
Authors:  Wei, Qinglai;  Zhu, Liao;  Li, Tao;  Liu, Derong
Adobe PDF(984Kb)  |  Views/Downloads: 234/4  |  Submitted: 2022/07/25
Time-varying systems  Nonlinear systems  Optimal control  Heuristic algorithms  Dynamic programming  Neural networks  Linear systems  Adaptive dynamic programming  approximate dynamic programming  finite horizon  nonlinear systems  optimal control  pseudolinear approximation  
Object Relational Graph with Teacher-Recommended Learning for Video Captioning (Conference paper)
2020, Online, 2020.6.14-19
Authors:  Zhang, Ziqi;  Shi, Yaya;  Yuan, Chunfeng;  Li, Bing;  Wang, Peijin;  Hu, Weiming;  Zha, Zhengjun
Adobe PDF(1547Kb)  |  Views/Downloads: 204/73  |  Submitted: 2022/06/16
Towards Corruption-Agnostic Robust Domain Adaptation (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2022, Volume: 18, Issue: 4, Pages: 16
Authors:  Xu, Yifan;  Sheng, Kekai;  Dong, Weiming;  Wu, Baoyuan;  Xu, Changsheng;  Hu, Bao-Gang
Adobe PDF(2116Kb)  |  Views/Downloads: 438/94  |  Submitted: 2022/06/10
Domain adaptation  corruption robustness  transfer learning  
Supervised assisted deep reinforcement learning for emergency voltage control of power systems (Journal article)
NEUROCOMPUTING, 2022, Volume: 475, Pages: 69-79
Authors:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Dai, Yuxin;  Yu, Zhihong;  Zhang, Jun Jason;  Bu, Guangquan;  Wang, Fei-Yue
Adobe PDF(2551Kb)  |  Views/Downloads: 333/67  |  Submitted: 2022/06/06
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Emergency control