CASIA OpenIR

Browse/Search results: 5 items in total, showing items 1-5

Monte Carlo-based reinforcement learning control for unmanned aerial vehicle systems (Journal article)
NEUROCOMPUTING, 2022, Volume: 507, Pages: 282-291
Authors:  Wei, Qinglai;  Yang, Zesheng;  Su, Huaizhong;  Wang, Lijian
Views/Downloads: 240/0  |  Submitted: 2022/09/19
Reinforcement learning  Adaptive dynamic programming (ADP)  UAV control  Monte Carlo simulation  Neural networks  
HMDRL: Hierarchical Mixed Deep Reinforcement Learning to Balance Vehicle Supply and Demand (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, Volume: 23, Issue: 11, Pages: 21861-21872
Authors:  Xi, Jinhao;  Zhu, Fenghua;  Ye, Peijun;  Lv, Yisheng;  Tang, Haina;  Wang, Fei-Yue
Adobe PDF(3316Kb)  |  Views/Downloads: 305/40  |  Submitted: 2022/09/19
deep reinforcement learning  online ride-hailing system  hierarchical repositioning framework  parallel coordination mechanism  mixed state  
Multi-modal spatio-temporal meteorological forecasting with deep neural network (Journal article)
ISPRS Journal of Photogrammetry and Remote Sensing, 2022, Pages: 14
Authors:  Xinbang Zhang;  Qizhao Jin;  Tingzhao Yu;  Shiming Xiang;  Qiuming Kuang;  Véronique Prinet;  Chunhong Pan
Adobe PDF(3735Kb)  |  Views/Downloads: 325/77  |  Submitted: 2022/07/01
Meteorological forecasting  Deep learning  Neural architecture search  AutoML  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  Views/Downloads: 239/8  |  Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Q network  zero sum  
On Iterative Proportional Updating: Limitations and Improvements for General Population Synthesis (Journal article)
IEEE Transactions on Cybernetics, 2022, Volume: 52, Issue: 3, Pages: 1726-1735
Authors:  Peijun Ye;  Bin Tian;  Yisheng Lv;  Qijie Li;  Fei-Yue Wang
Adobe PDF(1066Kb)  |  Views/Downloads: 247/52  |  Submitted: 2020/10/15
Agent-based simulation, bilevel optimization, iterative proportional updating (IPU), population synthesis