CASIA OpenIR
(本次检索基于用户作品认领结果)

浏览/检索结果: 共11条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:52/21  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios 会议论文
, Orlando, FL, USA, 2022-1-24
作者:  Liu, Yuqi;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1537Kb)  |  收藏  |  浏览/下载:50/24  |  提交时间:2024/06/03
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:117/48  |  提交时间:2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:259/15  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Heuristic rank selection with progressively searching tensor ring network 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 15
作者:  Li, Nannan;  Pan, Yu;  Chen, Yaran;  Ding, Zixiang;  Zhao, Dongbin;  Xu, Zenglin
Adobe PDF(1305Kb)  |  收藏  |  浏览/下载:324/61  |  提交时间:2021/04/27
Tensor ring networks  Rank selection  Progressively search  Image classification  
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, 卷号: 21, 期号: 11, 页码: 4516-4525
作者:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
Adobe PDF(1648Kb)  |  收藏  |  浏览/下载:199/25  |  提交时间:2021/01/06
Cooperative adaptive cruise control  string stability  time-delay system  H-infinity control  linear matrix inequality  
Model-free Optimal Control based Intelligent Cruise Control with Hardware-in-the-loop Demonstration 期刊论文
IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE, 2017, 卷号: 12, 期号: 2, 页码: 56-69
作者:  Zhao, Dongbin;  Xia, Zhongpu;  Zhang, Qichao
浏览  |  Adobe PDF(4525Kb)  |  收藏  |  浏览/下载:625/198  |  提交时间:2017/05/04
Intelligent Cruise Control  
Optimization of Periodic Optimal Cruise for a Hypersionic Vehicle 会议论文
Proceedings of Chinese Automation Congress, Changsha, 2013.11.7~11.8
作者:  Haitao Wang;  Dongbin Zhao;  Mingwei Sun
浏览  |  Adobe PDF(311Kb)  |  收藏  |  浏览/下载:316/92  |  提交时间:2016/06/15
Optimal Periodic Control  Hl-20 Model  Snopt  Imsl  Nonlinear Optimization  
DynaCAS: Computational Experiments and Decision Support for ITS 期刊论文
IEEE INTELLIGENT SYSTEMS, 2008, 卷号: 23, 期号: 6, 页码: 19-23
作者:  Zhang, Nan;  Wang, Fei-Yue;  Zhu, Fenghua;  Zhao, Dongbin;  Tang, Shuming
浏览  |  Adobe PDF(586Kb)  |  收藏  |  浏览/下载:359/89  |  提交时间:2015/11/08
Dynacas  Computational Experiments  Decision Support  Its  
DynaCAS: Computational Experiments and Decision Support for ITS 期刊论文
IEEE INTELLIGENT SYSTEMS, 2008, 卷号: 23, 期号: 6, 页码: 19-23
作者:  Zhang, Nan;  Wang, Fei-Yue;  Zhu, Fenghua;  Zhao, Dongbin;  Tang, Shuming
浏览  |  Adobe PDF(586Kb)  |  收藏  |  浏览/下载:311/63  |  提交时间:2015/11/08
Dynacas  Computational Experiments  Decision Support  Its