CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 15 items in total, showing items 1-10

User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF (2077 KB)  |  Views/Downloads: 52/21  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Enhancing Reinforcement Learning via Transformer-based State Predictive Representations (Journal article)
IEEE Transactions on Artificial Intelligence, 2024, Pages: 1 - 12
Authors:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Chen YR(陈亚冉);  Zhao DB(赵冬斌)
Adobe PDF (1162 KB)  |  Views/Downloads: 43/12  |  Submitted: 2024/06/24
A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat (Journal article)
IEEE Transactions on Systems, Man and Cybernetics: Systems, 2023, DOI: 10.1109/TSMC.2023.3270444
Authors:  Jiajun Chai;  Wenzhang Chen;  Yuanheng Zhu;  Zong-xin Yao;  Dongbin Zhao
Adobe PDF (9249 KB)  |  Views/Downloads: 299/130  |  Submitted: 2023/04/26
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE Transactions on Neural Networks and Learning Systems, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (2838 KB)  |  Views/Downloads: 260/16  |  Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Q network  zero sum  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target (Journal article)
Complex & Intelligent Systems, 2021, Pages: 12
Authors:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (1431 KB)  |  Views/Downloads: 332/63  |  Submitted: 2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
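Recent progress of deep reinforcement learning: from AlphaGo to AlphaGo Zero (Journal article)
Control Theory & Applications (控制理论与应用), 2017, Volume: 34, Issue: 12, Pages: 1529-1546
Authors:  Tang, Zhentao (唐振韬);  Shao, Kun (邵坤);  Zhao, Dongbin (赵冬斌);  Zhu, Yuanheng (朱圆恒)
Adobe PDF (8232 KB)  |  Views/Downloads: 306/53  |  Submitted: 2021/07/05
Deep reinforcement learning  AlphaGo Zero  Deep learning  Reinforcement learning  Artificial intelligence  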
Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition (Journal article)
IEEE Transactions on Games, 2023, Volume: 5, Issue: 1, Pages: 5 - 15
Authors:  Zhentao Tang;  Yuanheng Zhu;  Dongbin Zhao;  Simon M. Lucas
Adobe PDF (7686 KB)  |  Views/Downloads: 373/80  |  Submitted: 2021/07/05
Rolling horizon evolution  opponent model  reinforcement learning  supervised learning  fighting game  
Caffeic Acid Phenethyl Ester Effects: In Silico Study of its Osteoimmunological Mechanisms (Journal article)
Letters in Drug Design & Discovery, 2020, Volume: 17, Issue: 5, Pages: 556-562
Authors:  Zhao, Yuhao;  Pang, Xiaokun;  Nepal, Akriti;  Jiang, Xincan;  Xu, Xiaoxin;  Zhao, Dongbin;  Murtaza, Ghulam;  Ma, Yanxu
Views/Downloads: 281/0  |  Submitted: 2020/06/22
Drug target  drug-protein interaction  in silico prediction  osteoimmunological  PASS prediction  CAPE  
Reinforcement Learning and Deep Learning based Lateral Control for Autonomous Driving (Journal article)
IEEE Computational Intelligence Magazine, 2019, Volume: 14, Issue: 2, Pages: 83-98
Authors:  Dong Li;  Dongbin Zhao;  Qichao Zhang;  Yaran Chen
Adobe PDF (2205 KB)  |  Views/Downloads: 403/116  |  Submitted: 2019/04/25
Deep Learning  Autonomous Driving  Visual Control  Reinforcement Learning  
Visual navigation with Actor-Critic deep reinforcement learning (Conference paper)
Rio, Brazil, 2018-01
Authors:  Kun Shao;  Dongbin Zhao;  Yuanheng Zhu;  Qichao Zhang
Adobe PDF (1827 KB)  |  Views/Downloads: 350/143  |  Submitted: 2019/04/22