CASIA OpenIR
(This search is based on the user's claimed works)

Browse/search results: 4 items in total, items 1-4

User Response Modeling in Reinforcement Learning for Ads Allocation (conference paper)
Singapore, May 13 - 17, 2024
Authors: Zhang, Zhiyuan; Zhang, Qichao; Wu, Xiaoxu; Shi, Xiaowen; Liao, Guogang; Wang, Yongkong; Wang, Xingxing; Zhao, Dongbin
Adobe PDF(2077Kb) | Views/Downloads: 46/20 | Submitted: 2024/06/25
Keywords: Ads Allocation; Reinforcement Learning; User Response Modeling
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming (conference paper)
Budapest, Hungary, 2019-7-14
Authors: Zhu YH (朱圆恒); Haibo He; Dongbin Zhao; Zhongsheng Hou
Adobe PDF(679Kb) | Views/Downloads: 75/37 | Submitted: 2023/05/22
A Review of Computational Intelligence for StarCraft AI (conference paper)
Bangalore, India, 18-21 Nov. 2018
Authors: Tang, Zhentao; Shao, Kun; Zhu, Yuanheng; Li, Dong; Zhao, Dongbin; Huang, Tingwen
Adobe PDF(131Kb) | Views/Downloads: 531/238 | Submitted: 2019/04/25
Online Reinforcement Learning by Bayesian Inference (conference paper)
Proceedings of International Joint Conference on Neural Networks 2015, Ireland, July 2015
Authors: Xia ZP (夏中谱); Dongbin Zhao
Adobe PDF(751Kb) | Views/Downloads: 320/99 | Submitted: 2016/06/15
Keywords: Reinforcement Learning; Bayesian Inference; Gaussian Processes