CASIA OpenIR

Browse/Search Results: 17 items in total, showing items 1-10

User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
, Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 10/5  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning (Conference paper)
Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8
Authors:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  Views/Downloads: 39/12  |  Submitted: 2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning (Conference paper)
, Auckland, New Zealand, May 9-13, 2022
Authors:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  Views/Downloads: 25/6  |  Submitted: 2024/05/28
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning (Conference paper)
, Online, 2022-02-22
Authors:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  Views/Downloads: 192/71  |  Submitted: 2023/06/29
Exploring Motion Information for Distractor Suppression in Visual Tracking (Conference paper)
2022, New Orleans, United States, 2022.6.19
Authors:  Liu, Kaiwen;  Gao, Jin;  Liu, Haowei;  Li, Liang;  Li, Bing;  Hu, Weiming
Adobe PDF(1747Kb)  |  Views/Downloads: 282/55  |  Submitted: 2022/06/20
Bootstrap Estimated Uncertainty of the Environment Model for Model-Based Reinforcement Learning (Conference paper)
, Honolulu, Hawaii, USA, 2019-1
Authors:  Huang, Wenzhen;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(5079Kb)  |  Views/Downloads: 146/48  |  Submitted: 2022/01/11
Scene Text Detection with Recurrent Instance Segmentation (Conference paper)
, Beijing, 2018.8.20
Authors:  Wei Feng;  Wen-Hao He;  Fei Yin;  Cheng-Lin Liu
Adobe PDF(1336Kb)  |  Views/Downloads: 211/70  |  Submitted: 2021/07/06
Conservative Policy Gradient in Multi-critic Setting (Conference paper)
, Hangzhou, China, 2019.11.22-24
Authors:  Xi, Bao;  Wang, Rui;  Wang, Shuo;  Lu, Tao;  Cai, Yinghao
Adobe PDF(379Kb)  |  Views/Downloads: 226/77  |  Submitted: 2021/02/02
inconsistency  stability  Q-learning  policy gradient  
Addressing Reward Engineering for Deep Reinforcement Learning on Multi-stage Task (Conference paper)
, Australia, 2019-12
Authors:  Chen, Bin;  Su, Jianhua
Adobe PDF(1169Kb)  |  Views/Downloads: 335/87  |  Submitted: 2020/06/08
Sequence-to-Sequence Domain Adaptation Network for Robust Text Image Recognition (Conference paper)
, Long Beach, CA, 2019.06.16-2019.06.20
Authors:  Zhang, Yaping;  Nie, Shuai;  Liu, Wenju;  Xu, Xing;  Zhang, Dongxiang;  Shen, Hengtao
Adobe PDF(718Kb)  |  Views/Downloads: 312/108  |  Submitted: 2020/05/15
Domain Adaptation  Text Image Recognition