CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文
, Chongqing, China, 2023-11
作者:  Shen Liancheng;  Su Jianhua;  Zhang Xiaodong
Adobe PDF(254Kb)  |  收藏  |  浏览/下载:38/20  |  提交时间:2024/06/24
—Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
基于视觉表征的深度强化学习方法 学位论文
, 2024
作者:  刘民颂
Adobe PDF(10778Kb)  |  收藏  |  浏览/下载:46/4  |  提交时间:2024/06/22
深度强化学习,视觉表征学习,自监督学习,状态抽象,Transformer神经网络  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:45/16  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning