CASIA OpenIR

浏览/检索结果: 共35条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Self-Modifying State Modeling for Simultaneous Machine Translation 会议论文
, Bangkok, Thailand, August 11–16, 2024
作者:  Donglei, Yu;  Xiaomian, Kang;  Yuchen, Liu;  YU, Zhou;  Chengqing, Zong
Adobe PDF(924Kb)  |  收藏  |  浏览/下载:10/6  |  提交时间:2024/06/20
Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文
Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104
作者:  Yang YM(杨依明);  Wang ZC(王泽昌);  Xing DP(邢登鹏);  Wang P(王鹏)
Adobe PDF(3500Kb)  |  收藏  |  浏览/下载:16/7  |  提交时间:2024/05/30
Spiking Adaptive Dynamic Programming with Poisson Process 会议论文
, 中国山东省青岛市, 2021-07-18
作者:  Wei QL(魏庆来);  Han LY(韩立元);  Zhang TL(张铁林)
Adobe PDF(2334Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/05/28
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:24/10  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理  
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Suzhou, China, May 14-16, 2021
作者:  Ma, Ruichen;  Wang, Yu;  Wang, Rui;  Wang, Shuo
Adobe PDF(855Kb)  |  收藏  |  浏览/下载:102/35  |  提交时间:2023/08/02
Omnidirectional Drift Control  Undulating Fin  Underwater Biomimetic Vehicle-manipulator System (UBVMS)  Reinforcement Learning  Twin Delayed Deep Deterministic policy gradient (TD3)  
Robust Graph Neural Networks Against Adversarial Attacks via Jointly Adversarial Training 会议论文
, 上海, 2020-12-3
作者:  Tian Hu;  Ye Bowei;  Zheng Xiaolong;  Zhang Xingwei;  Wu Dash Desheng
Adobe PDF(443Kb)  |  收藏  |  浏览/下载:152/50  |  提交时间:2023/07/04
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:122/34  |  提交时间:2023/06/27
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文
Machine Intelligence Research, 2023, 页码: 1-12
作者:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:192/69  |  提交时间:2023/06/26
DDRL: A Decentralized Deep Reinforcement Learning Method for Vehicle Repositioning 会议论文
, Indianapolis, IN, USA, 19-22 September 2021
作者:  Jinhao Xi;  Fenghua Zhu;  Yuanyuan Chen;  Yisheng Lv;  Chang Tan;  Feiyue Wang
Adobe PDF(1652Kb)  |  收藏  |  浏览/下载:126/23  |  提交时间:2023/06/26