CASIA OpenIR

浏览/检索结果: 共156条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:45/17  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:52/21  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog 会议论文
, Beijing, China, 2018-8-2
作者:  Yang Minghao;  Zhang Ke;  NaShengRuoYang;  Tao Jianhua
Adobe PDF(540Kb)  |  收藏  |  浏览/下载:66/19  |  提交时间:2024/06/24
Credible Influence Analysis in Mass Media Using Causal Inference 会议论文
, San Antonio, TX, USA, 02-03 November 2021
作者:  Deng ZZ(邓紫臻);  Zheng XL(郑晓龙);  Cai Z(蔡震);  Ceng DJ(曾大军)
Adobe PDF(716Kb)  |  收藏  |  浏览/下载:34/14  |  提交时间:2024/06/21
Memory-based Error Label Suppression for Embodied Self-Improving Object Detection 会议论文
, 意大利巴里, 2024-8-28
作者:  Deng JR(邓杰仁);  Zhang HJ(张好剑);  Hu JH(胡建华);  Wang YK(王云宽)
Adobe PDF(2603Kb)  |  收藏  |  浏览/下载:63/24  |  提交时间:2024/06/20
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:57/12  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:58/22  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:49/16  |  提交时间:2024/06/11
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:80/26  |  提交时间:2024/06/05
A Cooperation Graph Approach for Multiagent Sparse Reward Reinforcement Learning 会议论文
, Padua, Italy, 2022年07月
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Wanmai Yuan
Adobe PDF(2650Kb)  |  收藏  |  浏览/下载:53/19  |  提交时间:2024/06/05