CASIA OpenIR

浏览/检索结果: 共5条,第1-5条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
MILP Models for Flexible Job Shop Scheduling with Spatial Constraints and Sequence Flexibility 会议论文
2024 IEEE 20th International Conference on Automation Science and Engineering, Bari,Italy, 2024年8月28
作者:  Han, Yunjun(韩云君);  Peng,Shaoming;  Shen, Zhen;  Tao,Zhikun;  Xiong, Gang
Adobe PDF(397Kb)  |  收藏  |  浏览/下载:30/7  |  提交时间:2024/06/11
Interpretable Autonomous Driving Model Based on Cognitive Reinforcement Learning 会议论文
, Jeju, Korea, Jun. 02-05, 2024
作者:  Yijia Li;  Hao Qi;  Fenghua Zhu;  Yisheng Lv;  Peijun Ye
Adobe PDF(87Kb)  |  收藏  |  浏览/下载:22/10  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:37/12  |  提交时间:2024/06/05
MOT: A Mixture of Actors Reinforcement Learning Method by Optimal Transport for Algorithmic Trading 会议论文
, 台湾台北, 20240507-20240510
作者:  Cheng X(程曦);  Zhang JH(张景昊);  Ceng YN(曾宇楠);  Xue WF(薛文芳)
Adobe PDF(739Kb)  |  收藏  |  浏览/下载:21/6  |  提交时间:2024/06/03