CASIA OpenIR

Browse/Search results: 42 items in total, showing items 1-10

User Response Modeling in Reinforcement Learning for Ads Allocation  Conference paper
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 9/4  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Centralized Cooperative Exploration Policy for Continuous Control Tasks  Conference paper
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
Authors:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  Views/Downloads: 29/9  |  Submitted: 2024/05/30
continuous control tasks  cooperative exploration  
Explainable Reinforcement Learning via a Causal World Model  Conference paper
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, Macau, China, 2023-08-22
Authors:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  Views/Downloads: 22/10  |  Submitted: 2024/05/28
Reinforcement Learning  Explainable Artificial Intelligence  Causal Reasoning  
Learning Transformer-based Cooperation for Networked Traffic Signal Control  Conference paper
Macau, China, 2022-10
Authors:  Zhao, Chen;  Dai, Xingyuan;  Wang, Xiao;  Li, Lingxi;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1431Kb)  |  Views/Downloads: 21/9  |  Submitted: 2024/05/28
ReasonChainQA: Text-based Complex Question Answering with Explainable Evidence Chains  Conference paper
Xiamen, China, 2022
Authors:  Zhu MJ(朱敏郡);  Weng YX(翁诣轩);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(456Kb)  |  Views/Downloads: 110/30  |  Submitted: 2023/06/29
Deep Behavioral Cloning for Traffic Control with Virtual Expert Demonstration Under a Parallel Learning Framework  Conference paper
Beijing, 2020-12
Authors:  Li Xiaoshuang;  Zhu Fenghua;  Wang Fei-Yue
Adobe PDF(770Kb)  |  Views/Downloads: 188/78  |  Submitted: 2022/06/16
Open-book Video Captioning with Retrieve-Copy-Generate Network  Conference paper
2021, Online, 2021.6.19-25
Authors:  Zhang, Ziqi;  Qi, Zhongang;  Yuan, Chunfeng;  Shan, Ying;  Li, Bing;  Deng, Ying;  Hu, Weiming
Adobe PDF(1925Kb)  |  Views/Downloads: 243/62  |  Submitted: 2022/06/16
HGCNet: Deep Anthropomorphic Hand Grasping in Clutter  Conference paper
Online and in person (Philadelphia, USA), 2022-5
Authors:  Li YM(李一鸣)
Adobe PDF(5779Kb)  |  Views/Downloads: 191/37  |  Submitted: 2022/06/16
DIMSAN: Fast Exploration with the Synergy between Density-based Intrinsic Motivation and Self-adaptive Action Noise  Conference paper
Xi'an, 2021.5.30-2021.6.5
Authors:  Li, Jiayi;  Li, Boyao;  Lu, Tao;  Lu, Ning;  Cai, Yinghao;  Wang, Shuo
Adobe PDF(5599Kb)  |  Views/Downloads: 197/37  |  Submitted: 2022/06/14
Learning Smooth and Omnidirectional Locomotion for Quadruped Robots  Conference paper
Chongqing, China, 2021-7
Authors:  Wu, Jiaxi;  Wang, Chen'an;  Zhang, Dianmin;  Zhong, Shanlin;  Wang, Boxing;  Qiao, Hong
Adobe PDF(1436Kb)  |  Views/Downloads: 203/53  |  Submitted: 2022/06/14
Quadruped Robot  Reinforcement Learning