CASIA OpenIR

浏览/检索结果: 共70条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:45/17  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:52/21  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog 会议论文
, Beijing, China, 2018-8-2
作者:  Yang Minghao;  Zhang Ke;  NaShengRuoYang;  Tao Jianhua
Adobe PDF(540Kb)  |  收藏  |  浏览/下载:66/19  |  提交时间:2024/06/24
Bridging the Gap between Different Vocabularies for LLM Ensemble 会议论文
, Mexico City, Mexico, June 16–21, 2024
作者:  徐杨一帆;  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(1982Kb)  |  收藏  |  浏览/下载:79/26  |  提交时间:2024/06/13
TIM: An Efficient Temporal Interaction Module for Spiking Transformer 会议论文
, Jeju, korea, 2024-08
作者:  Shen, Sicheng;  Zhao, Dongcheng;  Shen, Guobin;  Zeng, Yi
Adobe PDF(717Kb)  |  收藏  |  浏览/下载:43/8  |  提交时间:2024/06/06
Bullying10K: A Large-Scale Neuromorphic Dataset towards Privacy-Preserving Bullying Recognition 会议论文
, New Orleans, 2023-12-14
作者:  Dong, Yiting;  Li, Yang;  Zhao, Dongcheng;  Shen, Guobin;  Zeng, Yi
Adobe PDF(11215Kb)  |  收藏  |  浏览/下载:39/8  |  提交时间:2024/06/05
Centralized Cooperative Exploration Policy for Continuous Control Tasks 会议论文
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  收藏  |  浏览/下载:62/21  |  提交时间:2024/05/30
continuous control tasks  cooperative exploration  
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:64/29  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理  
Content Based Deep Learning Image Retrieval: A Survey 会议论文
, Lingshui, China, 2023-12-14
作者:  Chi, Zhang;  JIe, Liu
Adobe PDF(504Kb)  |  收藏  |  浏览/下载:53/13  |  提交时间:2024/05/28
Content Based Image Retrieval  Deep Learning  Convolution Neural Network  
Learning Transformer-based Cooperation for Networked Traffic Signal Control 会议论文
, Macau, China, 2022-10
作者:  Zhao, Chen;  Dai, Xingyuan;  Wang, Xiao;  Li, Lingxi;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:59/20  |  提交时间:2024/05/28