CASIA OpenIR

Browse/Search results: 48 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
Australia, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF (7948Kb)  |  Views/Downloads: 45/17  |  Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning
User Response Modeling in Reinforcement Learning for Ads Allocation (Conference paper)
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF (2077Kb)  |  Views/Downloads: 52/21  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog (Conference paper)
Beijing, China, 2018-8-2
Authors:  Yang Minghao;  Zhang Ke;  NaShengRuoYang;  Tao Jianhua
Adobe PDF (540Kb)  |  Views/Downloads: 66/19  |  Submitted: 2024/06/24
Bridging the Gap between Different Vocabularies for LLM Ensemble (Conference paper)
Mexico City, Mexico, June 16–21, 2024
Authors:  徐杨一帆;  Lu JL (陆金梁);  Zhang JJ (张家俊)
Adobe PDF (1982Kb)  |  Views/Downloads: 79/26  |  Submitted: 2024/06/13
Zero-shot Object Goal Visual Navigation (Conference paper)
London, UK, May 29 - June 2, 2023
Authors:  Qianfan Zhao;  Lu Zhang;  Bin He;  Hong Qiao;  Zhiyong Liu
Adobe PDF (2100Kb)  |  Views/Downloads: 57/26  |  Submitted: 2024/06/06
GraphMLLM: A Graph-based Multi-level Layout Language-independent Model for Document Understanding (Conference paper)
Athens, Greece, 2024-09
Authors:  He-Sen Dai;  Xiao-Hui Li;  Fei Yin;  Xudong Yan;  Shuqi Mei;  Cheng-Lin Liu
Adobe PDF (967Kb)  |  Views/Downloads: 72/19  |  Submitted: 2024/06/05
Visual information extraction  Self-supervised pre-training  Multi-level page layouts  
Driving Control with Deep and Reinforcement Learning in The Open Racing Car Simulator (Conference paper)
Siem Reap, Cambodia, December 13-16, 2018
Authors:  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF (697Kb)  |  Views/Downloads: 46/20  |  Submitted: 2024/06/05
Centralized Cooperative Exploration Policy for Continuous Control Tasks (Conference paper)
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
Authors:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF (2175Kb)  |  Views/Downloads: 62/21  |  Submitted: 2024/05/30
continuous control tasks  cooperative exploration  
Learning Realistic and Reasonable Grasps for Anthropomorphic Hand in Cluttered Scenes (Conference paper)
Yokohama, Japan, 2024-5-13
Authors:  Duan, Haonan;  Li, Yiming;  Li, Daheng;  Wei, Wei;  Huang, Yayu;  Wang, Peng
Adobe PDF (4993Kb)  |  Views/Downloads: 83/36  |  Submitted: 2024/05/29
Robotic grasping  Anthropomorphic hand  Affordance  
Explainable Reinforcement Learning via a Causal World Model (Conference paper)
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, Macao, China, 2023-08-22
Authors:  Yu ZY (余忠蔚);  Ruan JQ (阮景晴);  Xing DP (邢登鹏)
Adobe PDF (850Kb)  |  Views/Downloads: 64/29  |  Submitted: 2024/05/28
Reinforcement learning  Explainable artificial intelligence  Causal inference