CASIA OpenIR

浏览/检索结果: 共217条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:33/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:32/15  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog 会议论文
, Beijing, China, 2018-8-2
作者:  Yang Minghao;  Zhang Ke;  NaShengRuoYang;  Tao Jianhua
Adobe PDF(540Kb)  |  收藏  |  浏览/下载:49/12  |  提交时间:2024/06/24
Credible Influence Analysis in Mass Media Using Causal Inference 会议论文
, San Antonio, TX, USA, 02-03 November 2021
作者:  Deng ZZ(邓紫臻);  Zheng XL(郑晓龙);  Cai Z(蔡震);  Ceng DJ(曾大军)
Adobe PDF(716Kb)  |  收藏  |  浏览/下载:22/9  |  提交时间:2024/06/21
Heterogeneous Avatar Synthesis Based on Disentanglement of Topology and Rendering 会议论文
, macao, 2022-12-4
作者:  Gao Nan;  Zhi Zeng;  GuiXuan Zhang;  ShuWu Zhang
Adobe PDF(11845Kb)  |  收藏  |  浏览/下载:41/15  |  提交时间:2024/06/20
face generation  
Learning to Correct Erroneous Words for Document Grounded Conversations 会议论文
, Kuantan, Malaysia, 2023.02.23-2023.02.25
作者:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  收藏  |  浏览/下载:41/18  |  提交时间:2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Bridging the Gap between Different Vocabularies for LLM Ensemble 会议论文
, Mexico City, Mexico, June 16–21, 2024
作者:  徐杨一帆;  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(1982Kb)  |  收藏  |  浏览/下载:61/20  |  提交时间:2024/06/13
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:44/11  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Interpretable Autonomous Driving Model Based on Cognitive Reinforcement Learning 会议论文
, Jeju, Korea, Jun. 02-05, 2024
作者:  Yijia Li;  Hao Qi;  Fenghua Zhu;  Yisheng Lv;  Peijun Ye
Adobe PDF(87Kb)  |  收藏  |  浏览/下载:44/21  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:65/22  |  提交时间:2024/06/05