CASIA OpenIR

浏览/检索结果: 共150条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
V2X-BGN: Camera-based V2X-Collaborative 3D Object Detection with BEV Global Non-Maximum Suppression 会议论文
, Jeju Island, South Korea, June 2-5, 2024
作者:  Zhang Caiji;  Tian Bin;  Meng Shi;  Qi Shuangying;  Sun Yang;  Ai Yunfeng;  Chen Long
Adobe PDF(1659Kb)  |  收藏  |  浏览/下载:26/9  |  提交时间:2024/06/25
V2X  
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:42/18  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Self-Talk Responses to Users' Opinions and Challenge in Human Computer Dialog 会议论文
, Beijing, China, 2018-8-2
作者:  Yang Minghao;  Zhang Ke;  NaShengRuoYang;  Tao Jianhua
Adobe PDF(540Kb)  |  收藏  |  浏览/下载:58/16  |  提交时间:2024/06/24
Learning to Correct Erroneous Words for Document Grounded Conversations 会议论文
, Kuantan, Malaysia, 2023.02.23-2023.02.25
作者:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  收藏  |  浏览/下载:46/19  |  提交时间:2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:50/11  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:50/18  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/11
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:71/24  |  提交时间:2024/06/05
A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios 会议论文
, Orlando, FL, USA, 2022-1-24
作者:  Liu, Yuqi;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1537Kb)  |  收藏  |  浏览/下载:44/23  |  提交时间:2024/06/03