CASIA OpenIR

浏览/检索结果: 共212条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Learning to Correct Erroneous Words for Document Grounded Conversations 会议论文
, Kuantan, Malaysia, 2023.02.23-2023.02.25
作者:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(773Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/06/17
Deep Learning  Natural Language Generation  Dialogue System  Curriculum Learning  
Bridging the Gap between Different Vocabularies for LLM Ensemble 会议论文
, Mexico City, Mexico, June 16–21, 2024
作者:  徐杨一帆;  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF(1982Kb)  |  收藏  |  浏览/下载:25/5  |  提交时间:2024/06/13
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:8/3  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Interpretable Autonomous Driving Model Based on Cognitive Reinforcement Learning 会议论文
, Jeju, Korea, Jun. 02-05, 2024
作者:  Yijia Li;  Hao Qi;  Fenghua Zhu;  Yisheng Lv;  Peijun Ye
Adobe PDF(87Kb)  |  收藏  |  浏览/下载:14/6  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:31/10  |  提交时间:2024/06/05
Learning Heterogeneous Agent Cooperation via Multiagent League Training 期刊论文
IFAC World Congress, 2023, 页码: IFAC PapersOnLine 56-2 (2023) 3033-3040
作者:  Qingxu, Fu;  Xiaolin Ai;  Jianqiang Yi;  Tenghai Qiu;  Wanmai Yuan;  Zhiqiang Pu
Adobe PDF(996Kb)  |  收藏  |  浏览/下载:17/4  |  提交时间:2024/06/05
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文
, 厦门国际会议中心, 2023-10-13
作者:  Chen ZP(陈忠鹏);  Guan Q(关强)
Adobe PDF(2260Kb)  |  收藏  |  浏览/下载:18/5  |  提交时间:2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation  
A Reinforcement Learning Benchmark for Autonomous Driving in Intersection Scenarios 会议论文
, Orlando, FL, USA, 2022-1-24
作者:  Liu, Yuqi;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(1537Kb)  |  收藏  |  浏览/下载:11/8  |  提交时间:2024/06/03
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:22/9  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Parallel Spiking Unit for Efficient Training of Spiking Neural Networks 会议论文
, YOKOHAMA, 30 June - 5 July 2024
作者:  Yang Li;  Yinqian Sun;  Xiang He;  Yiting Dong;  Dongcheng Zhao;  Yi Zeng
Adobe PDF(959Kb)  |  收藏  |  浏览/下载:22/5  |  提交时间:2024/05/31