CASIA OpenIR

浏览/检索结果: 共47条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Self-Modifying State Modeling for Simultaneous Machine Translation 会议论文
, Bangkok, Thailand, August 11–16, 2024
作者:  Donglei, Yu;  Xiaomian, Kang;  Yuchen, Liu;  YU, Zhou;  Chengqing, Zong
Adobe PDF(924Kb)  |  收藏  |  浏览/下载:4/2  |  提交时间:2024/06/20
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:8/3  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Interpretable Autonomous Driving Model Based on Cognitive Reinforcement Learning 会议论文
, Jeju, Korea, Jun. 02-05, 2024
作者:  Yijia Li;  Hao Qi;  Fenghua Zhu;  Yisheng Lv;  Peijun Ye
Adobe PDF(87Kb)  |  收藏  |  浏览/下载:15/7  |  提交时间:2024/06/06
Traffic Signal Control Based on Reinforcement Learning and Fuzzy Neural Network 会议论文
, Macau, China, October 8-12, 2022
作者:  Zhao, Hongxia;  Chen, Songhang;  Zhu, Fenghua;  Tang, Haina
Adobe PDF(565Kb)  |  收藏  |  浏览/下载:14/8  |  提交时间:2024/06/03
Centralized Cooperative Exploration Policy for Continuous Control Tasks 会议论文
Proceedings of the 2023 International Conference on Autonomous Agents and Multiagent Systems, London, United Kingdom, May 29–June 2, 2023
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou;  Yu Liu
Adobe PDF(2175Kb)  |  收藏  |  浏览/下载:20/6  |  提交时间:2024/05/30
continuous control tasks  cooperative exploration  
Learning Playing Piano with Bionic-Constrained Diffusion Policy for Anthropomorphic Hand 期刊论文
Cyborg and Bionic Systems, 2024, 卷号: 5, 页码: 0104
作者:  Yang YM(杨依明);  Wang ZC(王泽昌);  Xing DP(邢登鹏);  Wang P(王鹏)
Adobe PDF(3500Kb)  |  收藏  |  浏览/下载:10/5  |  提交时间:2024/05/30
Explainable Reinforcement Learning via a Causal World Model 会议论文
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22
作者:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF(850Kb)  |  收藏  |  浏览/下载:17/7  |  提交时间:2024/05/28
强化学习  可解释人工智能  因果推理  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文
, 长沙, 2023-11
作者:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:12/7  |  提交时间:2024/05/28
Learning Transformer-based Cooperation for Networked Traffic Signal Control 会议论文
, Macau, China, 2022-10
作者:  Zhao, Chen;  Dai, Xingyuan;  Wang, Xiao;  Li, Lingxi;  Lv, Yisheng;  Wang, Fei-Yue
Adobe PDF(1431Kb)  |  收藏  |  浏览/下载:17/7  |  提交时间:2024/05/28
Stable Training of Bellman Error in Reinforcement Learning 会议论文
, Thailand, November 18–22
作者:  Gong C(龚晨);  Bai YP(白云鹏);  Hou XW(侯新文);  Ji XH(季晓慧)
Adobe PDF(2416Kb)  |  收藏  |  浏览/下载:118/33  |  提交时间:2023/06/27