CASIA OpenIR

浏览/检索结果: 共51条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks 会议论文
, Queensland, Australia, 2023-6
作者:  Hu GZ(胡光政);  Li HR(李浩然);  Liu SS(刘莎莎);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(2785Kb)  |  收藏  |  浏览/下载:33/9  |  提交时间:2024/07/04
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:21/9  |  提交时间:2024/06/25
基于强化学习与安全约束的自动驾驶决策方法 期刊论文
交通运输研究, 2023, 卷号: 9, 期号: 1, 页码: 31-39
作者:  王宇霄;  刘敬玉;  李忠飞;  朱凤华
Adobe PDF(2613Kb)  |  收藏  |  浏览/下载:52/25  |  提交时间:2024/06/11
深度强化学习  自动驾驶  决策  安全约束  
BrainCog: A spiking neural network based, braininspired cognitive intelligence engine for braininspired AI and brain simulation 期刊论文
Patterns, 2023, 页码: 100789
作者:  Zeng, Yi;  Zhao, Dongcheng;  Zhao, Feifei;  Shen, Guobin;  Dong, Yiting;  Lu, Enmeng;  Zhang, Qian;  Sun, Yinqian;  Liang, Qian;  Zhao, Yuxuan;  Zhao, Zhuoya;  Fang, Hongjian;  Wang, Yuwei;  Li, Yang;  Liu, Xin;  Du, Chengcheng;  Kong, Qingqun;  Zizhe, Ruan;  Weida Bi
Adobe PDF(6608Kb)  |  收藏  |  浏览/下载:39/8  |  提交时间:2024/06/06
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文
, 厦门国际会议中心, 2023-10-13
作者:  Chen ZP(陈忠鹏);  Guan Q(关强)
Adobe PDF(2260Kb)  |  收藏  |  浏览/下载:39/12  |  提交时间:2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation  
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning 会议论文
Advanced Intelligent Computing Technology and Applications, 中国郑州, 2023-8
作者:  He SQ(何少钦);  Gao Y(高阳);  Zhang BF(张保丰);  Chang H(常惠);  Zhang XC(张鑫辰)
Adobe PDF(1496Kb)  |  收藏  |  浏览/下载:62/21  |  提交时间:2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Keep Various Trajectories: Promoting Exploration of Ensemble Policies in Continuous Control 会议论文
Advances in Neural Information Processing Systems, New Orleans, USA, 2023-12-10
作者:  Chao Li;  Chen Gong;  Qiang He;  Xinwen Hou
Adobe PDF(1457Kb)  |  收藏  |  浏览/下载:42/12  |  提交时间:2024/05/30
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:54/19  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
A brain-inspired theory of mind spiking neural network improves multi-agent cooperation and competition 期刊论文
Patterns, 2023, 页码: 8
作者:  Zhao,Zhuoya;  Zhao,Feifei;  Zhao,Yuxuan;  Sun,Yinqian;  Zeng,Yi
Adobe PDF(4502Kb)  |  收藏  |  浏览/下载:61/16  |  提交时间:2024/05/28
不确定工业过程运行指标异步更新强化学习决策算法 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 2, 页码: 461-472
作者:  李金娜;  袁林;  丁进良
Adobe PDF(1941Kb)  |  收藏  |  浏览/下载:63/26  |  提交时间:2024/05/09
运行优化控制  强化学习  数据驱动控制  自适应动态规划  安全运行