CASIA OpenIR

Browse/Search results: 45 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
, Australia, 2023-6
Authors: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo
Adobe PDF (7948Kb) | Views/Downloads: 17/7 | Submitted: 2024/06/25
reinforcement learning, hierarchical reinforcement learning
Digital Twin Driven Measurement in Robotic Flexible Printed Circuit Assembly (Journal article)
IEEE Transactions on Instrumentation & Measurement, 2023, Volume: 72, Pages: 5007812
Authors: Yang Minghao; Huang Zhenping; Sun Yangchang; Zhao Yongjia; Sun Ruize; Sun Qi; Chen JinLong; Qiang BaoHua; Wang JingHong; Sun FuChun
Adobe PDF (39985Kb) | Views/Downloads: 20/3 | Submitted: 2024/06/24
Zero-shot Object Goal Visual Navigation (Conference paper)
, London, UK, May 29 - June 2, 2023
Authors: Qianfan Zhao; Lu Zhang; Bin He; Hong Qiao; Zhiyong Liu
Adobe PDF (2100Kb) | Views/Downloads: 28/12 | Submitted: 2024/06/06
Learning Heterogeneous Agent Cooperation via Multiagent League Training (Journal article)
IFAC World Congress, 2023, Pages: IFAC PapersOnLine 56-2 (2023) 3033-3040
Authors: Qingxu, Fu; Xiaolin Ai; Jianqiang Yi; Tenghai Qiu; Wanmai Yuan; Zhiqiang Pu
Adobe PDF (996Kb) | Views/Downloads: 29/8 | Submitted: 2024/06/05
Parallel Population and Parallel Human: A Cyber-Physical Social Approach (Monograph)
Hoboken, NJ, USA: IEEE Press and John Wiley & Sons, Inc., 2023
Authors: Peijun Ye; Fei-Yue Wang
Adobe PDF (544Kb) | Views/Downloads: 45/12 | Submitted: 2024/05/31
Advancing Air Combat Tactics with Improved Neural Fictitious Self-Play Reinforcement Learning (Conference paper)
Advanced Intelligent Computing Technology and Applications, Zhengzhou, China, 2023-8
Authors: He SQ(何少钦); Gao Y(高阳); Zhang BF(张保丰); Chang H(常惠); Zhang XC(张鑫辰)
Adobe PDF (1496Kb) | Views/Downloads: 44/13 | Submitted: 2024/05/31
Air Combat, Reinforcement Learning, Neural Fictitious Self-Play.  
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning (Journal article)
Connection Science, 2023, Volume: 35, Issue: 1, Pages: 2174078
Authors: Qiu JY(邱俊彦); Haidong Zhang; Yiping Yang
Adobe PDF (831Kb) | Views/Downloads: 38/13 | Submitted: 2024/05/29
reinforcement learning, dialogue policy learning, curriculum learning, knowledge distillation
Explicitly Learning Policy Under Partial Observability in Multiagent Reinforcement Learning (Conference paper)
, Queensland, Australia, 2023-6
Authors: Yang, Chen; Yang, Guangkai; Chen, Hao; Zhang, Junge
Adobe PDF (3027Kb) | Views/Downloads: 43/17 | Submitted: 2024/05/29
A brain-inspired theory of mind spiking neural network improves multi-agent cooperation and competition (Journal article)
Patterns, 2023, Pages: 8
Authors: Zhao, Zhuoya; Zhao, Feifei; Zhao, Yuxuan; Sun, Yinqian; Zeng, Yi
Adobe PDF (4502Kb) | Views/Downloads: 40/5 | Submitted: 2024/05/28
Cooperative Task Scheduling and Planning Considering Resource Conflicts and Precedence Constraints (Journal article)
International Journal of Precision Engineering and Manufacturing, 2023, Pages: 1503-1516
Authors: Li, Donghui; Su, Hu; Xu, Xinyi; Wang, Qingbin; Qin, Jie; Zou, Wei
Adobe PDF (2513Kb) | Views/Downloads: 34/13 | Submitted: 2024/05/28