CASIA OpenIR

浏览/检索结果: 共22条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/25
强化学习,分层强化学习  
UAV Path Planning with Terrain Constraints for Aerial Scanning. 期刊论文
IEEE Transactions on Intelligent Vehicles, 2024, 卷号: 9, 期号: 1, 页码: 1189-1203
作者:  Jinbiao Yuan;  Zhenbao Liu;  Xiaoyu Xiong;  Yunfeng Ai;  Long Chen;  Bin Tian
Adobe PDF(3939Kb)  |  收藏  |  浏览/下载:75/20  |  提交时间:2024/06/20
A Multi-AGV Routing Planning Method Based on Deep Reinforcement Learning and Recurrent Neural Network 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1720-1722
作者:  Yishuai Lin;  Gang Hu;  Liang Wang;  Qingshan Li;  Jiawei Zhu
Adobe PDF(2948Kb)  |  收藏  |  浏览/下载:55/22  |  提交时间:2024/06/07
Secure Tracking Control via Fixed-Time Convergent Reinforcement Learning for a UAV CPS 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1699-1701
作者:  Zhenyu Gong;  Feisheng Yang
Adobe PDF(604Kb)  |  收藏  |  浏览/下载:49/20  |  提交时间:2024/06/07
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1591-1604
作者:  Kun Jiang;  Wenzhang Liu;  Yuanda Wang;  Lu Dong;  Changyin Sun
Adobe PDF(2128Kb)  |  收藏  |  浏览/下载:42/15  |  提交时间:2024/06/07
Latent variable model  maximum entropy  multi-agent reinforcement learning (MARL)  multi-agent system  
Enhance Sample Efficiency and Robustness of End-to-end Urban Autonomous Driving via Semantic Masked World Model 期刊论文
IEEE Transactions on Intelligent Transportation Systems, 2024, 页码: 10.1109/TITS.2024.3400227
作者:  Zeyu Gao;  Yao Mu;  Chen Chen;  Jingliang Duan;  Ping Luo;  Yanfeng Lu;  Shengbo Eben Li
Adobe PDF(3954Kb)  |  收藏  |  浏览/下载:40/15  |  提交时间:2024/06/06
End-to-end autonomous driving  deep reinforcement learning  world model  
Fuzzy Feedback Multi-Agent Reinforcement Learning for Adversarial Dynamic Multi-Team Competitions 期刊论文
IEEE Transactions on Fuzzy Systems, 2024, 页码: 1
作者:  Qingxu Fu;  Zhiqiang Pu;  Yi Pan;  Tenghai Qiu;  Jianqiang Yi
Adobe PDF(4975Kb)  |  收藏  |  浏览/下载:43/14  |  提交时间:2024/06/05
Boosting On-Policy Actor–Critic With Shallow Updates in Critic 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10
作者:  Luntong Li;  Yuanheng Zhu
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:45/15  |  提交时间:2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:  Boyu Li;  Haran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:32/10  |  提交时间:2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game 期刊论文
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, 页码: 1-13
作者:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  收藏  |  浏览/下载:43/8  |  提交时间:2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games