CASIA OpenIR

Browse/Search Results:  1-10 of 265

Deep Reinforcement Learning Methods Based on Visual Representations (基于视觉表征的深度强化学习方法)  |  Thesis
, 2024
Authors:  刘民颂
Adobe PDF(10778Kb)  |  View/Download:9/1  |  Submit date:2024/06/22
Deep reinforcement learning, visual representation learning, self-supervised learning, state abstraction, Transformer neural networks
Physics-Inspired Spatial-Temporal Graph Neural Networks for Predicting Industrial Chain Resilience  |  Journal article
Journal of Machine Learning Research, 2024, Volume: 4, Issue: 23, Pages: 1-23
Authors:  Wang BC(王必成);  Wang JP(王军平)
Adobe PDF(2342Kb)  |  View/Download:22/10  |  Submit date:2024/06/12
Learning in Bi-level Markov Games  |  Conference paper
, Padua, Italy, 2022.7.18-2022.7.23
Authors:  Meng Linghui;  Ruan Jingqing;  Xing Dengpeng;  Xu Bo
Adobe PDF(1450Kb)  |  View/Download:13/4  |  Submit date:2024/06/11
M3: Modularization for Multi-task and Multi-agent Offline Pre-training  |  Conference paper
, London, United Kingdom, 2023.5.29-2023.6.2
Authors:  Meng Linghui;  Ruan Jingqing;  Xiong Xuantang;  Li Xiyun;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(1302Kb)  |  View/Download:8/2  |  Submit date:2024/06/11
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data  |  Conference paper
, Seoul, Korea, 2024.4.14-2024.4.19
Authors:  Meng Linghui;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(964Kb)  |  View/Download:10/4  |  Submit date:2024/06/11
Research on Sequence Modeling for Decision-Making Based on Pre-trained Models (基于预训练模型的决策序列化建模研究)  |  Thesis
, 2024
Authors:  林润基
Adobe PDF(7811Kb)  |  View/Download:41/0  |  Submit date:2024/06/07
Pre-trained models  decision sequence modeling  sequence models
Learn to flap: foil non-parametric path planning via deep reinforcement learning  |  Journal article
Journal of Fluid Mechanics, 2024, Volume: 984, Pages: A9
Authors:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang, Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  View/Download:22/2  |  Submit date:2024/06/07
Privacy-Preserving Average Consensus Algorithm Under Round-Robin Scheduling Protocol  |  Journal article
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1705-1707
Authors:  Yingjiang Guo;  Wenying Xu;  Haodong Wang;  Jianquan Lu;  Shengli Du
Adobe PDF(728Kb)  |  View/Download:14/8  |  Submit date:2024/06/07
Multi-Robot Collaborative Hunting in Cluttered Environments With Obstacle-Avoiding Voronoi Cells  |  Journal article
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1643-1655
Authors:  Meng Zhou;  Zihao Wang;  Jing Wang;  Zhengcai Cao
Adobe PDF(3022Kb)  |  View/Download:16/9  |  Submit date:2024/06/07
Dynamic obstacle avoidance  multi-robot collaborative hunting  obstacle-avoiding Voronoi cells  task allocation  
Distributed Optimal Variational GNE Seeking in Merely Monotone Games  |  Journal article
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1621-1630
Authors:  Wangli He;  Yanzhen Wang
Adobe PDF(2076Kb)  |  View/Download:11/7  |  Submit date:2024/06/07
Distributed algorithms  equilibria selection  generalized Nash equilibrium (GNE)  merely monotone games