CASIA OpenIR

Browse/Search results: 183 items in total, showing items 1-10

Enhancing Multi-agent Coordination via Dual-channel Consensus (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 349-368
Authors:  Qingyang Zhang
Adobe PDF(4997Kb)  |  Views/Downloads: 26/8  |  Submitted: 2024/04/03
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency  
Improving metric-based few-shot learning with dynamically scaled softmax loss (Journal article)
IMAGE AND VISION COMPUTING, 2023, Volume: 140, Pages: 15
Authors:  Zhang, Yu;  Zuo, Xin;  Zheng, Xuxu;  Gao, Xiaoyong;  Wang, Bo;  Hu, Weiming
Views/Downloads: 26/0  |  Submitted: 2024/02/22
Few-shot learning  Metric-based learning framework  Softmax loss improvement  
TENET: Beyond Pseudo-Labeling for Semi-supervised Few-shot Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 0
Authors:  Ma CC(马成丞);  Dong WM(董未名);  Xu CS(徐常胜)
Adobe PDF(741Kb)  |  Views/Downloads: 82/21  |  Submitted: 2024/01/29
Semi-supervised few-shot learning  few-shot learning  pseudo-labeling  linear regression  low-rank reconstruction  
UAV-Assisted Dynamic Avatar Task Migration for Vehicular Metaverse Services: A Multi-Agent Deep Reinforcement Learning Approach (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 2, Pages: 430-445
Authors:  Jiawen Kang;  Junlong Chen;  Minrui Xu;  Zehui Xiong;  Yutao Jiao;  Luchao Han;  Dusit Niyato;  Yongju Tong;  Shengli Xie
Adobe PDF(6097Kb)  |  Views/Downloads: 47/12  |  Submitted: 2024/01/23
Avatar  blockchain  metaverses  multi-agent deep reinforcement learning  transformer  UAVs  
Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 1, Pages: 181-195
Authors:  Xiaolin Tang;  Yuyou Yang;  Teng Liu;  Xianke Lin;  Kai Yang;  Shen Li
Adobe PDF(4905Kb)  |  Views/Downloads: 175/118  |  Submitted: 2024/01/02
Automatic parking  control strategy  parking deviation (APS)  soft actor-critic (SAC)  
Autonomous Vehicle Platoons In Urban Road Networks: A Joint Distributed Reinforcement Learning and Model Predictive Control Approach (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 1, Pages: 141-156
Authors:  Luigi D’Alfonso;  Francesco Giannini;  Giuseppe Franzè;  Giuseppe Fedele;  Francesco Pupo;  Giancarlo Fortino
Adobe PDF(7491Kb)  |  Views/Downloads: 182/125  |  Submitted: 2024/01/02
Distributed model predictive control  distributed reinforcement learning  routing decisions  urban road networks  
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 12, Pages: 2233-2247
Authors:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  Views/Downloads: 83/31  |  Submitted: 2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping  
Privacy Preserving Demand Side Management Method via Multi-Agent Reinforcement Learning (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 10, Pages: 1984-1999
Authors:  Feiye Zhang;  Qingyu Yang;  Dou An
Adobe PDF(3841Kb)  |  Views/Downloads: 73/38  |  Submitted: 2023/09/07
Centralized training and decentralized execution  demand side management  multi-agent reinforcement learning  privacy preserving  
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle (Journal article)
IEEE Transactions on Automation Science and Engineering, 2023, Pages: 1-10
Authors:  Ma, Ruichen;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Ming
Adobe PDF(2902Kb)  |  Views/Downloads: 150/49  |  Submitted: 2023/08/03
Omnidirectional Drift Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning (Conference paper)
Suzhou, China, May 14-16, 2021
Authors:  Ma, Ruichen;  Wang, Yu;  Wang, Rui;  Wang, Shuo
Adobe PDF(855Kb)  |  Views/Downloads: 71/28  |  Submitted: 2023/08/02
Omnidirectional Drift Control  Undulating Fin  Underwater Biomimetic Vehicle-manipulator System (UBVMS)  Reinforcement Learning  Twin Delayed Deep Deterministic policy gradient (TD3)