CASIA OpenIR

浏览/检索结果: 共183条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
An Empirical Study on Google Research Football Multi-agent Scenarios 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 549-570
作者:  Yan Song;  He Jiang;  Zheng Tian;  Haifeng Zhang;  Yingping Zhang;  Jiangcheng Zhu;  Zonghong Dai;  Weinan Zhang;  Jun Wang
Adobe PDF(24588Kb)  |  收藏  |  浏览/下载:3/2  |  提交时间:2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory  
Enhancing Multi-agent Coordination via Dual-channel Consensus 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 2, 页码: 349-368
作者:  Qingyang Zhang;  Kaishen Wang;  Jingqing Ruan;  Yiming Yang;  Dengpeng Xing;  Bo Xu
Adobe PDF(4997Kb)  |  收藏  |  浏览/下载:16/7  |  提交时间:2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency  
Region-adaptive Concept Aggregation for Few-shot Visual Recognition 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 554-568
作者:  Mengya Han;  Yibing Zhan;  Baosheng Yu;  Yong Luo;  Han Hu;  Bo Du;  Yonggang Wen;  Dacheng Tao
Adobe PDF(4324Kb)  |  收藏  |  浏览/下载:13/4  |  提交时间:2024/04/23
Few-shot learning, metric-based meta learning, concept learning, region-adaptive, concept-aggregation  
Weakly Correlated Knowledge Integration for Few-shot Image Classification 期刊论文
Machine Intelligence Research, 2022, 卷号: 19, 期号: 1, 页码: 24-37
作者:  Chun Yang;  Chang Liu;  Xu-Cheng Yin
Adobe PDF(1133Kb)  |  收藏  |  浏览/下载:17/5  |  提交时间:2024/04/23
Computer vision  pattern recognition  knowledge refinement and reuse  neural networks  machine vision  
Improving metric-based few-shot learning with dynamically scaled softmax loss 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 15
作者:  Zhang, Yu;  Zuo, Xin;  Zheng, Xuxu;  Gao, Xiaoyong;  Wang, Bo;  Hu, Weiming
收藏  |  浏览/下载:39/0  |  提交时间:2024/02/22
Few-shot learning  Metric-based learning framework  Softmax loss improvement  
TENET: Beyond Pseudo-Labeling for Semi-supervised Few-shot Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 0
作者:  Ma CC(马成丞);  Dong WM(董未名);  Xu CS(徐常胜)
Adobe PDF(741Kb)  |  收藏  |  浏览/下载:110/24  |  提交时间:2024/01/29
Semi-supervised few-shot learning  few-shot learning  pseudo-labeling  linear regression  low-rank reconstruction  
UAV-Assisted Dynamic Avatar Task Migration for Vehicular Metaverse Services: A Multi-Agent Deep Reinforcement Learning Approach 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 2, 页码: 430-445
作者:  Jiawen Kang;  Junlong Chen;  Minrui Xu;  Zehui Xiong;  Yutao Jiao;  Luchao Han;  Dusit Niyato;  Yongju Tong;  Shengli Xie
Adobe PDF(6097Kb)  |  收藏  |  浏览/下载:61/13  |  提交时间:2024/01/23
Avatar  blockchain  metaverses  multi-agent deep reinforcement learning  transformer  UAVs  
Path Planning and Tracking Control for Parking via Soft Actor-Critic Under Non-Ideal Scenarios 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 181-195
作者:  Xiaolin Tang;  Yuyou Yang;  Teng Liu;  Xianke Lin;  Kai Yang;  Shen Li
Adobe PDF(4905Kb)  |  收藏  |  浏览/下载:201/124  |  提交时间:2024/01/02
Automatic parking  control strategy  parking deviation (APS)  soft actor-critic (SAC)  
Autonomous Vehicle Platoons In Urban Road Networks: A Joint Distributed Reinforcement Learning and Model Predictive Control Approach 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 1, 页码: 141-156
作者:  Luigi D’Alfonso;  Francesco Giannini;  Giuseppe Franzè;  Giuseppe Fedele;  Francesco Pupo;  Giancarlo Fortino
Adobe PDF(7491Kb)  |  收藏  |  浏览/下载:207/128  |  提交时间:2024/01/02
Distributed model predictive control  distributed reinforcement learning  routing decisions  urban road networks  
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 12, 页码: 2233-2247
作者:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  收藏  |  浏览/下载:97/32  |  提交时间:2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping