CASIA OpenIR

Browse/Search Results: 50 items in total, showing items 1-10

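A Survey of Offline Reinforcement Learning Methods Based on Representation Learning (基于表征学习的离线强化学习方法研究综述) (Journal article)
Acta Automatica Sinica (自动化学报), 2024, Vol. 50, No. 6, pp. 1104-1128
Authors:  Wang Xuesong (王雪松);  Wang Rongrong (王荣荣);  Cheng Yuhu (程玉虎)
Adobe PDF (3333Kb)  |  Views/Downloads: 7/4  |  Submitted: 2024/07/02
Reinforcement learning  offline reinforcement learning  representation learning  historical experience data  distribution shift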
A LiDAR Point Clouds Dataset of Ships in a Maritime Environment (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Vol. 11, No. 7, pp. 1681-1694
Authors:  Qiuyu Zhang;  Lipeng Wang;  Hao Meng;  Wen Zhang;  Genghua Huang
Adobe PDF (10728Kb)  |  Views/Downloads: 38/14  |  Submitted: 2024/06/07
3D point clouds dataset  dynamic tail wave  fog simulation  rainy simulation  simulated data  
Multi-Robot Collaborative Hunting in Cluttered Environments With Obstacle-Avoiding Voronoi Cells (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Vol. 11, No. 7, pp. 1643-1655
Authors:  Meng Zhou;  Zihao Wang;  Jing Wang;  Zhengcai Cao
Adobe PDF (3022Kb)  |  Views/Downloads: 33/15  |  Submitted: 2024/06/07
Dynamic obstacle avoidance  multi-robot collaborative hunting  obstacle-avoiding Voronoi cells  task allocation  
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Vol. 11, No. 7, pp. 1591-1604
Authors:  Kun Jiang;  Wenzhang Liu;  Yuanda Wang;  Lu Dong;  Changyin Sun
Adobe PDF (2128Kb)  |  Views/Downloads: 32/11  |  Submitted: 2024/06/07
Latent variable model  maximum entropy  multi-agent reinforcement learning (MARL)  multi-agent system  
Nonlinear Filtering With Sample-Based Approximation Under Constrained Communication: Progress, Insights and Trends (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Vol. 11, No. 7, pp. 1539-1556
Authors:  Weihao Song;  Zidong Wang;  Zhongkui Li;  Jianan Wang;  Qing-Long Han
Adobe PDF (1858Kb)  |  Views/Downloads: 30/9  |  Submitted: 2024/06/07
Communication constraints  maximum correntropy filter  networked nonlinear filtering  particle filter  sample-based approximation  unscented Kalman filter  
Overhead-free Noise-tolerant Federated Learning: A New Baseline (Journal article)
Machine Intelligence Research, 2024, Vol. 21, No. 3, pp. 526-537
Authors:  Shiyi Lin;  Deming Zhai;  Feilong Zhang;  Junjun Jiang;  Xianming Liu;  Xiangyang Ji
Adobe PDF (1816Kb)  |  Views/Downloads: 38/10  |  Submitted: 2024/05/23
Federated learning, noise-label learning, privacy-preserving machine learning, edge intelligence, distributed machine learning  
Collective Movement Simulation: Methods and Applications (Journal article)
Machine Intelligence Research, 2024, Vol. 21, No. 3, pp. 452-480
Authors:  Hua Wang;  Xing-Yu Guo;  Hao Tao;  Ming-Liang Xu
Adobe PDF (1439Kb)  |  Views/Downloads: 39/10  |  Submitted: 2024/05/23
Collective movement simulation, multiple objects, multiple discipline, simulation effect, collective intelligence  
Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox (Journal article)
Machine Intelligence Research, 2024, Vol. 21, No. 3, pp. 411-430
Authors:  Qiyue Yin;  Tongtong Yu;  Shengqi Shen;  Jun Yang;  Meijing Zhao;  Wancheng Ni;  Kaiqi Huang;  Bin Liang;  Liang Wang
Adobe PDF (2923Kb)  |  Views/Downloads: 39/15  |  Submitted: 2024/05/23
Deep reinforcement learning, distributed machine learning, self-play, population-play, toolbox  
A Local-Global Attention Fusion Framework with Tensor Decomposition for Medical Diagnosis (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Vol. 11, No. 6, pp. 1536-1538
Authors:  Peishu Wu;  Han Li;  Liwei Hu;  Jirong Ge;  Nianyin Zeng
Adobe PDF (630Kb)  |  Views/Downloads: 38/14  |  Submitted: 2024/05/22
Accelerated Primal-Dual Projection Neurodynamic Approach With Time Scaling for Linear and Set Constrained Convex Optimization Problems (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Vol. 11, No. 6, pp. 1485-1498
Authors:  You Zhao;  Xing He;  Mingliang Zhou;  Tingwen Huang
Adobe PDF (2287Kb)  |  Views/Downloads: 43/20  |  Submitted: 2024/05/22
Accelerated projection neurodynamic approach  linear and set constraints  projection operators  smooth and nonsmooth convex optimization  time scaling