CASIA OpenIR

Browse/Search Results: 184 items in total, showing items 1-10

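A Survey of Offline Reinforcement Learning Methods Based on Representation Learning (基于表征学习的离线强化学习方法研究综述) (Journal article)
Acta Automatica Sinica (自动化学报), 2024, Volume: 50, Issue: 6, Pages: 1104-1128
Authors:  Wang Xuesong (王雪松);  Wang Rongrong (王荣荣);  Cheng Yuhu (程玉虎)
Adobe PDF(3333Kb)  |  Views/Downloads: 12/8  |  Submitted: 2024/07/02
Reinforcement learning  offline reinforcement learning  representation learning  historical experience data  distribution shift  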
Low-Rank Optimal Transport for Robust Domain Adaptation (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1667-1680
Authors:  Bingrong Xu;  Jianhua Yin;  Cheng Lian;  Yixin Su;  Zhigang Zeng
Adobe PDF(3368Kb)  |  Views/Downloads: 39/17  |  Submitted: 2024/06/07
Domain adaptation  low-rank constraint  noise corruption  optimal transport  
Multi-Robot Collaborative Hunting in Cluttered Environments With Obstacle-Avoiding Voronoi Cells (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1643-1655
Authors:  Meng Zhou;  Zihao Wang;  Jing Wang;  Zhengcai Cao
Adobe PDF(3022Kb)  |  Views/Downloads: 38/16  |  Submitted: 2024/06/07
Dynamic obstacle avoidance  multi-robot collaborative hunting  obstacle-avoiding Voronoi cells  task allocation  
Discovering Latent Variables for the Tasks With Confounders in Multi-Agent Reinforcement Learning (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1591-1604
Authors:  Kun Jiang;  Wenzhang Liu;  Yuanda Wang;  Lu Dong;  Changyin Sun
Adobe PDF(2128Kb)  |  Views/Downloads: 33/12  |  Submitted: 2024/06/07
Latent variable model  maximum entropy  multi-agent reinforcement learning (MARL)  multi-agent system  
Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 7, Pages: 1566-1578
Authors:  Yuhan Zhang;  Zidong Wang;  Lei Zou;  Yun Chen;  Guoping Lu
Adobe PDF(2016Kb)  |  Views/Downloads: 31/10  |  Submitted: 2024/06/07
Buffer-aided strategy  neural networks  nonlinear control  output-feedback control  unreliable communication channel  
An Empirical Study on Google Research Football Multi-agent Scenarios (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 549-570
Authors:  Yan Song;  He Jiang;  Zheng Tian;  Haifeng Zhang;  Yingping Zhang;  Jiangcheng Zhu;  Zonghong Dai;  Weinan Zhang;  Jun Wang
Adobe PDF(24588Kb)  |  Views/Downloads: 51/13  |  Submitted: 2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory  
Parsing Objects at a Finer Granularity: A Survey (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 431-451
Authors:  Yifan Zhao;  Jia Li;  Yonghong Tian
Adobe PDF(1743Kb)  |  Views/Downloads: 33/13  |  Submitted: 2024/05/23
Finer granularity, visual parsing, part segmentation, fine-grained object recognition, part relationship  
Industry-oriented Detection Method of PCBA Defects Using Semantic Segmentation Models (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 6, Pages: 1438-1446
Authors:  Yang Li;  Xiao Wang;  Zhifan He;  Ze Wang;  Ke Cheng;  Sanchuan Ding;  Yijing Fan;  Xiaotao Li;  Yawen Niu;  Shanpeng Xiao;  Zhenqi Hao;  Bin Gao;  Huaqiang Wu
Adobe PDF(12898Kb)  |  Views/Downloads: 38/10  |  Submitted: 2024/05/22
Automated optical inspection (AOI)  deep learning  defect detection  printed circuit board assembly (PCBA)  semantic segmentation  
A Non-Parametric Scheme for Identifying Data Characteristic Based on Curve Similarity Matching (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 6, Pages: 1424-1437
Authors:  Quanbo Ge;  Yang Cheng;  Hong Li;  Ziyi Ye;  Yi Zhu;  Gang Yao
Adobe PDF(2544Kb)  |  Views/Downloads: 53/17  |  Submitted: 2024/05/22
Curve similarity matching  Gaussian-like noise  non-parametric scheme  parzen window  
Mapping Network-coordinated Stacked Gated Recurrent Units for Turbulence Prediction (Journal article)
IEEE/CAA Journal of Automatica Sinica, 2024, Volume: 11, Issue: 6, Pages: 1331-1341
Authors:  Zhiming Zhang;  Shangce Gao;  MengChu Zhou;  Mengtao Yan;  Shuyang Cao
Adobe PDF(7172Kb)  |  Views/Downloads: 49/20  |  Submitted: 2024/05/22
Convolutional neural network  deep learning  recurrent neural network  turbulence prediction  wind load prediction