CASIA OpenIR

Browse/Search results: 1795 items in total, showing items 1-10

QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning (Journal article)
IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 12
Authors: Liu BY (刘博寅)
Adobe PDF (6675Kb) | Views/Downloads: 13/2 | Submitted: 2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning (Journal article)
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors: Wang ZY (王梓薏); Li XR (李欣然); Sun LY (孙罗洋); Zhang HF (张海峰); Liu HL (刘华林); Jun Wang
Adobe PDF (2976Kb) | Views/Downloads: 24/12 | Submitted: 2024/07/05
Keywords: reinforcement learning; exploration efficiency; space reduction
Boosting On-Policy Actor-Critic With Shallow Updates in Critic (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 10
Authors: Li, Luntong; Zhu, Yuanheng
Views/Downloads: 6/0 | Submitted: 2024/07/03
Keywords: Artificial neural networks; Vectors; Task analysis; Training; Representation learning; Approximation algorithms; Optimization; Actor-critic; deep reinforcement learning (DRL); proximal policy optimization (PPO); shallow reinforcement learning (SRL)
Online Adaptive Dynamic Programming for Optimal Self-Learning Control of VTOL Aircraft Systems With Disturbances (Journal article)
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2024, Volume: 21, Issue: 1, Pages: 343-352
Authors: Wei, Qinglai; Yang, Zesheng; Su, Huaizhong; Wang, Lijian
Views/Downloads: 4/0 | Submitted: 2024/07/03
Keywords: Adaptive dynamic programming (ADP); VTOL aircraft system; policy iteration; neural network (NN); optimal control; iterative errors
Synergetic Learning Neuro-Control for Unknown Affine Nonlinear Systems With Asymptotic Stability Guarantees (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 11
Authors: Zhu, Liao; Wei, Qinglai; Guo, Ping
Views/Downloads: 4/0 | Submitted: 2024/07/03
Keywords: Approximate dynamic programming (ADP); neural network; off-policy; optimal control; reinforcement learning (RL)
Graph Information Bottleneck-Based Dual Subgraph Prediction for Molecular Interactions (Journal article)
IEEE ACCESS, 2024, Volume: 12, Pages: 30113-30122
Authors: Li, Lanqi; Dong, Weiming
Views/Downloads: 6/0 | Submitted: 2024/07/03
Keywords: Graph information bottleneck; graph neural networks; molecular interactions
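Stochastic Variational Bayesian Learning of Wiener Models in Uncertain Environments (不确定性环境下维纳模型的随机变分贝叶斯学习) (Journal article)
Acta Automatica Sinica (自动化学报), 2024, Volume: 50, Issue: 6, Pages: 1185-1198
Authors: 刘切; 李俊豪; 王浩; 曾建学; 柴毅
Adobe PDF (2009Kb) | Views/Downloads: 19/11 | Submitted: 2024/07/02
Keywords: nonlinear system identification; stochastic optimization; variational Bayes; Wiener model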
Autonomy Evaluation of Unmanned Systems Based on Task Models (Journal article)
Machine Intelligence Research, 2024, Pages: 1-16
Authors: Yi Zou; Zehao Ni; Xun Lei; Chi Zhang
Adobe PDF (1801Kb) | Views/Downloads: 31/10 | Submitted: 2024/06/27
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF (9639Kb) | Views/Downloads: 16/9 | Submitted: 2024/06/25
GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images (Journal article)
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, Volume: 17, Issue: 2024, Pages: 4222-4234
Authors: Cao, Yong; Huo, Chunlei; Xiang, Shiming; Pan, Chunhong
Adobe PDF (4340Kb) | Views/Downloads: 20/4 | Submitted: 2024/06/25
Keywords: Cross feature fusion (CFF); global context learning; group transformer; semantic segmentation