CASIA OpenIR

Browse/Search Results: 212 items in total, items 1-10

Adaptive bias-variance trade-off in advantage estimator for actor-critic algorithms (Journal Article)
NEURAL NETWORKS, 2024, Volume: 169, Pages: 764-777
Authors:  Chen, Yurou;  Zhang, Fengyi;  Liu, Zhiyong
Views/Downloads: 16/0  |  Submitted: 2024/02/22
Reinforcement Learning  Policy gradient  Actor-critic  Value function  Bias-variance trade-off  
Synergetic learning for unknown nonlinear H∞ control using neural networks (Journal Article)
NEURAL NETWORKS, 2023, Volume: 168, Pages: 287-299
Authors:  Zhu, Liao;  Guo, Ping;  Wei, Qinglai
Views/Downloads: 72/0  |  Submitted: 2023/12/21
H∞ control  Nonlinear systems  Adaptive dynamic programming  Temporal difference  Neural network  Data-driven  
Brain-inspired neural circuit evolution for spiking neural networks (Journal Article)
PROCEEDINGS OF THE NATIONAL ACADEMY OF SCIENCES OF THE UNITED STATES OF AMERICA, 2023, Volume: 120, Issue: 39, Pages: 10
Authors:  Shen, Guobin;  Zhao, Dongcheng;  Dong, Yiting;  Zeng, Yi
Views/Downloads: 15/0  |  Submitted: 2024/02/21
brain-inspired  neural circuit evolution  spiking neural networks  
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference Paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  Views/Downloads: 161/37  |  Submitted: 2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Sample-Observed Soft Actor-Critic Learning for Path Following of a Biomimetic Underwater Vehicle (Journal Article)
IEEE Transactions on Automation Science and Engineering, 2023, Pages: 1-10
Authors:  Ma, Ruichen;  Wang, Yu;  Wang, Shuo;  Cheng, Long;  Wang, Rui;  Tan, Ming
Adobe PDF(2902Kb)  |  Views/Downloads: 163/52  |  Submitted: 2023/08/03
Pseudo Value Network Distillation for High-Performance Exploration (Conference Paper)
, Australia, 2023-06
Authors:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  Views/Downloads: 129/38  |  Submitted: 2023/06/28
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks (Conference Paper)
, Washington DC, USA, 2023-2-7
Authors:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  Views/Downloads: 193/60  |  Submitted: 2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent  
Efficient Accelerator/Network Co-Search with Circular Greedy Reinforcement Learning (Journal Article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS II: EXPRESS BRIEFS, 2023, Pages: 1-5
Authors:  Liu, Zejian;  Li, Gang;  Cheng, Jian
Adobe PDF(1982Kb)  |  Views/Downloads: 104/32  |  Submitted: 2023/06/19
Accelerator/Network Co-Search  Reinforcement Learning  Performance Estimation  Multi-objective Optimization  
A Torque Control Strategy for a Robotic Dolphin Platform Based on Angle of Attack Feedback (Journal Article)
Biomimetics, 2023, Volume: 8, Pages: 291
Authors:  Tianzhu Wang;  Junzhi Yu;  Di Chen;  Yan Meng
Adobe PDF(1925Kb)  |  Views/Downloads: 132/50  |  Submitted: 2023/09/21
robotic dolphin  torque control  angle of attack  motion improvement  
Can Digital Intelligence and Cyber-Physical-Social Systems Achieve Global Food Security and Sustainability? (Journal Article)
IEEE/CAA Journal of Automatica Sinica, 2023, Volume: 10, Issue: 11, Pages: 2070-2080
Authors:  Yanfen Wang;  Mengzhen Kang;  Yali Liu;  Juanjuan Li;  Kai Xue;  Xiujuan Wang;  Jianqing Du;  Yonglin Tian;  Qinghua Ni;  Fei-Yue Wang
Adobe PDF(9632Kb)  |  Views/Downloads: 189/108  |  Submitted: 2023/09/22
Carbon-water balance  decision-support  digital intelligence (DI)  foundation models  planning