CASIA OpenIR

Browse/Search results: 379 items in total, showing items 1-10

Boosting On-Policy Actor-Critic With Shallow Updates in Critic  Journal article
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Pages: 10
Authors:  Li, Luntong;  Zhu, Yuanheng
Views/Downloads: 6/0  |  Submitted: 2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
A Bio-Inspired Integration Model of Basal Ganglia and Cerebellum for Motion Learning of a Musculoskeletal Robot  Journal article
JOURNAL OF SYSTEMS SCIENCE & COMPLEXITY, 2024, Volume: 37, Issue: 1, Pages: 82-113
Authors:  Zhang, Jinhan;  Chen, Jiahao;  Zhong, Shanlin;  Qiao, Hong
Views/Downloads: 3/0  |  Submitted: 2024/07/03
Basal ganglia and cerebellum  bio-inspired integration model  motion learning  musculoskeletal robot  reinforcement learning  
Optimizing Reward Function Weights and Enhancing Control Mechanisms for Bipedal Robots Using LSTM and Attention Mechanisms  Conference paper
Baoding, Hebei, 2023-8-16
Authors:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(541Kb)  |  Views/Downloads: 16/4  |  Submitted: 2024/07/01
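Research on Gait Generation for Bipedal Robots (双足机器人步态生成的研究)  Thesis
2024
Author:  Cui Lingzhi (崔凌志)
Adobe PDF(7077Kb)  |  Views/Downloads: 24/1  |  Submitted: 2024/07/01
Bipedal robot control  hybrid zero dynamics  trajectory-free reinforcement learning  periodic gait reward mechanism  dynamic gait optimization  model fusion strategy  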
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination  Conference paper
Japan, 2024-6
Authors:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  Views/Downloads: 14/5  |  Submitted: 2024/06/25
Reinforcement learning, hierarchical reinforcement learning  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning  Journal article
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Xu Bo
Adobe PDF(9639Kb)  |  Views/Downloads: 13/6  |  Submitted: 2024/06/25
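Research on Vision-Language-Guided Robot Navigation (基于视觉-语言引导的机器人导航研究)  Thesis
2024
Author:  He Keji (何科技)
Adobe PDF(29796Kb)  |  Views/Downloads: 57/5  |  Submitted: 2024/06/25
Vision-language navigation, data scarcity, temporal information mining noise, cross-modal alignment, abnormal behavior  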
User Response Modeling in Reinforcement Learning for Ads Allocation  Conference paper
Singapore, May 13 - 17, 2024
Authors:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, Xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  Views/Downloads: 21/8  |  Submitted: 2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
A Brain-inspired Theory of Collective Mind Model for Efficient Social Cooperation  Journal article
IEEE Transactions on Artificial Intelligence, 2024, Pages: none
Authors:  Zhao, Zhuoya;  Zhao, Feifei;  Wang, Shiwen;  Sun, Yinqian;  Zeng, Yi
Adobe PDF(2270Kb)  |  Views/Downloads: 14/11  |  Submitted: 2024/06/25
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning  Journal article
IEEE Transactions on Cognitive and Developmental Systems, 2023, Volume: 15, Issue: 3, Pages: 1463 - 1473
Authors:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  Views/Downloads: 26/5  |  Submitted: 2024/06/24