CASIA OpenIR

Browse/Search Results: 599 items in total, showing items 1-10

Boosting On-Policy Actor–Critic With Shallow Updates in Critic [Journal article]
IEEE Transactions on Neural Networks and Learning Systems, 2024, Pages: 1-10
Authors:  Luntong Li;  Yuanheng Zhu
Adobe PDF(9953Kb)  |  Views/Downloads: 3/1  |  Submitted: 2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning [Journal article]
IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 1-12
Authors:  Boyu Li;  Haoran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  Views/Downloads: 3/1  |  Submitted: 2024/06/05
FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game [Journal article]
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, Pages: 1-13
Authors:  Guangzheng Hu;  Yuanheng Zhu;  Haoran Li;  Dongbin Zhao
Adobe PDF(2144Kb)  |  Views/Downloads: 2/0  |  Submitted: 2024/06/05
Investigating Shift Equivalence of Convolutional Neural Networks in Industrial Defect Segmentation [Journal article]
IEEE Transactions on Instrumentation and Measurement, 2023, Pages: 1-17
Authors:  Qu Z(屈震);  Tao X(陶显);  Shen F(沈飞);  Zhang ZT(张正涛);  Li T(李涛)
Adobe PDF(2869Kb)  |  Views/Downloads: 0/0  |  Submitted: 2024/06/04
A Bio-Inspired Integration Model of Basal Ganglia and Cerebellum for Motion Learning of a Musculoskeletal Robot [Journal article]
Journal of Systems Science and Complexity, 2024, Volume: 37, Pages: 82-113
Authors:  Jinhan Zhang;  Jiahao Chen;  Shanlin Zhong;  Hong Qiao
Adobe PDF(1513Kb)  |  Views/Downloads: 11/0  |  Submitted: 2024/06/04
Two-particle Debris Flow Simulation Based on SPH [Journal article]
Computer Animation and Virtual Worlds, 2024, Volume: 35, Issue: 3, Pages: 1-17
Authors:  Zhang JX(张佳岫);  Yang M(杨猛);  Xiaomin Li;  Qunou Jiang;  Heng Zhang;  Meng WL(孟维亮)
Adobe PDF(4962Kb)  |  Views/Downloads: 0/0  |  Submitted: 2024/06/04
Efficient Calibration of Agent-Based Traffic Simulation Using Variational Auto-Encoder [Conference paper]
N/A, Macau, China, Oct. 08-12, 2022
Authors:  Peijun Ye;  Fenghua Zhu;  Yisheng Lv;  Xiao Wang;  Yuanyuan Chen
Adobe PDF(1928Kb)  |  Views/Downloads: 4/0  |  Submitted: 2024/06/03
Agent-Based Model  Calibration
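Research on Large-Scale Swarm Intelligence Decision-Making Methods Based on Deep Reinforcement Learning (基于深度强化学习的大规模群体智能决策方法研究) [Thesis]
2024
Author:  Fu Qingxu (付清旭)
Adobe PDF(39071Kb)  |  Views/Downloads: 24/1  |  Submitted: 2024/05/29
Large-scale, swarm systems, cooperation, decision-making, deep reinforcement learning, multi-agent systems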
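Representation and Learning of Robot Manipulation Policies Based on Reinforcement Learning (基于强化学习的机器人操作策略表征与学习) [Thesis]
2024
Author:  Yang Yiming (杨依明)
Adobe PDF(19731Kb)  |  Views/Downloads: 13/0  |  Submitted: 2024/05/28
Reinforcement learning  Robot manipulation  Robot control  Policy representation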
Policy Iteration Algorithm for Constrained Cost Optimal Control of Discrete-Time Nonlinear System [Conference paper]
Shenzhen, China, 2021.7.18-22
Authors:  Li, Tao;  Wei, Qinglai;  Li, Hongyang;  Song, Ruizhuo
Adobe PDF(920Kb)  |  Views/Downloads: 18/7  |  Submitted: 2024/05/28