CASIA OpenIR

浏览/检索结果: 共142条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
A position-control-based framework for dynamic and robust quadrupedal trotting, Measurement and Control 会议论文
, Virtual Conference, December 11-13,2021
作者:  Wang,Boxing;  Jia,Lihao;  Liu,Song;  Zhang,Haoyu;  Yin,Zeya
Adobe PDF(2757Kb)  |  收藏  |  浏览/下载:16/6  |  提交时间:2024/06/20
Nonlinear Filtering With Sample-Based Approximation Under Constrained Communication: Progress, Insights and Trends 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1539-1556
作者:  Weihao Song;  Zidong Wang;  Zhongkui Li;  Jianan Wang;  Qing-Long Han
Adobe PDF(1858Kb)  |  收藏  |  浏览/下载:23/8  |  提交时间:2024/06/07
Communication constraints  maximum correntropy filter  networked nonlinear filtering  particle filter  sample-based approximation  unscented Kalman filter  
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing 会议论文
, Singapore, 2023/8/24-27
作者:  Yang,Ning;  Wen,Junrui;  Zhang,Meng;  Tang,Ming
Adobe PDF(499Kb)  |  收藏  |  浏览/下载:33/12  |  提交时间:2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:25/4  |  提交时间:2024/06/05
Beyond the Edge: An Advanced Exploration of Reinforcement Learning for Mobile Edge Computing, its Applications, and Future Research Trajectories 期刊论文
IEEE Communications Surveys and Tutorials, 2024, 页码: 50
作者:  Yang,Ning;  Chen,Shuo;  Zhang,Haijun;  Berry,Randall
Adobe PDF(1694Kb)  |  收藏  |  浏览/下载:35/3  |  提交时间:2024/06/01
Reinforcement learning, mobile edge computing, offloading scheduling, content caching, and communication  
基于序列展开模型的多智能体方法研究 学位论文
, 2024
作者:  Luo ZX(罗正昕)
Adobe PDF(13451Kb)  |  收藏  |  浏览/下载:44/1  |  提交时间:2024/05/30
多智能体  强化学习  序列展开模型  信度分配  非平稳性  
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:29/10  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
基于强化学习的多智能体协同决策关键问题研究 学位论文
, 2024
作者:  徐志伟
Adobe PDF(12464Kb)  |  收藏  |  浏览/下载:67/6  |  提交时间:2024/05/28
强化学习  多智能体系统  协同与合作  分层决策  对比学习  
多智能体强化学习预训练方法研究 学位论文
, 2024
作者:  孟令辉
Adobe PDF(6367Kb)  |  收藏  |  浏览/下载:61/6  |  提交时间:2024/05/28
多智能体强化学习  预训练方法  神经网络  表示学习  在线强化评估  
MMD-MIX: Value Function Factorisation with Maximum Mean Discrepancy for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Shenzhen, China, 18-22 July 2021
作者:  Zhiwei Xu;  Dapeng Li;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(3892Kb)  |  收藏  |  浏览/下载:12/6  |  提交时间:2024/05/28