CASIA OpenIR

Browse/search results: 66 records in total, showing 1-10

FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game (journal article)
IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, Pages: 1-13
Authors: Guangzheng Hu; Yuanheng Zhu; Haoran Li; Dongbin Zhao
Adobe PDF(2144Kb) | Views/Downloads: 52/11 | Submitted: 2024/06/05
Games  Q-learning  Task analysis  Optimization  Convergence  Training  Nash equilibrium  Multi-agent reinforcement learning  minimax-Q learning  two-team zero-sum Markov games  
Policy Iteration Algorithm for Constrained Cost Optimal Control of Discrete-Time Nonlinear System (conference paper)
Shenzhen, China, 2021.7.18-22
Authors: Li, Tao; Wei, Qinglai; Li, Hongyang; Song, Ruizhuo
Adobe PDF(920Kb) | Views/Downloads: 66/28 | Submitted: 2024/05/28
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Volume: 35, Issue: 3, Pages: 3251-3264
Authors: Wei, Qinglai; Li, Tao
Adobe PDF(8471Kb) | Views/Downloads: 69/25 | Submitted: 2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning  
Adaptive Locomotion Transition Recognition With Wearable Sensors for Lower Limb Robotic Prosthesis (journal article)
IEEE-ASME TRANSACTIONS ON MECHATRONICS, 2023, Pages: 11
Authors: Zheng, Enhao; Wan, Jiacheng; Gao, Siyuan; Wang, Qining
Adobe PDF(2630Kb) | Views/Downloads: 100/10 | Submitted: 2023/11/17
Adaptive recognition model  interday and interuser  locomotion mode recognition  lower limb robotic prostheses  template generation  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors: Chai, Jiajun; Zhu, Yuanheng; Zhao, Dongbin
Adobe PDF(2469Kb) | Views/Downloads: 66/5 | Submitted: 2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Towards Better Word Importance Ranking in Textual Adversarial Attacks (conference paper)
Gold Coast, Australia, June 18-23, 2023
Authors: Shi, Jiahui; Li, Linjing; Zeng, Daniel Dajun
Adobe PDF(932Kb) | Views/Downloads: 278/114 | Submitted: 2023/09/27
Singing-Tacotron: Global Duration Control Attention and Dynamic Filter for End-to-end Singing Voice Synthesis (conference paper)
Online, 2022
Authors: Wang T (汪涛)
Adobe PDF(2873Kb) | Views/Downloads: 64/25 | Submitted: 2023/08/07
A robotic grasping approach with elliptical cone-based potential fields under disturbed scenes (journal article)
International Journal of Advanced Robotic Systems, 2021, Volume: 1, Issue: 18, Pages: 1-11
Authors: Wenjie Geng; Zhiqiang Cao; Zhonghui Li; Yingying Yu; Fengshui Jing; Junzhi Yu
Adobe PDF(15762Kb) | Views/Downloads: 134/38 | Submitted: 2023/06/29
Robotic grasping, elliptical cone, potential field, disturbed scene  
Empirical Policy Optimization for n-Player Markov Games (journal article)
IEEE Transactions on Cybernetics, 2022, DOI: 10.1109/TCYB.2022.3179775
Authors: Yuanheng Zhu; Weifan Li; Mengchen Zhao; Jianye Hao; Dongbin Zhao
Adobe PDF(1739Kb) | Views/Downloads: 117/48 | Submitted: 2023/04/26
A Collaborative Communication-Qmix Approach for Large-scale Networked Traffic Signal Control (conference paper)
Indianapolis, IN, United States, 2021-9-19
Authors: Chen, Xiaoyu; Xiong, Gang; Lv, Yisheng; Chen, Yuanyuan; Song, Bing; Wang, Feiyue
Adobe PDF(1208Kb) | Views/Downloads: 289/77 | Submitted: 2022/06/16