CASIA OpenIR

浏览/检索结果: 共11条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Line-Based 3D Building Abstraction and Polygonal Surface Reconstruction From Images 期刊论文
IEEE Transactions on Visualization and Computer Graphics, 2022, 卷号: xx, 期号: xx, 页码: 1-15
作者:  Guo, Jianwei;  Liu, Yanchao;  Song, Xin;  Liu, Haoyu;  Zhang, Xiaopeng;  Cheng, Zhanglin
Adobe PDF(6913Kb)  |  收藏  |  浏览/下载:43/15  |  提交时间:2024/06/03
3D reconstruction  3D Line cloud  Scene abstraction  Polygonal mesh model  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:198/69  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:114/45  |  提交时间:2023/04/26
SURRL: Structural Unsupervised Representations for Robot Learning 期刊
创刊日期: 2022,
主办者:  Zhang FY(张丰一), Yurou Chen, Hong Qiao, Zhiyong Liu
Adobe PDF(7817Kb)  |  收藏  |  浏览/下载:301/104  |  提交时间:2023/01/12
Reinforcement learning  structural representations learning  multi-task learning  robotics  
Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game 期刊论文
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 页码: 10
作者:  Li, Hongyang;  Wei, Qinglai
Adobe PDF(716Kb)  |  收藏  |  浏览/下载:231/62  |  提交时间:2022/06/14
基于自适应动态规划的分布式迭代控制方法研究 学位论文
工学博士, 人工智能学院: 中国科学院大学, 2022
作者:  李洪阳
Adobe PDF(3786Kb)  |  收藏  |  浏览/下载:321/26  |  提交时间:2022/06/14
自适应动态规划,最优控制,分布式控制,智能控制,强化学习  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:255/12  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Supervised assisted deep reinforcement learning for emergency voltage control of power systems 期刊论文
NEUROCOMPUTING, 2022, 卷号: 475, 页码: 69-79
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Dai, Yuxin;  Yu, Zhihong;  Zhang, Jun Jason;  Bu, Guangquan;  Wang, Fei-Yue
Adobe PDF(2551Kb)  |  收藏  |  浏览/下载:361/76  |  提交时间:2022/06/06
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Emergency control  
Event-triggered optimal control for discrete-time multi-player non-zero-sum games using parallel control 期刊论文
INFORMATION SCIENCES, 2022, 卷号: 584, 页码: 519-535
作者:  Lu, Jingwei;  Wei, Qinglai;  Wang, Ziyang;  Zhou, Tianmin;  Wang, Fei-Yue
收藏  |  浏览/下载:249/0  |  提交时间:2021/12/28
Event-triggered  Non-zero-sum games  Parallel control  Neural network  Adaptive dynamic programming  
SADRL: Merging human experience with machine intelligence via supervised assisted deep reinforcement learning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 467, 页码: 300-309
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Jin, Junchen;  Huang, Yanhao;  Zhang, Jun Jason;  Wang, Fei-Yue
Adobe PDF(1244Kb)  |  收藏  |  浏览/下载:347/77  |  提交时间:2021/12/28
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Double DQN