CASIA OpenIR

浏览/检索结果: 共141条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Offline Hierarchical Reinforcement Learning: Enable Large-Scale Training in HRL 会议论文
, Nanjing, 2023-11-27
作者:  Yuqiao Wu;  Haifeng Zhang;  Jun Wang
Adobe PDF(1339Kb)  |  收藏  |  浏览/下载:33/10  |  提交时间:2024/07/12
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:29/13  |  提交时间:2024/06/25
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:53/17  |  提交时间:2024/06/24
A Hip-Knee Joint Coordination Evaluation System in Hemiplegic Individuals Based on Cyclogram Analysis 会议论文
, Changsha, China, 20-23 November
作者:  Ningcun Xu;  Chen Wang;  Liang Peng;  Jingyao Chen;  Zhi Cheng;  Zeng-Guang Hou;  Pu Zhang;  Zejia He
Adobe PDF(727Kb)  |  收藏  |  浏览/下载:51/18  |  提交时间:2024/06/21
Pavement Defect Detection with Deep Learning: A Comprehensive Survey 期刊论文
IEEE Transactions on Intelligent Vehicles, 2023, 卷号: 9, 期号: 3, 页码: 4292 - 4311
作者:  Lili Fan;  Dandan Wang;  Junhao Wang;  Yunjie Li;  Yifeng Cao;  Yi Liu;  Xiaoming Chen;  Yutong Wang
Adobe PDF(6287Kb)  |  收藏  |  浏览/下载:60/17  |  提交时间:2024/06/06
Deep learning  pavement defect detection  computer vision  image processing  3D image  
BrainCog: A spiking neural network based, braininspired cognitive intelligence engine for braininspired AI and brain simulation 期刊论文
Patterns, 2023, 页码: 100789
作者:  Zeng, Yi;  Zhao, Dongcheng;  Zhao, Feifei;  Shen, Guobin;  Dong, Yiting;  Lu, Enmeng;  Zhang, Qian;  Sun, Yinqian;  Liang, Qian;  Zhao, Yuxuan;  Zhao, Zhuoya;  Fang, Hongjian;  Wang, Yuwei;  Li, Yang;  Liu, Xin;  Du, Chengcheng;  Kong, Qingqun;  Zizhe, Ruan;  Weida Bi
Adobe PDF(6608Kb)  |  收藏  |  浏览/下载:48/10  |  提交时间:2024/06/06
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:64/23  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 卷号: 35, 期号: 3, 页码: 3251 - 3264
作者:  Wei, Qinglai;  Li, Tao
Adobe PDF(8471Kb)  |  收藏  |  浏览/下载:70/26  |  提交时间:2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning  
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:43/12  |  提交时间:2024/05/28
复杂工业过程非串级双速率组合分散运行优化控制 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 172-184
作者:  赵建国;  杨春雨
Adobe PDF(1648Kb)  |  收藏  |  浏览/下载:91/29  |  提交时间:2024/05/09
复杂工业过程  运行优化控制  奇异摄动理论  Q-学习  双速率