CASIA OpenIR

浏览/检索结果: 共29条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:8/1  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
不确定工业过程运行指标异步更新强化学习决策算法 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 2, 页码: 461-472
作者:  李金娜;  袁林;  丁进良
Adobe PDF(1941Kb)  |  收藏  |  浏览/下载:22/9  |  提交时间:2024/05/09
运行优化控制  强化学习  数据驱动控制  自适应动态规划  安全运行  
State of the Art on Deep Learning-enhanced Rendering Methods 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 6, 页码: 799-821
作者:  Qi Wang;  Zhihua Zhong;  Yuchi Huo;  Hujun Bao;  Rui Wang
Adobe PDF(6540Kb)  |  收藏  |  浏览/下载:32/10  |  提交时间:2024/04/23
Neural rendering, computer graphics, scene representation, rendering, post-processing  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  收藏  |  浏览/下载:7/1  |  提交时间:2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
Offline Pre-trained Multi-agent Decision Transformer 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 2, 页码: 233-248
作者:  Linghui Meng;  Muning Wen;  Chenyang Le;  Xiyun Li;  Dengpeng Xing;  Weinan Zhang;  Ying Wen;  Haifeng Zhang;  Jun Wang;  Yaodong Yang;  Bo Xu
Adobe PDF(2121Kb)  |  收藏  |  浏览/下载:24/7  |  提交时间:2024/04/23
Pre-training model  multi-agent reinforcement learning (MARL)  decision making  transformer  offline reinforcement learning  
基于自适应动态规划的移动机器人视觉伺服跟踪控制 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 11, 页码: 2286-2296
作者:  罗彪;  欧阳志华;  易昕宁;  刘德荣
Adobe PDF(2335Kb)  |  收藏  |  浏览/下载:35/15  |  提交时间:2024/04/18
自适应动态规划  移动机器人  视觉伺服  轨迹跟踪  神经网络控制  
异策略深度强化学习中的经验回放研究综述 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 11, 页码: 2237-2256
作者:  胡子剑;  高晓光;  万开方;  张乐天;  汪强龙;  NERETINEvgeny
Adobe PDF(4679Kb)  |  收藏  |  浏览/下载:34/8  |  提交时间:2024/04/18
深度强化学习  异策略  经验回放  人工智能  
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:45/1  |  提交时间:2024/02/22
Brain-inspired neural circuit evolution for spiking neural networks 期刊论文
Proceedings of the National Academy of Sciences (PNAS), 2023, 卷号: 120, 期号: 39, 页码: 10
作者:  Shen, Guobin;  Zhao, Dongcheng;  Dong, Yiting;  Zeng, Yi
Adobe PDF(8398Kb)  |  收藏  |  浏览/下载:37/2  |  提交时间:2024/02/21
brain-inspired  neural circuit evolution  spiking neural networks  
Semantic Policy Network for Zero-Shot Object Goal Visual Navigation 期刊论文
IEEE ROBOTICS AND AUTOMATION LETTERS, 2023, 卷号: 8, 期号: 11, 页码: 7655-7662
作者:  Zhao, Qianfan;  Zhang, Lu;  He, Bin;  Liu, Zhiyong
Adobe PDF(1888Kb)  |  收藏  |  浏览/下载:99/0  |  提交时间:2023/12/21
Deep learning  path planning  reinforcement learning  vision-based navigation