CASIA OpenIR

浏览/检索结果: 共21条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Smart Decentralized Autonomous Organizations and Operations for Smart Societies: Human–Autonomous Organizations for Industry 5.0 and Society 5.0 期刊论文
IEEE Intelligent Systems, 2023, 页码: 70-74
作者:  Xiao Wang;  Yutong Wang;  Mariana Netto;  Larry Stapleton;  Zhe Wan;  Fei-Yue Wang
Adobe PDF(367Kb)  |  收藏  |  浏览/下载:14/4  |  提交时间:2024/06/06
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/06/03
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:18/5  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Learning Individual Difference Rewards in Multi-Agent Reinforcement Learning 会议论文
, London, United Kingdom, 2023-5
作者:  Yang, Chen;  Yang, Guangkai;  Zhang, Junge
Adobe PDF(2419Kb)  |  收藏  |  浏览/下载:24/7  |  提交时间:2024/05/29
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  收藏  |  浏览/下载:9/2  |  提交时间:2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:54/5  |  提交时间:2024/02/22
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 12, 页码: 2233-2247
作者:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  收藏  |  浏览/下载:108/36  |  提交时间:2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping  
Multi-Blockchain Based Data Trading Markets With Novel Pricing Mechanisms 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 12, 页码: 2222-2232
作者:  Juanjuan Li;  Junqing Li;  Xiao Wang;  Rui Qin;  Yong Yuan;  Fei-Yue Wang
Adobe PDF(2004Kb)  |  收藏  |  浏览/下载:189/79  |  提交时间:2023/10/31
Auction  data trading markets  multi-blockchain  pricing mechanisms  
CRule: Category-Aware Symbolic Multi-Hop Reasoning on Knowledge Graphs 期刊论文
IEEE Intelligent Systems, 2023, 页码: 1-9
作者:  Wang, Zikang;  Li, Linjing;  Li, Jinlin;  Zhao, Pengfei;  Zeng, Daniel
Adobe PDF(529Kb)  |  收藏  |  浏览/下载:96/28  |  提交时间:2023/07/28
Adaptive pseudo-Siamese policy network for temporal knowledge prediction 期刊论文
Neural Networks, 2023, 卷号: 160, 页码: 192-201
作者:  Shao PP(邵朋朋)
Adobe PDF(1256Kb)  |  收藏  |  浏览/下载:92/36  |  提交时间:2023/07/03