CASIA OpenIR

浏览/检索结果: 共4条,第1-4条 帮助

限定条件            
已选(0)清除 条数/页:   排序方式:
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:21/9  |  提交时间:2024/06/25
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:57/22  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文
Machine Intelligence Research, 2023, 页码: 1-12
作者:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:213/82  |  提交时间:2023/06/26