CASIA OpenIR

Browse/Search Results:  1-10 of 29

Learning Top-K Subtask Planning Tree Based on Discriminative Representation Pretraining for Decision-making (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 4, Pages: 782-800
Authors:  Jingqing Ruan;   Kaishen Wang;   Qingyang Zhang;   Dengpeng Xing;   Bo Xu
Adobe PDF(4577Kb)  |  View/Download:6/3  |  Submit date:2024/07/18
Reinforcement learning  representation learning  subtask planning  task decomposition  pretraining.  
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination (Conference paper)
, Japan, 2024-6
Authors:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  View/Download:20/7  |  Submit date:2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
, Australia, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  View/Download:33/13  |  Submit date:2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  View/Download:19/9  |  Submit date:2024/06/25
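Research on Representation-Enhanced Deep Reinforcement Learning Algorithms (Thesis; original title: 表示增强的深度强化学习算法研究)
, 2024
Authors:  Zhang Qingyang
Adobe PDF(37765Kb)  |  View/Download:83/7  |  Submit date:2024/06/04
Deep reinforcement learning, representation learning, hierarchical reinforcement learning, multi-agent reinforcement learning, large language models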
PCEN: Potential Correlation-Enhanced Network for Multimodal Named Entity Recognition (Conference paper)
, Charlotte, NC, USA, 02-03 October 2023
Authors:  Jiakai Geng;  Chenyang Zhang;  Linjing Li;  Qing Yang;  Daniel Zeng
Adobe PDF(4985Kb)  |  View/Download:57/9  |  Submit date:2024/05/31
named entity recognition  multimodal learning  vision-language pre-trained model  inconsistency loss  
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery (Conference paper)
, Changsha, 2023-11
Authors:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF(2044Kb)  |  View/Download:38/21  |  Submit date:2024/05/28
Enhancing Multi-agent Coordination via Dual-channel Consensus (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 349-368
Authors:  Qingyang Zhang;  Kaishen Wang;  Jingqing Ruan;  Yiming Yang;  Dengpeng Xing;  Bo Xu
Adobe PDF(4997Kb)  |  View/Download:71/26  |  Submit date:2024/04/23
Multi-agent reinforcement learning, contrastive representation learning, consensus, multi-agent cooperation, cognitive consistency  
Cross-lingual text image recognition via multi-task sequence to sequence learning (Conference paper)
, Milan, Italy, 2021-1-10
Authors:  Chen, Zhuo;  Yin, Fei;  Zhang, Xu-Yao;  Yang, Qing;  Liu, Cheng-Lin
Adobe PDF(2273Kb)  |  View/Download:228/52  |  Submit date:2021/07/01
Convolutional Prototype Network for Open Set Recognition (Journal article)
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, Volume: 44, Issue: 5, Pages: 2358-2370
Authors:  Hong-Ming Yang;  Xu-Yao Zhang;  Fei Yin;  Qing Yang;  Cheng-Lin Liu
Adobe PDF(5038Kb)  |  View/Download:390/83  |  Submit date:2021/06/02
open-set recognition  CNN  prototype model  unknown detection  discriminative model  generative model