CASIA OpenIR

Browse/Search Results:  1-10 of 127

Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination (Conference paper)
Japan, 2024-6
Authors:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  View/Download:15/5  |  Submit date:2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs (Conference paper)
Australia, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  View/Download:19/7  |  Submit date:2024/06/25
Reinforcement learning, hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  View/Download:14/7  |  Submit date:2024/06/25
Context-Aware Talking-Head Video Editing (Conference paper)
Ottawa, Canada, 2023.10.29-2023.11.2
Authors:  Songlin Yang;  Wei Wang;  Jun Ling;  Bo Peng;  Xu Tan;  Jing Dong
Adobe PDF(1657Kb)  |  View/Download:37/12  |  Submit date:2024/06/21
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning (Conference paper)
Online, February 22–March 1, 2022
Authors:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  View/Download:28/11  |  Submit date:2024/06/11
Leros: Learning Explicit Reasoning on Synthesized Data for Commonsense Question Answering (Conference paper)
Torino, Italia, 2024-5
Authors:  Wang, Chenhao;  Cao, Pengfei;  Li, Jiachun;  Chen, Yubo;  Liu, Kang;  Jiang, Xiaojian;  Xu, Jiexin;  Li, Qiuxia;  Jun Zhao
Adobe PDF(909Kb)  |  View/Download:36/9  |  Submit date:2024/05/30
T-Agent: A Term-Aware Agent for Medical Dialogue Generation (Conference paper)
Yokohama, Japan, 2024-6-30 - 2024-7-5
Authors:  Zefa Hu;  Haozhi Zhao;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
Adobe PDF(483Kb)  |  View/Download:43/10  |  Submit date:2024/05/29
SA-MPF: A Status-Aware Mask Prediction Framework for Online Disease Diagnosis (Conference paper)
Yokohama, Japan, 2024-6-30 - 2024-7-5
Authors:  Zefa Hu;  Linghui Meng;  Yunlong Zhao;  Yuanyuan Zhao;  Shuang Xu;  Bo Xu
Adobe PDF(307Kb)  |  View/Download:50/11  |  Submit date:2024/05/29
Matching-based Term Semantics Pre-training for Spoken Patient Query Understanding (Conference paper)
Rhodes, Greece, 2023-6-6 - 2023-6-10
Authors:  Zefa Hu;  Xiuyi Chen;  Haoran Wu;  Minglun Han;  Ziyi Ni;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1049Kb)  |  View/Download:46/14  |  Submit date:2024/05/29
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction (Conference paper)
Proceedings of the AAAI Conference on Artificial Intelligence, Washington, USA, 2023.02.07 - 2023.02.14
Authors:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  View/Download:199/44  |  Submit date:2023/07/05
Reinforcement Learning Algorithms, Transfer, Domain Adaptation, Multi-Task Learning