CASIA OpenIR

浏览/检索结果: 共67条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:9/5  |  提交时间:2024/06/25
强化学习,分层强化学习  
Multi-Scale Dynamic Coding Improved Spiking Actor Network for Reinforcement Learning 会议论文
, Online, February 22–March 1, 2022
作者:  Zhang, Duzhen;  Zhang, Tielin;  Jia, Shuncheng;  Xu, Bo
Adobe PDF(2249Kb)  |  收藏  |  浏览/下载:20/7  |  提交时间:2024/06/11
Continuous Exploration via Multiple Perspectives in Sparse Reward Environment 会议论文
, 厦门国际会议中心, 2023-10-13
作者:  Chen ZP(陈忠鹏);  Guan Q(关强)
Adobe PDF(2260Kb)  |  收藏  |  浏览/下载:26/7  |  提交时间:2024/06/04
Reinforcement Learning · Exploration Strategy · Sparse Reward · Intrinsic Motivation  
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Zeren Zhang;  Guangchong Zhou;  Hao Chen;  Guoliang Fan
Adobe PDF(4141Kb)  |  收藏  |  浏览/下载:25/8  |  提交时间:2024/05/28
Mingling Foresight with Imagination: Model-Based Cooperative Multi-Agent Reinforcement Learning 会议论文
, New Orleans, LA, USA,, November 28 - December 9, 2022
作者:  Zhiwei Xu;  Dapeng Li;  Bin Zhang;  Yuan Zhan;  Yunpeng Bai;  Guoliang Fan
Adobe PDF(4367Kb)  |  收藏  |  浏览/下载:23/4  |  提交时间:2024/05/28
SIDE: State Inference for Partially Observable Cooperative Multi-Agent Reinforcement Learning 会议论文
, Auckland, New Zealand, May 9-13, 2022
作者:  Zhiwei Xu;  Yunpeng Bai;  Dapeng Li;  Bin Zhang;  Guoliang Fan
Adobe PDF(2965Kb)  |  收藏  |  浏览/下载:26/6  |  提交时间:2024/05/28
Learning to Coordinate via Multiple Graph Neural Networks 会议论文
, BALI, Indonesia, December 8-12, 2021
作者:  Zhiwei Xu;  Bin Zhang;  Yunpeng Bai;  Dapeng Li;  Guoliang Fan
Adobe PDF(2047Kb)  |  收藏  |  浏览/下载:31/12  |  提交时间:2024/05/28
Counterfactual Supporting Facts Extraction for Explainable Medical Record Based Diagnosis with Graph Network 会议论文
, Online, June 6–11, 2021
作者:  Wu HR(吴浩然);  Chen W(陈炜);  Xu S(徐爽);  Xu B(徐波)
Adobe PDF(1394Kb)  |  收藏  |  浏览/下载:197/62  |  提交时间:2023/06/26
Joint Modeling of Document and Label with Clause Interaction Hypergraph for ICD Medical Code Assignment 会议论文
, Padua, Italy, 18-23 July 2022
作者:  Wu HR(吴浩然);  Meng LH(孟令辉);  Xu S(徐爽);  Xu B(徐波)
Adobe PDF(612Kb)  |  收藏  |  浏览/下载:97/37  |  提交时间:2023/06/26
PKD: General Distillation Framework for Object Detectors via Pearson Correlation Coefficient 会议论文
, New Orleans, America, Monday November 28th through Friday December 9th
作者:  Weihan, Cao;  Yifan, Zhang;  Jianfei, Gao;  Anda, Cheng;  Ke, Cheng;  Jian, Cheng
Adobe PDF(2614Kb)  |  收藏  |  浏览/下载:112/30  |  提交时间:2023/06/21
Knowledge Distillation  Model Compression  Object Detection