CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:12/5  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:12/5  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:11/6  |  提交时间:2024/06/25
BERT-FKGC: Text-Enhanced Few-Shot Representation Learning for Knowledge Graphs 会议论文
, 日本横滨, 2024-6-30
作者:  Li JL(李金林);  Wang ZK(王子康);  Li LJ(李林静);  Ceng DJ(曾大军)
Adobe PDF(528Kb)  |  收藏  |  浏览/下载:29/9  |  提交时间:2024/06/05
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process 会议论文
, Singapore, 2023-12
作者:  Zhao Yang;  Yuanzhe Zhang;  Dianbo Sui;  Cao Liu;  Jun Zhao;  Kang Liu
Adobe PDF(592Kb)  |  收藏  |  浏览/下载:45/19  |  提交时间:2024/05/30
D2AH-PPO: Playing ViZDoom With Object-Aware Hierarchical Reinforcement Learning 会议论文
, 中国重庆, 2024.5.7-5.9
作者:  Niu LY(钮龙宇);  Wan J(万军)
Adobe PDF(1645Kb)  |  收藏  |  浏览/下载:36/7  |  提交时间:2024/05/28
深度强化学习  表征学习  分层学习  
3D Semantic Labeling of Photogrammetry Meshes Based on Active Learning 会议论文
, Milan, Italy, 2021-1-10
作者:  Mengqi Rong;  Shuhan Shen;  Zhanyi Hu
Adobe PDF(2400Kb)  |  收藏  |  浏览/下载:155/44  |  提交时间:2023/09/25
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:216/70  |  提交时间:2023/07/06
Defeating DeepFakes via Adversarial Visual Reconstruction 会议论文
, Lisbon, Oct 10, 2022 - Oct 10, 2022
作者:  Ziwen He;  Wei Wang;  Weinan Guan;  Jing Dong;  Tieniu Tan
Adobe PDF(16773Kb)  |  收藏  |  浏览/下载:181/26  |  提交时间:2023/04/26
A Collaborative Communication-Qmix Approach for Large-scale Networked Traffic Signal Control 会议论文
, Indianapolis, IN, United States, 2021-9-19
作者:  Chen, Xiaoyu;  Xiong, Gang;  Lv, Yisheng;  Chen, yuanyuan;  Song, bing;  Wang, Feiyue
Adobe PDF(1208Kb)  |  收藏  |  浏览/下载:272/70  |  提交时间:2022/06/16