CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:19/9  |  提交时间:2024/06/25
3D Video Object Detection with Learnable Object-Centric Global Optimization 会议论文
, Vancouver Convention Center, 2023-6-18~2023-6-22
作者:  He, Jiawei;  Chen, Yuntao;  Wang, Naiyan;  Zhang, Zhaoxiang
Adobe PDF(1722Kb)  |  收藏  |  浏览/下载:41/16  |  提交时间:2024/06/18
Improving the Homophily of Heterophilic Graphs for Semi-Supervised Node Classification 会议论文
, Brisbane, Australia, 2023-7-10
作者:  Wang YH(王玉虎);  Xiang SM(向世明);  Pan CH(潘春洪)
Adobe PDF(1487Kb)  |  收藏  |  浏览/下载:64/21  |  提交时间:2024/06/17
graph neural networks  graph mining  homophily and heterophily  
Social Relation Reasoning Based on Triangular Constraints 会议论文
, 美国华盛顿, 2023年2月7日-14日
作者:  Guo, Yunfei;  Yin, Fei;  Feng, Wei;  Yan, Xudong;  Xue, Tao;  Mei, Shuqi;  Liu, Cheng-Lin
Adobe PDF(977Kb)  |  收藏  |  浏览/下载:62/23  |  提交时间:2024/06/13
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:44/10  |  提交时间:2024/06/05
Cooperative Object Transportation for Second-order Multi-robot Systems in Dynamic Environment 会议论文
Proceedings of the 42nd Chinese Control Conference, 天津, 2023-7-24
作者:  Cai, Qiang;  Ai, Xiaolin;  Liu, Tianqi;  Pu, zhiqiang
Adobe PDF(3418Kb)  |  收藏  |  浏览/下载:49/19  |  提交时间:2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:51/15  |  提交时间:2024/05/28
A Multi-modal Global Instance Tracking Benchmark (MGIT): Better Locating Target in Complex Spatio-temporal and causal Relationship 会议论文
, New Orleans, 2023-12
作者:  Shiyu, Hu;  Dailing, Zhang;  Meiqi, Wu;  Xiaokun, Feng;  Xuchen, Li;  Xin, Zhao;  Kaiqi, Huang
Adobe PDF(6215Kb)  |  收藏  |  浏览/下载:123/29  |  提交时间:2024/01/22
Learning to Build Reasoning Chains by Reliable Path Retrieval 会议论文
, 希腊罗德岛, 2023
作者:  Zhu MJ(朱敏郡);  Weng YX(翁诣轩);  He SZ(何世柱);  Liu K(刘康);  Zhao J(赵军)
Adobe PDF(994Kb)  |  收藏  |  浏览/下载:173/44  |  提交时间:2023/06/29