CASIA OpenIR

Browse/Search Results: 14 items in total, showing items 1-10

Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs [Conference paper]
Australia, 2023-6
Authors:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF (7948Kb)  |  Views/Downloads: 44/17  |  Submitted: 2024/06/25
Keywords: reinforcement learning, hierarchical reinforcement learning
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning [Journal article]
Machine Intelligence Research, 2023, Pages: 158
Authors:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF (9639Kb)  |  Views/Downloads: 26/12  |  Submitted: 2024/06/25
Take a Closer Look at Multilinguality! Improve Multilingual Pre-Training Using Monolingual Corpora Only [Conference paper]
Singapore, December 6-10, 2023
Authors:  Lu JL(陆金梁);  Zhang JJ(张家俊)
Adobe PDF (1097Kb)  |  Views/Downloads: 68/25  |  Submitted: 2024/06/13
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning [Conference paper]
Gold Coast, Australia, 2023-6
Authors:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF (25675Kb)  |  Views/Downloads: 50/13  |  Submitted: 2024/06/05
Progressive Direction-Aware Pose Grammar for Human Pose Estimation [Journal article]
IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2023, Volume: 5, Issue: 4, Pages: 593-605
Authors:  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF (3192Kb)  |  Views/Downloads: 55/27  |  Submitted: 2024/06/03
SCOOT: Self-supervised Centric Open-set Object Tracking [Conference paper]
Sydney, Australia, 2023-12-12 to 2023-12-15
Authors:  Li W(李巍);  Meng WL(孟维亮);  Li BW(李博文);  Zhang JG(张吉光);  Zhang XP(张晓鹏)
Adobe PDF (2792Kb)  |  Views/Downloads: 51/18  |  Submitted: 2024/06/03
SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations [Conference paper]
Vancouver, BC, Canada, 2023-6-17 to 2023-6-24
Authors:  Li, Pu;  Guo, Jianwei;  Zhang, Xiaopeng;  Yan, Dong-Ming
Adobe PDF (9384Kb)  |  Views/Downloads: 48/9  |  Submitted: 2024/06/03
Explainable Reinforcement Learning via a Causal World Model [Conference paper]
Proceedings of the 32nd International Joint Conference on Artificial Intelligence, Macao, China, 2023-08-22
Authors:  Yu ZY(余忠蔚);  Ruan JQ(阮景晴);  Xing DP(邢登鹏)
Adobe PDF (850Kb)  |  Views/Downloads: 60/29  |  Submitted: 2024/05/28
Keywords: reinforcement learning, explainable artificial intelligence, causal inference
Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery [Conference paper]
Changsha, 2023-11
Authors:  Kaishen Wang;  Jingqing Ruan;  Qingyang Zhang;  Dengpeng Xing
Adobe PDF (2044Kb)  |  Views/Downloads: 51/27  |  Submitted: 2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL [Conference paper]
New Orleans, LA, USA, December 10-16, 2023
Authors:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF (8700Kb)  |  Views/Downloads: 58/16  |  Submitted: 2024/05/28