已选(0)清除
条数/页: 排序方式: |
| Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文 , 澳大利亚, 2023-6 作者: Zhang Qingyang; Yang Yiming; Ruan Jingqing; Xiong Xuantang; Xing Dengpeng; Xu Bo Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:44/17  |  提交时间:2024/06/25 强化学习,分层强化学习 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:26/12  |  提交时间:2024/06/25 |
| Take a Closer Look at Multilinguality! Improve Multilingual Pre-Training Using Monolingual Corpora Only 会议论文 , Singapore, December 6-10, 2023 作者: Lu JL(陆金梁); Zhang JJ(张家俊) Adobe PDF(1097Kb)  |  收藏  |  浏览/下载:68/25  |  提交时间:2024/06/13 |
| Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文 , Gold Coast, Australia, 2023-6 作者: Qingxu Fu; Tenghai Qiu; Zhiqiang Pu; Jianqiang Yi; Xiaolin Ai; Wanmai Yuan Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:50/13  |  提交时间:2024/06/05 |
| Progressive Direction-Aware Pose Grammar for Human Pose Estimation 期刊论文 IEEE TRANSACTIONS ON BIOMETRICS, BEHAVIOR, AND IDENTITY SCIENCE, 2023, 卷号: 5, 期号: 4, 页码: 593-605 作者: Zhou Lu; Chen Yingying; Wang Jinqiao Adobe PDF(3192Kb)  |  收藏  |  浏览/下载:55/27  |  提交时间:2024/06/03 |
| SCOOT: Self-supervised Centric Open-set Object Tracking 会议论文 , Sydney, Australia, 2023-12-12-2023-12-15 作者: Li W(李巍); Meng WL(孟维亮); Li BW(李博文); Zhang JG(张吉光); Zhang XP(张晓鹏) Adobe PDF(2792Kb)  |  收藏  |  浏览/下载:51/18  |  提交时间:2024/06/03 |
| SECAD-Net: Self-Supervised CAD Reconstruction by Learning Sketch-Extrude Operations 会议论文 , Vancouver, BC, Canada, 2023-6-17至2023-6-24 作者: Li, Pu; Guo, Jianwei; Zhang, Xiaopeng; Yan, Dong-Ming Adobe PDF(9384Kb)  |  收藏  |  浏览/下载:48/9  |  提交时间:2024/06/03 |
| Explainable Reinforcement Learning via a Causal World Model 会议论文 Proceedings of the 32nd International Joint Conference on Artificial Intelligence, 中国澳门, 2023-08-22 作者: Yu ZY(余忠蔚); Ruan JQ(阮景晴); Xing DP(邢登鹏) Adobe PDF(850Kb)  |  收藏  |  浏览/下载:60/29  |  提交时间:2024/05/28 强化学习 可解释人工智能 因果推理 |
| Efficient Hierarchical Reinforcement Learning via Mutual Information Constrained Subgoal Discovery 会议论文 , 长沙, 2023-11 作者: Kaishen Wang; Jingqing Ruan; Qingyang Zhang; Dengpeng Xing Adobe PDF(2044Kb)  |  收藏  |  浏览/下载:51/27  |  提交时间:2024/05/28 |
| Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文 , New Orleans, LA, USA, December 10-16, 2023 作者: Zhiwei Xu; Bin Zhang; Dapeng Li; Guangchong Zhou; Zeren Zhang; Guoliang Fan Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:58/16  |  提交时间:2024/05/28 |