CASIA OpenIR

Browse/Search results: 87 records in total, showing records 1-10

Learning State-Specific Action Masks for Reinforcement Learning (Journal article)
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF (2976Kb) | Views/Downloads: 36/15 | Submitted: 2024/07/05
Keywords: reinforcement learning; exploration efficiency; space reduction
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF (9639Kb) | Views/Downloads: 21/9 | Submitted: 2024/06/25
Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking (Conference paper)
Virtual, United States, 2020-06-14 to 2020-06-19
Authors: Gao, Jin; Hu, Weiming; Lu, Yan
Adobe PDF (468Kb) | Views/Downloads: 51/18 | Submitted: 2024/06/21
Self-Modifying State Modeling for Simultaneous Machine Translation (Conference paper)
Bangkok, Thailand, August 11–16, 2024
Authors: Donglei, Yu; Xiaomian, Kang; Yuchen, Liu; YU, Zhou; Chengqing, Zong
Adobe PDF (924Kb) | Views/Downloads: 27/13 | Submitted: 2024/06/20
3D Video Object Detection with Learnable Object-Centric Global Optimization (Conference paper)
Vancouver Convention Center, 2023-6-18~2023-6-22
Authors: He, Jiawei; Chen, Yuntao; Wang, Naiyan; Zhang, Zhaoxiang
Adobe PDF (1722Kb) | Views/Downloads: 47/19 | Submitted: 2024/06/18
Bridging the Gap between Different Vocabularies for LLM Ensemble (Conference paper)
Mexico City, Mexico, June 16–21, 2024
Authors: 徐杨一帆; Lu JL(陆金梁); Zhang JJ(张家俊)
Adobe PDF (1982Kb) | Views/Downloads: 66/21 | Submitted: 2024/06/13
3D Particle Picking in Cryo-Electron Tomograms Using Instance Segmentation (Conference paper)
Bordeaux, France, 16-19 October 2022
Authors: Guole Liu; Yaoru Luo; Ge Yang
Adobe PDF (2356Kb) | Views/Downloads: 46/21 | Submitted: 2024/06/11
Are Conventional SNNs Really Efficient? A Perspective from Network Quantization (Conference paper)
Seattle WA, USA, 2024-06-20
Authors: Shen, Guobin; Zhao, Dongcheng; Li, Tenglong; Li, Jindong; Zeng, Yi
Adobe PDF (587Kb) | Views/Downloads: 54/16 | Submitted: 2024/06/05
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach (Conference paper)
Singapore, 2023/8/24-27
Authors: Chen, Shuo; Yang, Ning; Zhang, Meng; Wang, Jun
Adobe PDF (1413Kb) | Views/Downloads: 51/11 | Submitted: 2024/06/05
Concentration Network for Reinforcement Learning of Large-Scale Multi-Agent Systems (Conference paper)
Online, 2022
Authors: Qingxu Fu; Tenghai Qiu; Jianqiang Yi; Zhiqiang Pu; Shiguang Wu
Adobe PDF (5807Kb) | Views/Downloads: 40/14 | Submitted: 2024/06/05