CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:7/4  |  提交时间:2024/06/25
强化学习,分层强化学习  
ZoomTrack: Target-aware Non-uniform Resizing for Efficient Visual Tracking 会议论文
, New Orleans, United States, Sunday Dec 10 through Saturday Dec 16
作者:  Kou, Yutong;  Gao, Jin;  Li, Bing;  Wang, Gang;  Hu, Weiming;  Wang, Yizheng;  Li, Liang
Adobe PDF(2115Kb)  |  收藏  |  浏览/下载:23/7  |  提交时间:2024/06/21
Embed Trajectory Imitation in Reinforcement Learning: A Hybrid Method for Autonomous Vehicle Planning 会议论文
/, Orlando, FL, USA, 2023-11
作者:  Wang, Yuxiao;  Dai, Xingyuan;  Wang, Kara;  Ali, Hub;  Zhu, Fenghua
Adobe PDF(1410Kb)  |  收藏  |  浏览/下载:18/4  |  提交时间:2024/06/11
Imitation Learning  Trajectory Planning  Deep Reinforcement Learning  Autonomous Driving  
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach 会议论文
, Singapore, 2023/8/24-27
作者:  Chen,Shuo;  Yang,Ning;  Zhang,Meng;  Wang,Jun
Adobe PDF(1413Kb)  |  收藏  |  浏览/下载:33/8  |  提交时间:2024/06/05
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing 会议论文
, Singapore, 2023/8/24-27
作者:  Yang,Ning;  Wen,Junrui;  Zhang,Meng;  Tang,Ming
Adobe PDF(499Kb)  |  收藏  |  浏览/下载:32/12  |  提交时间:2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Cooperative Object Transportation for Second-order Multi-robot Systems in Dynamic Environment 会议论文
Proceedings of the 42nd Chinese Control Conference, 天津, 2023-7-24
作者:  Cai, Qiang;  Ai, Xiaolin;  Liu, Tianqi;  Pu, zhiqiang
Adobe PDF(3418Kb)  |  收藏  |  浏览/下载:30/9  |  提交时间:2024/05/28
PiCor: Multi-Task Deep Reinforcement Learning with Policy Correction 会议论文
Proceedings of the AAAI Conference on Artificial Intelligence, 美国 华盛顿, 2023.02.07 - 2023.02.14
作者:  Bai FS(白丰硕);  Zhang HM(张鸿铭);  Tao TY(陶天阳);  Wu ZH(武志亨);  Wang YN(王燕娜);  Xu B(徐博)
Adobe PDF(1663Kb)  |  收藏  |  浏览/下载:190/42  |  提交时间:2023/07/05
Reinforcement Learning Algorithms  Transfer  Domain Adaptation  Multi-Task Learning  
Pseudo Value Network Distillation for High-Performance Exploration 会议论文
, 澳大利亚, 2023-06
作者:  Zhao EM(赵恩民);  Xing JL(兴军亮);  Li K(李凯);  Kang YX(康永欣);  Tao P(陶品)
Adobe PDF(5874Kb)  |  收藏  |  浏览/下载:150/43  |  提交时间:2023/06/28
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文
Machine Intelligence Research, 2023, 页码: 1-12
作者:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:192/69  |  提交时间:2023/06/26
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Feilong Chen;  Duzhen Zhang;  Minglun Han;  Xiuyi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(969Kb)  |  收藏  |  浏览/下载:161/32  |  提交时间:2023/06/21