CASIA OpenIR

Browse/Search results: 22 items in total, showing items 1-10

Learning State-Specific Action Masks for Reinforcement Learning [Journal article]
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF (2976 KB) | Views/Downloads: 53/24 | Submitted: 2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Token-level Direct Preference Optimization [Conference paper]
Vienna, Austria, 2024/7/21-27
Authors: Zeng, Yongcheng; Liu, Guoqing; Ma, Weiyu; Yang, Ning; Zhang, Haifeng; Wang, Jun
Adobe PDF (883 KB) | Views/Downloads: 82/27 | Submitted: 2024/06/05
Joint caching and transmission in the mobile edge network: An multi-agent learning approach [Conference paper]
Madrid, Spain, 2021-12-7
Authors: Mi, Qirui; Yang, Ning; Zhang, Haifeng; Zhang, Haijun; Wang, Jun
Adobe PDF (1724 KB) | Views/Downloads: 53/14 | Submitted: 2024/06/05
Minimizing Age of Information for Mobile Edge Computing Systems: A Nested Index Approach [Conference paper]
Singapore, 2023/8/24-27
Authors: Chen, Shuo; Yang, Ning; Zhang, Meng; Wang, Jun
Adobe PDF (1413 KB) | Views/Downloads: 63/14 | Submitted: 2024/06/05
Representative Demonstration Selection for In-Context Learning with Two-Stage Determinantal Point Process [Conference paper]
Singapore, 2023-12
Authors: Zhao Yang; Yuanzhe Zhang; Dianbo Sui; Cao Liu; Jun Zhao; Kang Liu
Adobe PDF (592 KB) | Views/Downloads: 63/27 | Submitted: 2024/05/30
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks [Conference paper]
Washington DC, USA, 2023-2-7
Authors: Pei Xu; Junge Zhang; Qiyue Yin; Chao Yu; Yaodong Yang; Kaiqi Huang
Adobe PDF (2037 KB) | Views/Downloads: 271/82 | Submitted: 2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent  
Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning [Conference paper]
Italy, 2022-07
Authors: Yang GK(杨光开); Chenhao(陈皓); Junge Zhang(张俊格); Qiyue Yin(尹奇跃); Kaiqi Huang(黄凯奇)
Adobe PDF (2924 KB) | Views/Downloads: 307/67 | Submitted: 2022/07/12
2-DOF camera stabilization platform for robotic fish based on active disturbance rejection control [Conference paper]
Suzhou, China, 2019-7
Authors: Pengfei Zhang; Zhengxing Wu; Jian Wang; Min Tan; Junzhi Yu
Adobe PDF (670 KB) | Views/Downloads: 168/47 | Submitted: 2022/06/27
An open-source, fiducial-based, underwater stereo visual-inertial localization method with refraction correction [Conference paper]
Prague, Czech Republic, 2021.09
Authors: Pengfei Zhang; Zhengxing Wu; Jian Wang; Shihan Kong; Min Tan; Junzhi Yu
Adobe PDF (657 KB) | Views/Downloads: 316/59 | Submitted: 2022/06/27
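An Opponent Modeling and Strategy Integration Framework for Texas Hold'em AI (一种针对德州扑克AI的对手建模与策略集成框架) [Journal article]
Acta Automatica Sinica (自动化学报), 2021, Issue: 0, Pages: 0
Authors: Zhang Meng(张蒙); Li Kai(李凯); Wu Zhe(吴哲); Zang Yifan(臧一凡); Xu Hang(徐航); Xing Junliang(兴军亮)
Adobe PDF (1354 KB) | Views/Downloads: 451/127 | Submitted: 2021/06/21
imperfect-information games  Texas Hold'em  evolutionary learning  online opponent modeling  population strategy integration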