CASIA OpenIR

Browse/Search results: 15 items in total, showing items 1-10

Learning State-Specific Action Masks for Reinforcement Learning (journal article)
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF(2976Kb) | Views/Downloads: 33/15 | Submitted: 2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb) | Views/Downloads: 19/9 | Submitted: 2024/06/25
Alignment Rationale for Natural Language Inference (conference paper)
Online, 2021-8-1
Authors: Zhongtao Jiang; Yuanzhe Zhang; Zhao Yang; Jun Zhao; Kang Liu
Adobe PDF(1280Kb) | Views/Downloads: 41/14 | Submitted: 2024/06/06
Multi-objective Deep Reinforcement Learning for Mobile Edge Computing (conference paper)
Singapore, 2023/8/24-27
Authors: Yang, Ning; Wen, Junrui; Zhang, Meng; Tang, Ming
Adobe PDF(499Kb) | Views/Downloads: 51/18 | Submitted: 2024/06/05
mobile edge computing  multi-objective reinforcement learning  resource scheduling  
Explanation Guided Knowledge Distillation for Pre-trained Language Model Compression (journal article)
ACM Transactions on Asian and Low-Resource Language Information Processing, 2024, Volume: 23, Issue: 2, Pages: 1-19
Authors: Zhao Yang; Yuanzhe Zhang; Dianbo Sui; Yiming Ju; Jun Zhao; Kang Liu
Adobe PDF(1250Kb) | Views/Downloads: 54/20 | Submitted: 2024/05/30
Explanation  knowledge distillation  model compression  
Consensus Learning for Cooperative Multi-Agent Reinforcement Learning (conference paper)
Washington, DC, USA, February 7-14, 2023
Authors: Zhiwei Xu; Bin Zhang; Dapeng Li; Zeren Zhang; Guangchong Zhou; Hao Chen; Guoliang Fan
Adobe PDF(4141Kb) | Views/Downloads: 45/18 | Submitted: 2024/05/28
Large sequence models for sequential decision-making: a survey (journal article)
FRONTIERS OF COMPUTER SCIENCE, 2023, Volume: 17, Issue: 6, Pages: 18
Authors: Wen, Muning; Lin, Runji; Wang, Hanjing; Yang, Yaodong; Wen, Ying; Mai, Luo; Wang, Jun; Zhang, Haifeng; Zhang, Weinan
Adobe PDF(1351Kb) | Views/Downloads: 151/5 | Submitted: 2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
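Deep Learning Model Compression and Acceleration Based on Software-Hardware Co-Design (基于软硬件协同设计的深度学习模型压缩与加速) (thesis)
2023
Author: 刘泽健
Adobe PDF(10064Kb) | Views/Downloads: 106/5 | Submitted: 2023/06/18
software-hardware co-design  model compression  DNN accelerator  automated optimization  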
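Design and Analysis of a Joint Self-Aligning Index Finger Exoskeleton Based on Series Elastic Actuators (基于串联弹性驱动器的关节自对准食指外骨骼设计与分析) (thesis)
Beijing: University of Chinese Academy of Sciences, 2022
Author: 孙宁
Adobe PDF(10788Kb) | Views/Downloads: 330/18 | Submitted: 2022/08/26
index finger exoskeleton, joint self-alignment mechanism, series elastic actuator, exoskeleton-phalanx connection device  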
A Brain-Inspired Approach for Probabilistic Estimation and Efficient Planning in Precision Physical Interaction (journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2022, Pages: 15
Authors: Xing, Dengpeng; Yang, Yiming; Zhang, Tielin; Xu, Bo
Adobe PDF(2960Kb) | Views/Downloads: 239/17 | Submitted: 2022/06/10
Task analysis  Robots  Force  Planning  Mathematical models  Brain modeling  Biology  Brain-inspired structure  precision physical interaction  spiking neural networks (SNNs)