CASIA OpenIR

Browse/Search results: 115 items in total, items 1-10

Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning Journal
Founding date: 2018,
Sponsor: Liu BY(刘博寅)
Adobe PDF(5797Kb)  |  Views/Downloads: 24/6  |  Submitted: 2024/07/12
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning Journal article
IEEE Transactions on Cognitive and Developmental Systems, 2024, Pages: 12
Authors: Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  Views/Downloads: 22/3  |  Submitted: 2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning Journal article
Algorithms, 2024, Volume: 17, Issue: 2, Pages: 60
Authors: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang
Adobe PDF(2976Kb)  |  Views/Downloads: 42/18  |  Submitted: 2024/07/05
reinforcement learning  exploration efficiency  space reduction  
UNSUPERVISED LEARNING OF NEURAL SEMANTIC MAPPINGS WITH THE HUNGARIAN ALGORITHM FOR COMPOSITIONAL SEMANTICS Conference paper
, Seoul, South Korea, 2024-04
Authors: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军)
Adobe PDF(294Kb)  |  Views/Downloads: 48/22  |  Submitted: 2024/06/27
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning Journal article
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF(9639Kb)  |  Views/Downloads: 23/11  |  Submitted: 2024/06/25
MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts Conference paper
, Torino (Italia), 2024.5.20 - 2024.5.25
Authors: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao
Adobe PDF(1062Kb)  |  Views/Downloads: 40/11  |  Submitted: 2024/06/20
CLDRNet: A Difference Refinement Network Based on Category Context Learning for Remote Sensing Image Change Detection Journal article
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, Volume: 17, Pages: 2133-2148
Authors: Wan, Ling; Tian, Ye; Kang, Wenchao; Ma, Lei
Adobe PDF(15230Kb)  |  Views/Downloads: 103/4  |  Submitted: 2024/02/20
Feature extraction  Task analysis  Remote sensing  Transformers  Deep learning  Semantics  Support vector machines  Category context learning (CCL)  clustering learning (CL)  difference map refinement (DMR)  optical remote sensing image  change detection (CD)  
Large sequence models for sequential decision-making: a survey Journal article
FRONTIERS OF COMPUTER SCIENCE, 2023, Volume: 17, Issue: 6, Pages: 18
Authors: Wen, Muning; Lin, Runji; Wang, Hanjing; Yang, Yaodong; Wen, Ying; Mai, Luo; Wang, Jun; Zhang, Haifeng; Zhang, Weinan
Adobe PDF(1351Kb)  |  Views/Downloads: 157/6  |  Submitted: 2023/11/17
sequential decision-making  sequence modeling  the Transformer  training system  
UC-OWOD: Unknown-Classified Open World Object Detection Conference paper
, Tel Aviv, Israel, 2022-10
Authors: Zhiheng Wu; Yue Lu; Xingyu Chen; Zhengxing Wu; Liwen Kang; Junzhi Yu
Adobe PDF(2702Kb)  |  Views/Downloads: 134/26  |  Submitted: 2023/06/29
Multi-Granularity Pruning for Model Acceleration on Mobile Devices Conference paper
, Online, 2022-07
Authors: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健)
Adobe PDF(1919Kb)  |  Views/Downloads: 156/61  |  Submitted: 2023/06/21
Deep Neural Networks  Network Pruning  Structured Pruning  Non-structured Pruning  Single Instruction Multiple Data