已选(0)清除
条数/页: 排序方式: |
| Lazy Agents: A New Perspective on Solving Sparse Reward Problem in Multi-agent Reinforcement Learning 期刊 创刊日期: 2018, 主办者: Liu BY(刘博寅) Adobe PDF(5797Kb)  |  收藏  |  浏览/下载:24/6  |  提交时间:2024/07/12 |
| QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12 作者: Liu BY(刘博寅) Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:22/3  |  提交时间:2024/07/12 |
| Learning State-Specific Action Masks for Reinforcement Learning 期刊论文 Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60 作者: Wang ZY(王梓薏); Li XR(李欣然); Sun LY(孙罗洋); Zhang HF(张海峰); Liu HL(刘华林); Jun Wang Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:42/18  |  提交时间:2024/07/05 reinforcement learning exploration efficiency space reduction |
| UNSUPERVISED LEARNING OF NEURAL SEMANTIC MAPPINGS WITH THE HUNGARIAN ALGORITHM FOR COMPOSITIONAL SEMANTICS 会议论文 , Seoul, South Korea, 2024-04 作者: Zhang X(张翔); He SZ(何世柱); Liu K(刘康); Zhao J(赵军) Adobe PDF(294Kb)  |  收藏  |  浏览/下载:48/22  |  提交时间:2024/06/27 |
| Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文 Machine Intelligence Research, 2023, 页码: 158 作者: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:23/11  |  提交时间:2024/06/25 |
| MoDE-CoTD: Chain-of-Thought Distillation for Complex Reasoning Tasks with Mixture of Decoupled LoRA-Experts 会议论文 , Torino (Italia), 2024.5.20 - 2024.5.25 作者: Xiang Li; Shizhu He; Jiayu Wu; Zhao Yang; Yao Xu; Yang Jun; Haifeng Liu; Kang Liu; Jun Zhao Adobe PDF(1062Kb)  |  收藏  |  浏览/下载:40/11  |  提交时间:2024/06/20 |
| CLDRNet: A Difference Refinement Network Based on Category Context Learning for Remote Sensing Image Change Detection 期刊论文 IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2024, 卷号: 17, 页码: 2133-2148 作者: Wan, Ling; Tian, Ye; Kang, Wenchao; Ma, Lei Adobe PDF(15230Kb)  |  收藏  |  浏览/下载:103/4  |  提交时间:2024/02/20 Feature extraction Task analysis Remote sensing Transformers Deep learning Semantics Support vector machines Category context learning (CCL) clustering learning (CL) difference map refinement (DMR) optical remote sensing image change detection (CD) |
| Large sequence models for sequential decision-making: a survey 期刊论文 FRONTIERS OF COMPUTER SCIENCE, 2023, 卷号: 17, 期号: 6, 页码: 18 作者: Wen, Muning; Lin, Runji; Wang, Hanjing; Yang, Yaodong; Wen, Ying; Mai, Luo; Wang, Jun; Zhang, Haifeng; Zhang, Weinan Adobe PDF(1351Kb)  |  收藏  |  浏览/下载:157/6  |  提交时间:2023/11/17 sequential decision-making sequence modeling the Transformer training system |
| UC-OWOD: Unknown-Classified Open World Object Detection 会议论文 , Tel Aviv, Israel, 2022-10 作者: Zhiheng Wu; Yue Lu; Xingyu Chen; Zhengxing Wu; Liwen Kang; Junzhi Yu Adobe PDF(2702Kb)  |  收藏  |  浏览/下载:134/26  |  提交时间:2023/06/29 |
| Multi-Granularity Pruning for Model Acceleration on Mobile Devices 会议论文 , 线上, 2022-07 作者: Zhao TL(赵天理); Zhang X(张希); Zhu WT(朱文涛); Wang JX(王家兴); Yang S(杨森); Liu J(刘季); Cheng J(程健) Adobe PDF(1919Kb)  |  收藏  |  浏览/下载:156/61  |  提交时间:2023/06/21 Deep Neural Networks Network Pruning Structured Pruning Non-structured Pruning Single Instruction Multiple Data |