CASIA OpenIR

Browse/Search results: 11 items in total, showing items 1-10

Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning (Journal article)
Machine Intelligence Research, 2023, Pages: 158
Authors: Zhang Qingyang; Zhang Hongming; Xing Dengpeng; Bo Xu
Adobe PDF (9639 KB) | Views/Downloads: 10/5 | Submitted: 2024/06/25
Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 411-430
Authors: Qiyue Yin; Tongtong Yu; Shengqi Shen; Jun Yang; Meijing Zhao; Wancheng Ni; Kaiqi Huang; Bin Liang; Liang Wang
Adobe PDF (2923 KB) | Views/Downloads: 35/13 | Submitted: 2024/05/23
Deep reinforcement learning, distributed machine learning, self-play, population-play, toolbox  
Effective Model Compression via Stage-wise Pruning (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 6, Pages: 937-951
Authors: Ming-Yang Zhang; Xin-Yi Yu; Lin-Lin Ou
Adobe PDF (2394 KB) | Views/Downloads: 19/8 | Submitted: 2024/04/23
Automated machine learning (AutoML), channel pruning, model compression, distillation, convolutional neural networks (CNN)  
Masked Vision-language Transformer in Fashion (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 421-434
Authors: Ge-Peng Ji; Mingchen Zhuge; Dehong Gao; Deng-Ping Fan; Christos Sakaridis; Luc Van Gool
Adobe PDF (2779 KB) | Views/Downloads: 19/5 | Submitted: 2024/04/23
Vision-language, masked image reconstruction, transformer, fashion, e-commercial  
Dynamic Movement Primitives Based Robot Skills Learning (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 396-407
Authors: Ling-Huan Kong; Wei He; Wen-Shi Chen; Hui Zhang; Yao-Nan Wang
Adobe PDF (3181 KB) | Views/Downloads: 50/13 | Submitted: 2024/04/23
Dynamic movement primitives (DMPs), trajectory tracking control, robot learning from demonstrations, neural networks (NNs), adaptive control  
A Survey on Collaborative DNN Inference for Edge Intelligence (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 370-395
Authors: Wei-Qing Ren; Yu-Ben Qu; Chao Dong; Yu-Qian Jing; Hao Sun; Qi-Hui Wu; Song Guo
Adobe PDF (2969 KB) | Views/Downloads: 38/11 | Submitted: 2024/04/23
Artificial intelligence (AI), edge intelligence (EI), distributed computing, deep neural network (DNN), collaborative inference  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 318-334
Authors: Wai-Chung Kwan; Hong-Ru Wang; Hui-Min Wang; Kam-Fai Wong
Adobe PDF (2211 KB) | Views/Downloads: 12/4 | Submitted: 2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
AI in Human-computer Gaming: Techniques, Challenges and Opportunities (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 299-317
Authors: Qi-Yue Yin; Jun Yang; Kai-Qi Huang; Mei-Jing Zhao; Wan-Cheng Ni; Bin Liang; Yan Huang; Shu Wu; Liang Wang
Adobe PDF (2608 KB) | Views/Downloads: 46/9 | Submitted: 2024/04/23
Human-computer gaming, AI, intelligent decision making, deep reinforcement learning, self-play  
Offline Pre-trained Multi-agent Decision Transformer (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 233-248
Authors: Linghui Meng; Muning Wen; Chenyang Le; Xiyun Li; Dengpeng Xing; Weinan Zhang; Ying Wen; Haifeng Zhang; Jun Wang; Yaodong Yang; Bo Xu
Adobe PDF (2121 KB) | Views/Downloads: 46/12 | Submitted: 2024/04/23
Pre-training model, multi-agent reinforcement learning (MARL), decision making, transformer, offline reinforcement learning
Continuous-time Distributed Heavy-ball Algorithm for Distributed Convex Optimization over Undirected and Directed Graphs (Journal article)
Machine Intelligence Research, 2022, Volume: 19, Issue: 1, Pages: 75-88
Authors: Hao-Ran Yang; Wei Ni
Adobe PDF (1378 KB) | Views/Downloads: 33/9 | Submitted: 2024/04/23
Distributed convex optimization, second-order distributed algorithm, multi-agent systems, gradient tracking, directed graph