CASIA OpenIR

Browse/Search results: 20 items in total, showing items 1-10

An Empirical Study on Google Research Football Multi-agent Scenarios (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 549-570
Authors:  Yan Song;  He Jiang;  Zheng Tian;  Haifeng Zhang;  Yingping Zhang;  Jiangcheng Zhu;  Zonghong Dai;  Weinan Zhang;  Jun Wang
Adobe PDF(24588Kb)  |  Views/Downloads: 30/9  |  Submitted: 2024/05/23
Multi-agent reinforcement learning (RL), distributed RL system, population-based training, reward shaping, game theory  
Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 3, Pages: 411-430
Authors:  Qiyue Yin;  Tongtong Yu;  Shengqi Shen;  Jun Yang;  Meijing Zhao;  Wancheng Ni;  Kaiqi Huang;  Bin Liang;  Liang Wang
Adobe PDF(2923Kb)  |  Views/Downloads: 24/10  |  Submitted: 2024/05/23
Deep reinforcement learning, distributed machine learning, self-play, population-play, toolbox  
The Life Cycle of Knowledge in Big Language Models: A Survey (Journal article)
Machine Intelligence Research, 2024, Volume: 21, Issue: 2, Pages: 217-238
Authors:  Boxi Cao;  Hongyu Lin;  Xianpei Han;  Le Sun
Adobe PDF(1430Kb)  |  Views/Downloads: 31/4  |  Submitted: 2024/04/23
Pre-trained language model, knowledge acquisition, knowledge representation, knowledge probing, knowledge editing, knowledge application  
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 4, Pages: 569-582
Authors:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  Views/Downloads: 30/9  |  Submitted: 2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
Dynamic Movement Primitives Based Robot Skills Learning (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 396-407
Authors:  Ling-Huan Kong;  Wei He;  Wen-Shi Chen;  Hui Zhang;  Yao-Nan Wang
Adobe PDF(3181Kb)  |  Views/Downloads: 37/10  |  Submitted: 2024/04/23
Dynamic movement primitives (DMPs), trajectory tracking control, robot learning from demonstrations, neural networks (NNs), adaptive control  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 3, Pages: 318-334
Authors:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  Views/Downloads: 9/2  |  Submitted: 2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
Offline Pre-trained Multi-agent Decision Transformer (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 2, Pages: 233-248
Authors:  Linghui Meng;  Muning Wen;  Chenyang Le;  Xiyun Li;  Dengpeng Xing;  Weinan Zhang;  Ying Wen;  Haifeng Zhang;  Jun Wang;  Yaodong Yang;  Bo Xu
Adobe PDF(2121Kb)  |  Views/Downloads: 36/11  |  Submitted: 2024/04/23
Pre-training model, multi-agent reinforcement learning (MARL), decision making, transformer, offline reinforcement learning
VLP: A Survey on Vision-language Pre-training (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 1, Pages: 38-56
Authors:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  Views/Downloads: 30/8  |  Submitted: 2024/04/23
Vision and language, pre-training, transformers, multimodal learning, representation learning
Brain-inspired Intelligent Robotics: Theoretical Analysis and Systematic Application (Journal article)
Machine Intelligence Research, 2023, Volume: 20, Issue: 1, Pages: 1-18
Authors:  Hong Qiao;  Ya-Xiong Wu;  Shan-Lin Zhong;  Pei-Jie Yin;  Jia-Hao Chen
Adobe PDF(2207Kb)  |  Views/Downloads: 44/10  |  Submitted: 2024/04/23
Brain-inspired intelligent robot, software and hardware, decision making, muscle control, cognitive intelligence
From Teleoperation to Autonomous Robot-assisted Microsurgery: A Survey (Journal article)
Machine Intelligence Research, 2022, Volume: 19, Issue: 4, Pages: 288-306
Authors:  Dandan Zhang;  Weiyong Si;  Wen Fan;  Yuan Guan;  Chenguang Yang
Adobe PDF(1635Kb)  |  Views/Downloads: 29/8  |  Submitted: 2024/04/23
Robot-assisted microsurgery (RAMS), imaging and sensing, teleoperation, cooperative control, robot learning