CASIA OpenIR

浏览/检索结果: 共9条,第1-9条 帮助

  只显示已认领条目
已选(0)清除 条数/页:   排序方式:
Improve the efficiency of deep reinforcement learning through semantic exploration guided by natural language. 会议论文
, 北京华腾美居酒店, 2023-12-9
作者:  Zhourui Guo;  Meng Yao;  Yang Yu;  Qiyue Yin
Adobe PDF(2302Kb)  |  收藏  |  浏览/下载:23/8  |  提交时间:2024/06/03
Distributed Deep Reinforcement Learning: A Survey and a Multi-player Multi-agent Learning Toolbox 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 411-430
作者:  Qiyue Yin;  Tongtong Yu;  Shengqi Shen;  Jun Yang;  Meijing Zhao;  Wancheng Ni;  Kaiqi Huang;  Bin Liang;  Liang Wang
Adobe PDF(2923Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/05/23
Deep reinforcement learning, distributed machine learning, self-play, population-play, toolbox  
AI in Human-computer Gaming: Techniques, Challenges and Opportunities 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 299-317
作者:  Qi-Yue Yin;  Jun Yang;  Kai-Qi Huang;  Mei-Jing Zhao;  Wan-Cheng Ni;  Bin Liang;  Yan Huang;  Shu Wu;  Liang Wang
Adobe PDF(2608Kb)  |  收藏  |  浏览/下载:53/12  |  提交时间:2024/04/23
Human-computer gaming, AI, intelligent decision making, deep reinforcement learning, self-play  
Position Weighted Convolutional Neural Network for Unbalanced Children Caries Diagnosis 期刊论文
IEEE ACCESS, 2023, 卷号: 11, 页码: 77034-77044
作者:  Zhou, Xiaojie;  Feng, Xueou;  Li, Qingming;  Yin, Qiyue;  Yang, Jun;  Yu, Guoxia;  Shi, Qing
收藏  |  浏览/下载:117/0  |  提交时间:2023/11/17
Caries diagnosis  CNN  transformer  position embedding  panoramic radiograph  
Subspace-Aware Exploration for Sparse-Reward Multi-Agent Tasks 会议论文
, Washington DC, USA, 2023-2-7
作者:  Pei Xu;  Junge Zhang;  Qiyue Yin;  Chao Yu;  Yaodong Yang;  Kaiqi Huang
Adobe PDF(2037Kb)  |  收藏  |  浏览/下载:246/75  |  提交时间:2023/06/19
deep reinforcement learning  sparse reward  exploration  multi-agent  
Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning 会议论文
, 意大利, 2022-07
作者:  Yang GK(杨光开);  Chenhao(陈皓);  Junge Zhang(张俊格);  Qiyue Yin(尹奇跃);  Kaiqi Huang(黄凯奇)
Adobe PDF(2924Kb)  |  收藏  |  浏览/下载:290/62  |  提交时间:2022/07/12
基于不确定度的多智能体信用分配方法 期刊论文
中国科学院大学学报, 2022, 页码: 0
作者:  杨光开;  陈皓;  张茗奕;  尹奇跃;  黄凯奇
Adobe PDF(1076Kb)  |  收藏  |  浏览/下载:519/89  |  提交时间:2022/07/12
面向Ad-Hoc协作的局部观测重建方法 期刊论文
中国科学院大学学报, 2022, 页码: 1
作者:  陈皓;  杨立昆;  尹奇跃;  黄凯奇
Adobe PDF(1491Kb)  |  收藏  |  浏览/下载:250/51  |  提交时间:2022/06/16
多智能体  深度强化学习  信用分配  Ad-Hoc协作  
The human mediodorsal thalamus: Organization, connectivity, and function 期刊论文
NEUROIMAGE, 2022, 卷号: 249, 页码: 10
作者:  Li, Kaixin;  Fan, Lingzhong;  Cui, Yue;  Wei, Xuehu;  He, Yini;  Yang, Jiyue;  Lu, Yuheng;  Li, Wen;  Shi, Weiyang;  Cao, Long;  Cheng, Luqi;  Li, Ang;  You, Bo;  Jiang, Tianzi
Adobe PDF(2751Kb)  |  收藏  |  浏览/下载:257/3  |  提交时间:2022/06/06
Mediodorsal thalamic nucleus  Parcellation  Anatomical organization  Functional connectivity  Cognitive functions