CASIA OpenIR

Browse/search results: 9 items in total, showing items 1-9

Deep Reinforcement Learning With Part-Aware Exploration Bonus in Video Games [Journal article]
IEEE TRANSACTIONS ON GAMES, 2022, Volume: 14, Issue: 4, Pages: 644-653
Authors:  Xu, Pei;  Yin, Qiyue;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(1480Kb)  |  Views/Downloads: 284/70  |  Submitted: 2023/02/22
Deep learning  exploration  reinforcement learning  video game  
Offline reinforcement learning with representations for actions [Journal article]
INFORMATION SCIENCES, 2022, Volume: 610, Pages: 746-758
Authors:  Lou, Xingzhou;  Yin, Qiyue;  Zhang, Junge;  Yu, Chao;  He, Zhaofeng;  Cheng, Nengjie;  Huang, Kaiqi
Views/Downloads: 157/0  |  Submitted: 2022/11/14
Offline reinforcement learning  Action embedding  
Multi-Agent Uncertainty Sharing for Cooperative Multi-Agent Reinforcement Learning [Conference paper]
Italy, 2022-07
Authors:  Yang GK (杨光开);  Chenhao (陈皓);  Junge Zhang (张俊格);  Qiyue Yin (尹奇跃);  Kaiqi Huang (黄凯奇)
Adobe PDF(2924Kb)  |  Views/Downloads: 229/49  |  Submitted: 2022/07/12
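Research on Agent Policy Generalization in Adversarial Scenarios (对抗场景中的智能体策略泛化研究) [Thesis]
Master of Engineering, Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2022
Author:  Chen Hao (陈皓)
Adobe PDF(13782Kb)  |  Views/Downloads: 293/14  |  Submitted: 2022/06/16
Deep reinforcement learning  multi-agent  policy generalization  ad hoc cooperation  credit assignment  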
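Research on Multi-Agent Cooperation Algorithms Based on the Value Decomposition Framework in Adversarial Environments (对抗环境中基于值分解框架的多智能体协同算法研究) [Thesis]
Master of Engineering, Institute of Automation, Chinese Academy of Sciences: Institute of Automation, Chinese Academy of Sciences, 2022
Author:  Yang Guangkai (杨光开)
Adobe PDF(17847Kb)  |  Views/Downloads: 217/7  |  Submitted: 2022/06/13
Multi-agent cooperation  credit assignment  Bayesian hypernetwork  partial observability constraint  Bayesian neural network  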
Learning to Learn Cropping Models for Different Aspect Ratio Requirements [Conference paper]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, Virtual, June 14-19, 2020
Authors:  Li, Debang;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(1065Kb)  |  Views/Downloads: 183/59  |  Submitted: 2021/05/31
Composing Good Shots by Exploiting Mutual Relations [Conference paper]
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2020, Virtual, June 14-19, 2020
Authors:  Li, Debang;  Zhang, Junge;  Huang, Kaiqi;  Yang, Ming-Hsuan
Adobe PDF(628Kb)  |  Views/Downloads: 157/39  |  Submitted: 2021/05/31
Universal adversarial perturbations against object detection [Journal article]
PATTERN RECOGNITION, 2021, Volume: 110, Issue: none, Pages: 107584
Authors:  Li, Debang;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(4553Kb)  |  Views/Downloads: 294/35  |  Submitted: 2021/01/06
Adversarial examples  Object detection  Universal adversarial perturbation  
Fast A3RL: Aesthetics-Aware Adversarial Reinforcement Learning for Image Cropping [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, Volume: 28, Issue: 10, Pages: 5105-5120
Authors:  Li, Debang;  Wu, Huikai;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(6588Kb)  |  Views/Downloads: 364/40  |  Submitted: 2019/12/16
Reinforcement learning  adversarial learning  image cropping