CASIA OpenIR

浏览/检索结果: 共37条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Deep Reinforcement Learning With Part-Aware Exploration Bonus in Video Games 期刊论文
IEEE TRANSACTIONS ON GAMES, 2022, 卷号: 14, 期号: 4, 页码: 644-653
作者:  Xu, Pei;  Yin, Qiyue;  Zhang, Junge;  Huang, Kaiqi
Adobe PDF(1480Kb)  |  收藏  |  浏览/下载:300/74  |  提交时间:2023/02/22
Deep learning  exploration  reinforcement learning  video game  
Dual-discriminator adversarial framework for data-free quantization 期刊论文
NEUROCOMPUTING, 2022, 卷号: 511, 页码: 67-77
作者:  Li, Zhikai;  Ma, Liping;  Long, Xianlei;  Xiao, Junrui;  Gu, Qingyi
Adobe PDF(1512Kb)  |  收藏  |  浏览/下载:310/67  |  提交时间:2022/11/21
Model compression  Quantized neural networks  Data-free quantization  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:156/59  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Solving the spike feature information vanishing problem in spiking deep Q network with potential based normalization 期刊论文
FRONTIERS IN NEUROSCIENCE, 2022, 卷号: 16, 页码: 11
作者:  Sun, Yinqian;  Zeng, Yi;  Li, Yang
Adobe PDF(1561Kb)  |  收藏  |  浏览/下载:216/31  |  提交时间:2022/11/14
brain-inspired decision model  SDQN  reinforcement learning  potential normalization  spiking activity  
Learning adversarial point-wise domain alignment for stereo matching 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 564-574
作者:  Zhang, Chenghao;  Meng, Gaofeng;  Xu, Richard Yi Da;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(3885Kb)  |  收藏  |  浏览/下载:277/53  |  提交时间:2022/09/19
Stereo Matching  Domain adaptation  Point-wise linear transformation  Adversarial learning  
Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game 期刊论文
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 页码: 10
作者:  Li, Hongyang;  Wei, Qinglai
Adobe PDF(716Kb)  |  收藏  |  浏览/下载:193/46  |  提交时间:2022/06/14
Attention Analysis and Calibration for Transformer in Natural Language Generation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, 页码: 1927-1938
作者:  Yu, Lu;  Jiajun, Zhang;  Jiali, Zeng;  Shuangzhi, Wu;  Chengqing, Zong
Adobe PDF(1978Kb)  |  收藏  |  浏览/下载:126/35  |  提交时间:2023/05/31
神经机器翻译  
一种用于两人零和博弈对手适应的元策略演化学习算法 期刊论文
自动化学报, 2022, 页码: 0
作者:  吴哲;  李凯;  徐航;  兴军亮
Adobe PDF(15953Kb)  |  收藏  |  浏览/下载:189/43  |  提交时间:2022/06/17
Multi-modal spatio-temporal meteorological forecasting with deep neural network 期刊论文
ISPRS Journal of Photogrammetry and Remote Sensing, 2022, 页码: 14
作者:  Xinbang Zhang;  Qizhao Jin;  Tingzhao Yu;  Shiming Xiang;  Qiuming Kuang;  Véronique Prinet;  Chunhong Pan
Adobe PDF(3735Kb)  |  收藏  |  浏览/下载:293/68  |  提交时间:2022/07/01
Meterological forecasting  Deep learning  Neural architecture search  AutoML  
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:206/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum