CASIA OpenIR

浏览/检索结果: 共12条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Attention Analysis and Calibration for Transformer in Natural Language Generation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing, 2022, 页码: 1927-1938
作者:  Yu, Lu;  Jiajun, Zhang;  Jiali, Zeng;  Shuangzhi, Wu;  Chengqing, Zong
Adobe PDF(1978Kb)  |  收藏  |  浏览/下载:134/38  |  提交时间:2023/05/31
神经机器翻译  
Dynamic-horizon model-based value estimation with latent imagination 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2022, 页码: 1-14
作者:  Wang JJ(王俊杰);  Zhang QC(张启超);  Zhao DB(赵冬斌)
Adobe PDF(2305Kb)  |  收藏  |  浏览/下载:163/60  |  提交时间:2023/05/30
Latent world model  model-based value expansion (MVE)  reinforcement learning  reinforcement learning  
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2022, 页码: doi={10.1109/TCDS.2022.3218940}
作者:  Minsong Liu;  Luntong Li;  Shuai Hao;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(12013Kb)  |  收藏  |  浏览/下载:75/19  |  提交时间:2023/04/26
Empirical Policy Optimization for n-Player Markov Games 期刊论文
IEEE Transactions on Cybernetics, 2022, 页码: doi={10.1109/TCYB.2022.3179775}
作者:  Yuanheng Zhu;  Weifan Li;  Mengchen Zhao;  Jianye Hao;  Dongbin Zhao
Adobe PDF(1739Kb)  |  收藏  |  浏览/下载:97/39  |  提交时间:2023/04/26
Many Hands Make Light Work: Transferring Knowledge from Auxiliary Tasks for Video-Text Retrieval 期刊论文
IEEE Transactions on Multimedia, 2022, 页码: 1-15
作者:  Wang, Wei;  Gao, Junyu;  Yang, Xiaoshan;  Xu, Changsheng
Adobe PDF(3679Kb)  |  收藏  |  浏览/下载:117/21  |  提交时间:2023/04/25
Learning adversarial point-wise domain alignment for stereo matching 期刊论文
NEUROCOMPUTING, 2022, 卷号: 491, 页码: 564-574
作者:  Zhang, Chenghao;  Meng, Gaofeng;  Xu, Richard Yi Da;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(3885Kb)  |  收藏  |  浏览/下载:296/53  |  提交时间:2022/09/19
Stereo Matching  Domain adaptation  Point-wise linear transformation  Adversarial learning  
Multi-modal spatio-temporal meteorological forecasting with deep neural network 期刊论文
ISPRS Journal of Photogrammetry and Remote Sensing, 2022, 页码: 14
作者:  Xinbang Zhang;  Qizhao Jin;  Tingzhao Yu;  Shiming Xiang;  Qiuming Kuang;  Véronique Prinet;  Chunhong Pan
Adobe PDF(3735Kb)  |  收藏  |  浏览/下载:301/71  |  提交时间:2022/07/01
Meterological forecasting  Deep learning  Neural architecture search  AutoML  
Optimal synchronization control for multi-agent systems with input saturation: a nonzero-sum game 期刊论文
FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING, 2022, 页码: 10
作者:  Li, Hongyang;  Wei, Qinglai
Adobe PDF(716Kb)  |  收藏  |  浏览/下载:197/48  |  提交时间:2022/06/14
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
收藏  |  浏览/下载:210/0  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Supervised assisted deep reinforcement learning for emergency voltage control of power systems 期刊论文
NEUROCOMPUTING, 2022, 卷号: 475, 页码: 69-79
作者:  Li, Xiaoshuang;  Wang, Xiao;  Zheng, Xinhu;  Dai, Yuxin;  Yu, Zhihong;  Zhang, Jun Jason;  Bu, Guangquan;  Wang, Fei-Yue
Adobe PDF(2551Kb)  |  收藏  |  浏览/下载:320/64  |  提交时间:2022/06/06
Deep reinforcement learning  Behavioral cloning  Dynamic demonstration  Emergency control