CASIA OpenIR

Browse/Search results: 77 items in total, showing items 1-10

NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Views/Downloads: 42/0  |  Submitted: 2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
ICaps-ResLSTM: Improved capsule network and residual LSTM for EEG emotion recognition (Journal article)
BIOMEDICAL SIGNAL PROCESSING AND CONTROL, 2024, Volume: 87, Pages: 9
Authors:  Fan, Cunhang;  Xie, Heng;  Tao, Jianhua;  Li, Yongwei;  Pei, Guanxiong;  Li, Taihao;  Lv, Zhao
Views/Downloads: 101/0  |  Submitted: 2023/11/15
Electroencephalogram  Emotion recognition  Capsule network  Residual Long-Short Term Memory  
Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming (Conference paper)
, Budapest, Hungary, 2019-7-14
Authors:  Zhu YH(朱圆恒);  Haibo He;  Dongbin Zhao;  Zhongsheng Hou
Adobe PDF (679Kb)  |  Views/Downloads: 55/28  |  Submitted: 2023/05/22
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Views/Downloads: 197/0  |  Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Indexing-Min-Max Hashing: Relaxing the Security-Performance Tradeoff for Cancelable Fingerprint Templates (Journal article)
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, Pages: 12
Authors:  Li, Yuxing;  Pang, Liaojun;  Zhao, Heng;  Cao, Zhicheng;  Liu, Eryun;  Tian, Jie
Views/Downloads: 204/0  |  Submitted: 2022/03/17
Biometrics (access control)  Transforms  Cryptography  Privacy  Hash functions  Feature extraction  Life sciences  Biometrics template protection  cancelable fingerprint template  fixed-length fingerprint representation  Indexing-Min-Max (IMM) hashing  security-performance tradeoff  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF (3402Kb)  |  Views/Downloads: 230/24  |  Submitted: 2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Views/Downloads: 193/0  |  Submitted: 2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target (Journal article)
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF (1431Kb)  |  Views/Downloads: 274/48  |  Submitted: 2021/12/28
Reinforcement learning  Missile guidance  Auxiliary learning  Self-imitation learning  
Unconstrained end-to-end text reading with feature rectification (Journal article)
PATTERN RECOGNITION LETTERS, 2021, Volume: 149, Pages: 1-8
Authors:  Du, Chen;  Wang, Yanna;  Wang, Chunheng;  Xiao, Baihua;  Shi, Cunzhao
Adobe PDF (1133Kb)  |  Views/Downloads: 287/56  |  Submitted: 2021/11/02
Text recognition  Text detection  Position-sensitive network  Features incompatibility  End-to-end  
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors (Journal article)
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, Volume: 18, Issue: 3, Pages: 1097-1108
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Views/Downloads: 171/0  |  Submitted: 2021/08/15
Microscopy  Feedback control  Mathematical model  Data models  Dynamic programming  Psychology  Computational modeling  Adaptive dynamic programming (ADP)  heterogeneous corridors  macroscopic pedestrian dynamics  optimal feedback control  pedestrian flow