CASIA OpenIR

Browse/Search Results:  1-10 of 135

Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
View/Download:72/0  |  Submit date:2022/06/10
Keywords:  Games;  Nash equilibrium;  Mathematical model;  Markov processes;  Convergence;  Dynamic programming;  Training;  Deep reinforcement learning (DRL);  generalized policy iteration (GPI);  Markov game (MG);  Nash equilibrium;  Q network;  zero sum
Indexing-Min-Max Hashing: Relaxing the Security-Performance Tradeoff for Cancelable Fingerprint Templates [Journal article]
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2022, Pages: 12
Authors:  Li, Yuxing;  Pang, Liaojun;  Zhao, Heng;  Cao, Zhicheng;  Liu, Eryun;  Tian, Jie
View/Download:55/0  |  Submit date:2022/03/17
Keywords:  Biometrics (access control);  Transforms;  Cryptography;  Privacy;  Hash functions;  Feature extraction;  Life sciences;  Biometrics template protection;  cancelable fingerprint template;  fixed-length fingerprint representation;  Indexing-Min-Max (IMM) hashing;  security-performance tradeoff
Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target [Journal article]
COMPLEX & INTELLIGENT SYSTEMS, 2021, Pages: 12
Authors:  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin
View/Download:57/0  |  Submit date:2021/12/28
Keywords:  Reinforcement learning;  Missile guidance;  Auxiliary learning;  Self-imitation learning
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 13
Authors:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
View/Download:45/0  |  Submit date:2022/01/27
Keywords:  Bandwidth;  Protocols;  Reinforcement learning;  Task analysis;  Optimization;  Communication networks;  Multi-agent systems;  Event trigger;  limited bandwidth;  multi-agent communication;  multi-agent reinforcement learning (MARL)
Unconstrained end-to-end text reading with feature rectification [Journal article]
PATTERN RECOGNITION LETTERS, 2021, Volume: 149, Pages: 1-8
Authors:  Du, Chen;  Wang, Yanna;  Wang, Chunheng;  Xiao, Baihua;  Shi, Cunzhao
Adobe PDF(1133Kb)  |  View/Download:84/7  |  Submit date:2021/11/02
Keywords:  Text recognition;  Text detection;  Position-sensitive network;  Features incompatibility;  End-to-end
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
View/Download:59/0  |  Submit date:2022/01/27
Keywords:  Multi-agent systems;  Training;  Task analysis;  Reinforcement learning;  Sun;  Learning systems;  Semantics;  Centralized training with decentralized execution (CTDE);  multiagent;  reinforcement learning;  StarCraft II
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors [Journal article]
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, Volume: 18, Issue: 3, Pages: 1097-1108
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
View/Download:61/0  |  Submit date:2021/08/15
Keywords:  Microscopy;  Feedback control;  Mathematical model;  Data models;  Dynamic programming;  Psychology;  Computational modeling;  Adaptive dynamic programming (ADP);  heterogeneous corridors;  macroscopic pedestrian dynamics;  optimal feedback control;  pedestrian flow
LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control [Journal article]
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2020, Volume: 21, Issue: 11, Pages: 4516-4525
Authors:  Zhu, Yuanheng;  He, Haibo;  Zhao, Dongbin
View/Download:62/0  |  Submit date:2021/01/06
Keywords:  Cooperative adaptive cruise control;  string stability;  time-delay system;  H-infinity control;  linear matrix inequality
Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control [Journal article]
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2020, Volume: 50, Issue: 11, Pages: 3959-3971
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
View/Download:79/0  |  Submit date:2021/01/07
Keywords:  Optimal control;  Discrete-time systems;  Heuristic algorithms;  Dynamic programming;  Convergence;  Artificial intelligence;  Nonlinear systems;  Adaptive dynamic programming;  discrete-time systems;  invariant admissibility;  optimal control;  policy iteration;  sum of squares
Adversarial learning based attentional scene text recognizer [Journal article]
PATTERN RECOGNITION LETTERS, 2020, Volume: 138, Issue: 1, Pages: 217-222
Authors:  Zhao, Jinyuan;  Wang, Yanna;  Xiao, Baihua;  Shi, Cunzhao;  Jiang, Jingzhong;  Wang, Chunheng
Adobe PDF(1152Kb)  |  View/Download:92/12  |  Submit date:2021/01/07
Keywords:  Scene text recognition;  Generative adversarial network;  Image rectification