CASIA OpenIR

Browse/search results: 16 items in total, showing items 1-10

Learn to flap: foil non-parametric path planning via deep reinforcement learning (Journal article)
Journal of Fluid Mechanics, 2024, Volume: 984, Pages: A9
Authors:  Wang, Zhipeng;  Lin, Runji;  Zhao, Zhiyu;  Chen, Xu;  Guo, Pengming;  Yang, Ning;  Wang, Zhicheng;  Fan, Dixia
Adobe PDF(1892Kb)  |  Views/Downloads: 23/3  |  Submitted: 2024/06/07
A Survey on Reinforcement Learning Methods in Bionic Underwater Robots (Journal article)
BIOMIMETICS, 2023, Volume: 8, Issue: 2, Pages: 29
Authors:  Tong, Ru;  Feng, Yukai;  Wang, Jian;  Wu, Zhengxing;  Tan, Min;  Yu, Junzhi
Adobe PDF(1260Kb)  |  Views/Downloads: 116/11  |  Submitted: 2023/11/17
bionic underwater robot  reinforcement learning  robotic fish  intelligent control  
NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors:  Chai, Jiajun;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2469Kb)  |  Views/Downloads: 52/0  |  Submitted: 2023/11/16
Large-scale multiagent  neighboring communication  reinforcement learning (RL)  variational information flow  
Parallel Transportation in TransVerse: From Foundation Models to DeCAST (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, Volume: 24, Issue: 12, Pages: 15310-15327
Authors:  Zhao, Chen;  Wang, Xiao;  Lv, Yisheng;  Tian, Yonglin;  Lin, Yilun;  Wang, Fei-Yue
Adobe PDF(4139Kb)  |  Views/Downloads: 165/6  |  Submitted: 2023/11/16
Intelligent Transportation Systems (ITS)  Cyber-Physical-Social Systems (CPSS)  Artificial Systems, Computational Experiments, Parallel Execution (ACP)  Decentralized/Distributed Autonomous Operations and Organizations (DAO)  
Peer Incentive Reinforcement Learning for Cooperative Multi-Agent Games (Journal article)
IEEE Transactions on Games, 2022, Pages: 1-14
Authors:  Zhang TL(张天乐);  Liu Z(刘振);  Pu ZQ(蒲志强);  Yi JQ(易建强)
Adobe PDF(18835Kb)  |  Views/Downloads: 113/28  |  Submitted: 2023/06/12
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Volume: 33, Issue: 3, Pages: 1228-1241
Authors:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  Views/Downloads: 224/3  |  Submitted: 2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Pages: 12
Authors:  Chai, Jiajun;  Li, Weifan;  Zhu, Yuanheng;  Zhao, Dongbin;  Ma, Zhe;  Sun, Kewu;  Ding, Jishiyu
Adobe PDF(3402Kb)  |  Views/Downloads: 266/31  |  Submitted: 2022/01/27
Multi-agent systems  Training  Task analysis  Reinforcement learning  Sun  Learning systems  Semantics  Centralized training with decentralized execution (CTDE)  multiagent  reinforcement learning  StarCraft II  
CASIA-SURF: A Large-scale Multi-modal Benchmark for Face Anti-spoofing (Journal article)
IEEE Transactions on Biometrics, Behavior, and Identity Science, 2020, Issue: 2, Pages: 182-193
Authors:  Zhang, Shifeng;  Liu, Ajian;  Wan, Jun;  Liang, Yanyan;  Guo, Guodong;  Sergio Escalera;  Hugo Jair Escalante;  Li, Stan Z.
Adobe PDF(1961Kb)  |  Views/Downloads: 280/92  |  Submitted: 2020/06/08
Face anti-spoofing, large-scale, multi-modal, dataset, benchmark  
Attention-Based Two-Stream Convolutional Networks for Face Spoofing Detection (Journal article)
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2020, Volume: 15, Pages: 578-593
Authors:  Chen, Haonan;  Hu, Guosheng;  Lei, Zhen;  Chen, Yaowu;  Robertson, Neil M.;  Li, Stan Z.
Views/Downloads: 255/0  |  Submitted: 2020/03/30
Face spoofing  multi-scale retinex  deep learning  attention model  feature fusion  
Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming (Journal article)
IEEE TRANSACTIONS ON CYBERNETICS, 2018, Volume: 48, Issue: 2, Pages: 500-509
Authors:  Zhu, Yuanheng;  Zhao, Dongbin;  Yang, Xiong;  Zhang, Qichao
Adobe PDF(892Kb)  |  Views/Downloads: 308/44  |  Submitted: 2018/10/10
Adaptive Dynamic Programming (Adp)  h Infinity Optimal Control  Policy Iteration (Pi)  Polynomial Nonlinear Systems  Sum Of Squares (Sos)