CASIA OpenIR

浏览/检索结果: 共31条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
Deep Reinforcement Learning-Based Driving Policy at Intersections Utilizing Lane Graph Networks 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1 - 16
作者:  Liu, Yuqi;  Zhang, Qichao;  Gao, Yinfeng;  Zhao, Dongbin
Adobe PDF(22863Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/06/03
Reinforcement Learning  Autonomous Driving  Intersection Navigating  
Multi-task safe reinforcement learning for navigating intersections in dense traffic 期刊论文
JOURNAL OF THE FRANKLIN INSTITUTE-ENGINEERING AND APPLIED MATHEMATICS, 2023, 卷号: 360, 期号: 17, 页码: 13737-13760
作者:  Liu, Yuqi;  Gao, Yinfeng;  Zhang, Qichao;  Ding, Dawei;  Zhao, Dongbin
Adobe PDF(3095Kb)  |  收藏  |  浏览/下载:73/12  |  提交时间:2024/02/22
Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 33, 期号: 3, 页码: 1228-1241
作者:  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(2838Kb)  |  收藏  |  浏览/下载:236/6  |  提交时间:2022/06/10
Games  Nash equilibrium  Mathematical model  Markov processes  Convergence  Dynamic programming  Training  Deep reinforcement learning (DRL)  generalized policy iteration (GPI)  Markov game (MG)  Nash equilibrium  Q network  zero sum  
Highway Lane Change Decision-Making via Attention-Based Deep Reinforcement Learning 期刊论文
IEEE-CAA JOURNAL OF AUTOMATICA SINICA, 2022, 卷号: 9, 期号: 3, 页码: 567-569
作者:  Wang, Junjie;  Zhang, Qichao;  Zhao, Dongbin
Adobe PDF(803Kb)  |  收藏  |  浏览/下载:287/67  |  提交时间:2022/02/16
Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, 页码: 13
作者:  Hu, Guangzheng;  Zhu, Yuanheng;  Zhao, Dongbin;  Zhao, Mengchen;  Hao, Jianye
Adobe PDF(4187Kb)  |  收藏  |  浏览/下载:252/10  |  提交时间:2022/01/27
Bandwidth  Protocols  Reinforcement learning  Task analysis  Optimization  Communication networks  Multi-agent systems  Event trigger  limited bandwidth  multi-agent communication  multi-agent reinforcement learning (MARL)  
BNAS-v2: Memory-efficient and Performance-collapse-prevented Broad Neural Architecture Search 期刊论文
IEEE TRANSACTIONS ON SYSTEMS, MAN, AND CYBERNETICS: SYSTEMS, 2022, 卷号: 0, 期号: 0, 页码: 0
作者:  Zixiang, Ding;  Yaran, Chen;  Nannan, Li;  Dongbin, Zhao
Adobe PDF(7657Kb)  |  收藏  |  浏览/下载:220/56  |  提交时间:2022/01/07
Broad neural architecture search (BNAS), continuous relaxation, confident learning rate, partial channel connections, image classification.  
Stacked BNAS: Rethinking Broad Convolutional Neural Network for Neural Architecture Search 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 卷号: 0, 期号: 0, 页码: 0
作者:  Zixiang, Ding;  Yaran, Chen;  Nannan, Li;  Dongbin, Zhao;  C.L.Philip Chen,
Adobe PDF(764Kb)  |  收藏  |  浏览/下载:230/37  |  提交时间:2022/01/07
broad neural architecture search, stacked broad convolutional neural network, knowledge embedding search, image classification.  
BNAS: Efficient Neural Architecture Search Using Broad Scalable Architecture 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2020, 期号: 0, 页码: 0
作者:  Ding ZX(丁子祥);  Yaran, Chen;  Nannan, Li;  Dingbin, Zhao;  Zhiquan, Sun;  C. L. Philip Chen
Adobe PDF(2713Kb)  |  收藏  |  浏览/下载:195/45  |  提交时间:2022/01/06
Broad convolutional neural network (BCNN), image classification, neural architecture search (NAS), reinforcement learning (RL)  
Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors 期刊论文
IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING, 2021, 卷号: 18, 期号: 3, 页码: 1097-1108
作者:  Zhu, Yuanheng;  Zhao, Dongbin;  He, Haibo
Adobe PDF(2666Kb)  |  收藏  |  浏览/下载:226/12  |  提交时间:2021/08/15
Microscopy  Feedback control  Mathematical model  Data models  Dynamic programming  Psychology  Computational modeling  Adaptive dynamic programming (ADP)  heterogeneous corridors  macroscopic pedestrian dynamics  optimal feedback control  pedestrian flow  
Heuristic rank selection with progressively searching tensor ring network 期刊论文
COMPLEX & INTELLIGENT SYSTEMS, 2021, 页码: 15
作者:  Li, Nannan;  Pan, Yu;  Chen, Yaran;  Ding, Zixiang;  Zhao, Dongbin;  Xu, Zenglin
Adobe PDF(1305Kb)  |  收藏  |  浏览/下载:302/53  |  提交时间:2021/04/27
Tensor ring networks  Rank selection  Progressively search  Image classification