CASIA OpenIR

浏览/检索结果: 共351条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
面向多机器人博弈的深度强化学习方法 学位论文
, 2024
作者:  胡光政
Adobe PDF(17740Kb)  |  收藏  |  浏览/下载:21/0  |  提交时间:2024/07/04
多智能体深度强化学习  多机器人博弈  极小极大Q学习  值分解  最大熵  
Boosting On-Policy Actor-Critic With Shallow Updates in Critic 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, 页码: 10
作者:  Li, Luntong;  Zhu, Yuanheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/07/03
Artificial neural networks  Vectors  Task analysis  Training  Representation learning  Approximation algorithms  Optimization  Actor-critic  deep reinforcement learning (DRL)  proximal policy optimization (PPO)  shallow reinforcement learning (SRL)  
Uncertainty-aware Boundary Attention Network for Real-time Semantic Segmentation 会议论文
, 中国福建厦门, 2023年10月13日
作者:  Zhu YB(朱袁兵);  Zhu BK(朱炳科);  Chen YY(陈盈盈);  Wang JQ(王金桥)
Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:16/8  |  提交时间:2024/06/27
Uncertainty Estimation  Real-time Semantic Segmentation  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:14/7  |  提交时间:2024/06/25
Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2023, 卷号: 15, 期号: 3, 页码: 1463 - 1473
作者:  Liu MS(刘民颂);  Li LT(李伦通);  Hao S(郝帅);  Zhu YH(朱圆恒);  Zhao DB(赵冬斌)
Adobe PDF(4197Kb)  |  收藏  |  浏览/下载:26/5  |  提交时间:2024/06/24
Modeling Socially Normative Navigation Behaviors from Demonstrations with Inverse Reinforcement Learning 会议论文
, Vancouver, British Columbia, Canada, 2019-08-22至2019-08-26
作者:  Xingyuan Gao;  Xiaoguang Zhao;  Min Tan
Adobe PDF(1500Kb)  |  收藏  |  浏览/下载:27/13  |  提交时间:2024/06/21
UAV Path Planning with Terrain Constraints for Aerial Scanning. 期刊论文
IEEE Transactions on Intelligent Vehicles, 2024, 卷号: 9, 期号: 1, 页码: 1189-1203
作者:  Jinbiao Yuan;  Zhenbao Liu;  Xiaoyu Xiong;  Yunfeng Ai;  Long Chen;  Bin Tian
Adobe PDF(3939Kb)  |  收藏  |  浏览/下载:58/12  |  提交时间:2024/06/20
基于信息融合的遥感图像语义分割方法研究 学位论文
, 2024
作者:  曹勇
Adobe PDF(8052Kb)  |  收藏  |  浏览/下载:71/3  |  提交时间:2024/06/13
遥感图像处理  语义分割  信息融合  深度学习  
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:30/13  |  提交时间:2024/06/12
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:31/10  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning