CASIA OpenIR

浏览/检索结果: 共225条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Fixed-Time Gradient Flows for Solving Constrained Optimization: A Unified Approach 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 8, 页码: 1849-1864
作者:  Xinli Shi;  Xiangping Xu;  Guanghui Wen;  Jinde Cao
Adobe PDF(2318Kb)  |  收藏  |  浏览/下载:9/3  |  提交时间:2024/07/16
Consensus  constrained optimization  disturbance rejection  linear equations  fixed-time gradient flow (FxTGF)  
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
基于表征学习的离线强化学习方法研究综述 期刊论文
自动化学报, 2024, 卷号: 50, 期号: 6, 页码: 1104-1128
作者:  王雪松;  王荣荣;  程玉虎
Adobe PDF(3333Kb)  |  收藏  |  浏览/下载:12/8  |  提交时间:2024/07/02
强化学习  离线强化学习  表征学习  历史经验数据  分布偏移  
面向算力网络的智慧调度综述 期刊论文
自动化学报, 2024, 卷号: 50, 期号: 6, 页码: 1086-1103
作者:  李逸博;  李小平;  王爽;  蒋嶷川
Adobe PDF(1752Kb)  |  收藏  |  浏览/下载:11/7  |  提交时间:2024/07/02
算力网络  云计算  边缘计算  资源调度  知识  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:16/9  |  提交时间:2024/06/25
Enhancing Reinforcement Learning via Transformer-based State Predictive Representations 期刊论文
IEEE Transactions on Artificial Intelligence, 2024, 页码: 1 - 12
作者:  Liu MS(刘民颂);  Zhu YH(朱圆恒);  Chen YR(陈亚冉);  Zhao DB(赵冬斌)
Adobe PDF(1162Kb)  |  收藏  |  浏览/下载:28/8  |  提交时间:2024/06/24
Recursive Least-Squares Estimator-Aided Online Learning for Visual Tracking 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 3, 页码: 1881-1897
作者:  Gao, Jin;  Lu, Yan;  Qi, Xiaojuan;  Kou, Yutong;  Li, Bing;  Li, Liang;  Yu, Shan;  Hu, Weiming
Adobe PDF(915Kb)  |  收藏  |  浏览/下载:35/14  |  提交时间:2024/06/21
Visualization  Training  Adaptation models  Data models  Optimization  Task analysis  Robustness  Online learning  few-shot online adaptation  visual tracking  continual learning  recursive least-squares estimation  
A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 卷号: 32, 期号: 6, 页码: 3880-3894
作者:  Li, Zhenbang;  Shi, Yaya;  Gao, Jin;  Wang, Shaoru;  Li, Bing;  Liang, Pengpeng;  Hu, Weiming
Adobe PDF(4397Kb)  |  收藏  |  浏览/下载:36/20  |  提交时间:2024/06/21
UAV Path Planning with Terrain Constraints for Aerial Scanning. 期刊论文
IEEE Transactions on Intelligent Vehicles, 2024, 卷号: 9, 期号: 1, 页码: 1189-1203
作者:  Jinbiao Yuan;  Zhenbao Liu;  Xiaoyu Xiong;  Yunfeng Ai;  Long Chen;  Bin Tian
Adobe PDF(3939Kb)  |  收藏  |  浏览/下载:66/16  |  提交时间:2024/06/20
Power Control Based on Deep Reinforcement Learning for Spectrum Sharing 期刊论文
IEEE Transactions on Wireless Communications, 2024, 卷号: 19, 期号: 6, 页码: 4209-4219
作者:  Zhang,Haijun;  Yang,Ning;  Huangfu,Wei;  Long,Keping;  Leung,VictorCM
Adobe PDF(1925Kb)  |  收藏  |  浏览/下载:38/17  |  提交时间:2024/06/12