CASIA OpenIR

浏览/检索结果: 共77条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:7/1  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
CKDF: Cascaded Knowledge Distillation Framework for Robust Incremental Learning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3825–3837
作者:  Li KC(李焜炽);  Wan J(万军);  Yu S(余山)
Adobe PDF(3813Kb)  |  收藏  |  浏览/下载:9/3  |  提交时间:2024/05/28
Contrastive Correlation Preserving Replay for Online Continual Learning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 1, 页码: 124-139
作者:  Yu, Da;  Zhang, Mingyi;  Li, Mantian;  Zha, Fusheng;  Zhang, Junge;  Sun, Lining;  Huang, Kaiqi
收藏  |  浏览/下载:29/0  |  提交时间:2024/03/26
Task analysis  Correlation  Knowledge transfer  Training  Memory management  Data models  Mutual information  Continual learning  catastrophic forgetting  class-incremental learning  experience replay  
Target-Following Control of a Biomimetic Autonomous System Based on Predictive Reinforcement Learning 期刊论文
BIOMIMETICS, 2024, 卷号: 9, 期号: 1, 页码: 19
作者:  Wang, Yu;  Wang, Jian;  Kang, Song;  Yu, Junzhi
Adobe PDF(1553Kb)  |  收藏  |  浏览/下载:29/0  |  提交时间:2024/03/26
biomimetic motion  biomimetic autonomous system  target following  deep reinforcement learning  predictive control  
RTDOD: A large-scale RGB-thermal domain-incremental object detection dataset for UAVs 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 9
作者:  Feng, Hangtao;  Zhang, Lu;  Zhang, Siqi;  Wang, Dong;  Yang, Xu;  Liu, Zhiyong
Adobe PDF(3013Kb)  |  收藏  |  浏览/下载:75/0  |  提交时间:2024/02/22
Domain -incremental object detection  Dataset  RGB-T dataset  Object detection dataset  UAVs dataset  Object detection  
Synergetic learning for unknown nonlinear H. control using neural networks 期刊论文
NEURAL NETWORKS, 2023, 卷号: 168, 页码: 287-299
作者:  Zhu, Liao;  Guo, Ping;  Wei, Qinglai
收藏  |  浏览/下载:83/0  |  提交时间:2023/12/21
H. control  Nonlinear systems  Adaptive dynamic programming  Temporal difference  Neural network  Data-driven  
Multiagent-Reinforcement-Learning-Based Stable Path Tracking Control for a Bionic Robotic Fish With Reaction Wheel 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS, 2023, 卷号: 70, 期号: 12, 页码: 12670-12679
作者:  Qiu, Changlin;  Wu, Zhengxing;  Wang, Jian;  Tan, Min;  Yu, Junzhi
Adobe PDF(1587Kb)  |  收藏  |  浏览/下载:128/0  |  提交时间:2023/11/17
Multiagent reinforcement learning (MARL)  path tracking control  reaction wheel  robotic fish  underwater robot  
Hierarchical Policy Learning With Demonstration Learning for Robotic Multiple Peg-in-Hole Assembly Tasks 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 卷号: 19, 期号: 10, 页码: 10254-10264
作者:  Yan, Shaohua;  Xu, De;  Tao, Xian
Adobe PDF(4845Kb)  |  收藏  |  浏览/下载:87/1  |  提交时间:2023/11/17
Assembly model  demonstration learning (DL)  force-based control algorithm  hierarchical reinforcement learning (HRL)  peg-in-hole assembly  
Parallel Transportation in TransVerse: From Foundation Models to DeCAST 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2023, 卷号: 24, 期号: 12, 页码: 15310-15327
作者:  Zhao, Chen;  Wang, Xiao;  Lv, Yisheng;  Tian, Yonglin;  Lin, Yilun;  Wang, Fei-Yue
Adobe PDF(4139Kb)  |  收藏  |  浏览/下载:143/0  |  提交时间:2023/11/16
Intelligent Transportation Systems (ITS)  Cyber-Physical-Social Systems (CPSS)  Artificial Systems, Computational Experiments, Parallel Execution (ACP)  Decentralized/Distributed Autonomous Operations and Organizations (DAO)  
Residual Reinforcement Learning for Motion Control of a Bionic Exploration Robot - RoboDact 期刊论文
IEEE Transactions on Instrumentation and Measurement, 2023, 页码: 1-13
作者:  Zhang Tiandong;  Wang Rui;  Wang Shuo;  Wang Yu;  Zheng Gang;  Tan Min
Adobe PDF(3127Kb)  |  收藏  |  浏览/下载:122/49  |  提交时间:2023/06/14