CASIA OpenIR

浏览/检索结果: 共21条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Resizemix: Mixing data with preserved object information and true labels 期刊论文
Computational Visual Media, 2023, 页码: --
作者:  Jie Qin;  Jiemin Fang;  Qian Zhang;  Wenyu Liu;  Xingang Wang;  Xinggang Wang
Adobe PDF(9105Kb)  |  收藏  |  浏览/下载:4/3  |  提交时间:2024/06/04
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:19/5  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Constrained-cost adaptive dynamic programming for optimal control of discrete-time nonlinear systems 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 卷号: 35, 期号: 3, 页码: 3251 - 3264
作者:  Wei, Qinglai;  Li, Tao
Adobe PDF(8471Kb)  |  收藏  |  浏览/下载:22/9  |  提交时间:2024/05/28
Adaptive dynamic programming  approximate dynamic programming  constrained cost  optimal control  reinforcement learning  
未知非线性零和博弈最优跟踪的事件触发控制设计 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 91-101
作者:  王鼎;  胡凌治;  赵明明;  哈明鸣;  乔俊飞
Adobe PDF(1996Kb)  |  收藏  |  浏览/下载:31/13  |  提交时间:2024/05/09
自适应评判设计  事件触发控制  神经网络  最优跟踪控制  稳定性分析  零和博弈  
多智能体博弈、学习与控制 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 3, 页码: 580-613
作者:  王龙;  黄锋
Adobe PDF(2088Kb)  |  收藏  |  浏览/下载:16/4  |  提交时间:2024/05/09
博弈论  多智能体学习  控制论  强化学习  人工智能  
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  收藏  |  浏览/下载:9/2  |  提交时间:2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
基于扩展PI抗扰补偿器的高精度时间同步控制 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 12, 页码: 2520-2531
作者:  代学武;  贾志安;  崔东亮;  柴天佑
Adobe PDF(1595Kb)  |  收藏  |  浏览/下载:42/13  |  提交时间:2024/04/17
扩展PI抗扰补偿器  零极点优化  时间同步  网络控制系统  周期性扰动  
Multistep Look-Ahead Policy Iteration for Optimal Control of Discrete-Time Nonlinear Systems With Isoperimetric Constraints 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 卷号: 54, 期号: 3, 页码: 1414-1426
作者:  Li, Tao;  Wei, Qinglai;  Wang, Fei-Yue
Adobe PDF(784Kb)  |  收藏  |  浏览/下载:81/8  |  提交时间:2024/02/22
Performance analysis  Optimal control  Dynamic programming  Iterative algorithms  Upper bound  Measurement  Convergence  Adaptive dynamic programming (ADP)  isoperimetric constraints  nonlinear systems  optimal control  policy iteration  
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 12, 页码: 2233-2247
作者:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  收藏  |  浏览/下载:108/36  |  提交时间:2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping