CASIA OpenIR

浏览/检索结果: 共15条,第1-10条 帮助

限定条件                        
已选(0)清除 条数/页:   排序方式:
未知非线性零和博弈最优跟踪的事件触发控制设计 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 91-101
作者:  王鼎;  胡凌治;  赵明明;  哈明鸣;  乔俊飞
Adobe PDF(1996Kb)  |  收藏  |  浏览/下载:32/13  |  提交时间:2024/05/09
自适应评判设计  事件触发控制  神经网络  最优跟踪控制  稳定性分析  零和博弈  
多智能体博弈、学习与控制 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 3, 页码: 580-613
作者:  王龙;  黄锋
Adobe PDF(2088Kb)  |  收藏  |  浏览/下载:16/4  |  提交时间:2024/05/09
博弈论  多智能体学习  控制论  强化学习  人工智能  
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:30/9  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
A Survey on Recent Advances and Challenges in Reinforcement Learning Methods for Task-oriented Dialogue Policy Learning 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 3, 页码: 318-334
作者:  Wai-Chung Kwan;  Hong-Ru Wang;  Hui-Min Wang;  Kam-Fai Wong
Adobe PDF(2211Kb)  |  收藏  |  浏览/下载:9/2  |  提交时间:2024/04/23
Dialogue policy learning (DPL), task-oriented dialogue system (TOD), reinforcement learning (RL), dialogue system, Markov decision process  
基于扩展PI抗扰补偿器的高精度时间同步控制 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 12, 页码: 2520-2531
作者:  代学武;  贾志安;  崔东亮;  柴天佑
Adobe PDF(1595Kb)  |  收藏  |  浏览/下载:43/13  |  提交时间:2024/04/17
扩展PI抗扰补偿器  零极点优化  时间同步  网络控制系统  周期性扰动  
Magnetic Field-Based Reward Shaping for Goal-Conditioned Reinforcement Learning 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 12, 页码: 2233-2247
作者:  Hongyu Ding;  Yuanze Tang;  Qing Wu;  Bo Wang;  Chunlin Chen;  Zhi Wang
Adobe PDF(5205Kb)  |  收藏  |  浏览/下载:108/36  |  提交时间:2023/10/31
Dynamic environments  goal-conditioned reinforcement learning  magnetic field  reward shaping  
Position Errors and Interference Prediction-Based Trajectory Tracking for Snake Robots 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 9, 页码: 1810-1821
作者:  Dongfang Li;  Yilong Zhang;  Ping Li;  Rob Law;  Zhengrong Xiang;  Xin Xu;  Limin Zhu;  Edmond Q. Wu
Adobe PDF(19961Kb)  |  收藏  |  浏览/下载:142/32  |  提交时间:2023/08/10
Anti-sideslip  compensation  snake robot  trajectory tracking  
Adaptive Multi-Step Evaluation Design With Stability Guarantee for Discrete-Time Optimal Learning Control 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 9, 页码: 1797-1809
作者:  Ding Wang;  Jiangyu Wang;  Mingming Zhao;  Peng Xin;  Junfei Qiao
Adobe PDF(5140Kb)  |  收藏  |  浏览/下载:153/60  |  提交时间:2023/08/10
Adaptive critic  artificial neural networks  Hamilton-Jacobi-Bellman (HJB) equation  multi-step heuristic dynamic programming  multi-step reinforcement learning  optimal control  
Development of a Bias Compensating Q-Learning Controller for a Multi-Zone HVAC Facility 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 8, 页码: 1704-1715
作者:  Syed Ali Asad Rizvi;  Amanda J. Pertzborn;  Zongli Lin
Adobe PDF(4532Kb)  |  收藏  |  浏览/下载:160/68  |  提交时间:2023/07/20
HVAC control  optimal tracking  Q-learning  reinforcement learning (RL)  
Secure Underwater Distributed Antenna Systems: A Multi-Agent Reinforcement Learning Approach 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2023, 卷号: 10, 期号: 7, 页码: 1622-1624
作者:  Chaofeng Wang;  Zhicheng Bi;  Yaping Wan
Adobe PDF(381Kb)  |  收藏  |  浏览/下载:65/12  |  提交时间:2023/06/14