CASIA OpenIR

浏览/检索结果: 共99条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Training Large Language Models to Follow System Prompt with Self-Supervised Fine-Tuning 会议论文
, YOKOHAMA, JAPAN, 2024-07
作者:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(1596Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/17
large language models  supervised fine-tuning  instruct tuning  stylized generation  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:30/10  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy 期刊论文
IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1566-1578
作者:  Yuhan Zhang;  Zidong Wang;  Lei Zou;  Yun Chen;  Guoping Lu
Adobe PDF(2016Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/06/07
Buffer-aided strategy  neural networks  nonlinear control  output-feedback control  unreliable communication channel  
Spiking Generative Adversarial Network with Attention Scoring Decoding 期刊论文
Neural Networks, 2024, 页码: 106423
作者:  Feng, Linghao;  Zhao, Dongcheng;  Zeng, Yi
Adobe PDF(1067Kb)  |  收藏  |  浏览/下载:33/13  |  提交时间:2024/06/06
Boosting On-Policy Actor–Critic With Shallow Updates in Critic 期刊论文
IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10
作者:  Luntong Li;  Yuanheng Zhu
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:29/10  |  提交时间:2024/06/05
MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12
作者:  Boyu Li;  Haran Li;  Yuanheng Zhu;  Dongbin Zhao
Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/05
稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文
, 2024
作者:  何少钦
Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:37/1  |  提交时间:2024/05/30
强化学习,离线强化学习,空战,智能决策,好奇心机制  
Isoperimetric Constraint Inference for Discrete-Time Nonlinear Systems Based on Inverse Optimal Control 期刊论文
IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 1 - 13
作者:  Wei, Qinglai;  Li, Tao;  Zhang, Jie;  Li, Hongyang;  Wang, Xin;  Xiao, Jun
Adobe PDF(1700Kb)  |  收藏  |  浏览/下载:38/15  |  提交时间:2024/05/28
受限场景下知识引导的人脸图像编辑研究 学位论文
, 2024
作者:  吕月明
Adobe PDF(32704Kb)  |  收藏  |  浏览/下载:75/11  |  提交时间:2024/05/28
受限场景  人脸图像编辑  生成对抗网络  扩散模型  
Optimal Strategy for Aircraft Pursuit-evasion Games via Self-play Iteration 期刊论文
Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 585-596
作者:  Xin Wang;  Qing-Lai Wei;  Tao Li;  Jie Zhang
Adobe PDF(1750Kb)  |  收藏  |  浏览/下载:52/17  |  提交时间:2024/05/23
Differential games, pursuit-evasion games, nonlinear control, optimal control, Nash equilibrium solution