已选(0)清除
条数/页: 排序方式: |
| Training Large Language Models to Follow System Prompt with Self-Supervised Fine-Tuning 会议论文 , YOKOHAMA, JAPAN, 2024-07 作者: Junyan Qiu; Haitao Wang; Yiping Yang Adobe PDF(1596Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/06/17 large language models supervised fine-tuning instruct tuning stylized generation |
| Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文 Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4 作者: Runji, Lin; Haifeng, Zhang Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:30/10  |  提交时间:2024/06/11 Networked System Control Robustness Communicative Multi-Agent Reinforcement Learning |
| Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1566-1578 作者: Yuhan Zhang; Zidong Wang; Lei Zou; Yun Chen; Guoping Lu Adobe PDF(2016Kb)  |  收藏  |  浏览/下载:28/9  |  提交时间:2024/06/07 Buffer-aided strategy neural networks nonlinear control output-feedback control unreliable communication channel |
| Spiking Generative Adversarial Network with Attention Scoring Decoding 期刊论文 Neural Networks, 2024, 页码: 106423 作者: Feng, Linghao; Zhao, Dongcheng; Zeng, Yi Adobe PDF(1067Kb)  |  收藏  |  浏览/下载:33/13  |  提交时间:2024/06/06 |
| Boosting On-Policy Actor–Critic With Shallow Updates in Critic 期刊论文 IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10 作者: Luntong Li; Yuanheng Zhu Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:29/10  |  提交时间:2024/06/05 |
| MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12 作者: Boyu Li; Haran Li; Yuanheng Zhu; Dongbin Zhao Adobe PDF(9953Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/05 |
| 稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文 , 2024 作者: 何少钦 Adobe PDF(4570Kb)  |  收藏  |  浏览/下载:37/1  |  提交时间:2024/05/30 强化学习,离线强化学习,空战,智能决策,好奇心机制 |
| Isoperimetric Constraint Inference for Discrete-Time Nonlinear Systems Based on Inverse Optimal Control 期刊论文 IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 1 - 13 作者: Wei, Qinglai; Li, Tao; Zhang, Jie; Li, Hongyang; Wang, Xin; Xiao, Jun Adobe PDF(1700Kb)  |  收藏  |  浏览/下载:38/15  |  提交时间:2024/05/28 |
| 受限场景下知识引导的人脸图像编辑研究 学位论文 , 2024 作者: 吕月明 Adobe PDF(32704Kb)  |  收藏  |  浏览/下载:75/11  |  提交时间:2024/05/28 受限场景 人脸图像编辑 生成对抗网络 扩散模型 |
| Optimal Strategy for Aircraft Pursuit-evasion Games via Self-play Iteration 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 585-596 作者: Xin Wang; Qing-Lai Wei; Tao Li; Jie Zhang Adobe PDF(1750Kb)  |  收藏  |  浏览/下载:52/17  |  提交时间:2024/05/23 Differential games, pursuit-evasion games, nonlinear control, optimal control, Nash equilibrium solution |