已选(0)清除
条数/页: 排序方式: |
| Training Large Language Models to Follow System Prompt with Self-Supervised Fine-Tuning 会议论文 , YOKOHAMA, JAPAN, 2024-07 作者: Junyan Qiu ; Haitao Wang ; Yiping Yang![](/image/person.jpg)
Adobe PDF(1596Kb)  |   收藏  |  浏览/下载:44/19  |  提交时间:2024/06/17 large language models supervised fine-tuning instruct tuning stylized generation |
| Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文 Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4 作者: Runji, Lin ; Haifeng, Zhang
Adobe PDF(8334Kb)  |   收藏  |  浏览/下载:47/17  |  提交时间:2024/06/11 Networked System Control Robustness Communicative Multi-Agent Reinforcement Learning |
| Ultimately Bounded Output Feedback Control for Networked Nonlinear Systems With Unreliable Communication Channel: A Buffer-Aided Strategy 期刊论文 IEEE/CAA Journal of Automatica Sinica, 2024, 卷号: 11, 期号: 7, 页码: 1566-1578 作者: Yuhan Zhang; Zidong Wang; Lei Zou; Yun Chen; Guoping Lu
Adobe PDF(2016Kb)  |   收藏  |  浏览/下载:35/12  |  提交时间:2024/06/07 Buffer-aided strategy neural networks nonlinear control output-feedback control unreliable communication channel |
| Spiking Generative Adversarial Network with Attention Scoring Decoding 期刊论文 Neural Networks, 2024, 页码: 106423 作者: Feng, Linghao; Zhao, Dongcheng ; Zeng, Yi![](/image/person.jpg)
Adobe PDF(1067Kb)  |   收藏  |  浏览/下载:46/19  |  提交时间:2024/06/06 |
| Boosting On-Policy Actor–Critic With Shallow Updates in Critic 期刊论文 IEEE Transactions on Neural Networks and Learning Systems, 2024, 页码: 1-10 作者: Luntong Li; Yuanheng Zhu![](/image/person.jpg)
Adobe PDF(9953Kb)  |   收藏  |  浏览/下载:46/15  |  提交时间:2024/06/05 |
| MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning 期刊论文 IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 1-12 作者: Boyu Li; Haran Li; Yuanheng Zhu ; Dongbin Zhao![](/image/person.jpg)
Adobe PDF(9953Kb)  |   收藏  |  浏览/下载:34/10  |  提交时间:2024/06/05 |
| 稀疏奖励环境下基于自博弈框架的智能空战算法研究 学位论文 , 2024 作者: 何少钦![](/image/person.jpg)
Adobe PDF(4570Kb)  |   收藏  |  浏览/下载:53/1  |  提交时间:2024/05/30 强化学习,离线强化学习,空战,智能决策,好奇心机制 |
| Isoperimetric Constraint Inference for Discrete-Time Nonlinear Systems Based on Inverse Optimal Control 期刊论文 IEEE TRANSACTIONS ON CYBERNETICS, 2024, 页码: 1 - 13 作者: Wei, Qinglai ; Li, Tao ; Zhang, Jie ; Li, Hongyang ; Wang, Xin; Xiao, Jun
Adobe PDF(1700Kb)  |   收藏  |  浏览/下载:50/22  |  提交时间:2024/05/28 |
| 受限场景下知识引导的人脸图像编辑研究 学位论文 , 2024 作者: 吕月明![](/image/person.jpg)
Adobe PDF(32704Kb)  |   收藏  |  浏览/下载:81/11  |  提交时间:2024/05/28 受限场景 人脸图像编辑 生成对抗网络 扩散模型 |
| Optimal Strategy for Aircraft Pursuit-evasion Games via Self-play Iteration 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 3, 页码: 585-596 作者: Xin Wang ; Qing-Lai Wei ; Tao Li; Jie Zhang![](/image/person.jpg)
Adobe PDF(1750Kb)  |   收藏  |  浏览/下载:68/22  |  提交时间:2024/05/23 Differential games, pursuit-evasion games, nonlinear control, optimal control, Nash equilibrium solution |