CASIA OpenIR

浏览/检索结果: 共18条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:37/16  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Learning Robust Communication by Adversarial Training in Networked System Control 期刊论文
Lecture Notes in Electrical Engineering, 2024, 页码: Chapter 52 978-981-97-3335-4
作者:  Runji, Lin;  Haifeng, Zhang
Adobe PDF(8334Kb)  |  收藏  |  浏览/下载:46/17  |  提交时间:2024/06/11
Networked System Control  Robustness  Communicative Multi-Agent Reinforcement Learning  
Human-robot object handover: Recent progress and future direction 期刊论文
Biomimetic Intelligence and Robotics, 2024, 卷号: 4, 页码: 100145
作者:  Duan, Haonan;  Yang, Yifan;  Li, Daheng;  Wang, Peng
Adobe PDF(1839Kb)  |  收藏  |  浏览/下载:47/16  |  提交时间:2024/05/29
Human–robot interactions  Object handover  
Position Control of an Underwater Biomimetic Vehicle-Manipulator System via Reinforcement Learning 会议论文
, Liuzhou, China, 20-22 November 2020
作者:  Ma, Ruichen;  Wang, Yu;  Gao, Zisen;  Zhao, Tianzi;  Wang, Rui;  Wang, Shuo;  Zhou, Chao
Adobe PDF(927Kb)  |  收藏  |  浏览/下载:90/40  |  提交时间:2023/08/03
Second-Order Global Attention Networks for Graph Classification and Regression 会议论文
, Beijing, China, August 27-28, 2022
作者:  Hu Fenyu;  Cui Zeyu;  Wu Shu;  Liu Qiang;  Wu Jinlin;  Wang Liang;  Tan Tieniu
Adobe PDF(69424Kb)  |  收藏  |  浏览/下载:227/73  |  提交时间:2023/07/06
AlphaHoldem: High-Performance Artificial Intelligence for Heads-Up No-Limit Poker via End-to-End Reinforcement Learning 会议论文
, 线上, 2022-02-22
作者:  Zhao EM(赵恩民);  Yan RY(闫仁业);  Li JQ(李金秋);  Li K(李凯);  Xing JL(兴军亮)
Adobe PDF(2593Kb)  |  收藏  |  浏览/下载:214/79  |  提交时间:2023/06/29
Optimal Strategy for Aircraft Pursuit-Evasion Games via Self-Play Iteration 期刊论文
Machine Intelligence Research, 2023, 页码: 1-12
作者:  Wang Xin;  Wei Qinglai;  Li Tao;  Zhang Jie
Adobe PDF(1556Kb)  |  收藏  |  浏览/下载:213/82  |  提交时间:2023/06/26
A novel iterative adaptive critic design for smart home energy systems with solar energy 会议论文
, 中国厦门, 2022年11月
作者:  Liao ZH(廖泽华);  Wei, Qinglai;  Li, Hongyang
Adobe PDF(965Kb)  |  收藏  |  浏览/下载:194/82  |  提交时间:2023/06/06
Structure-Enhanced Heterogeneous Graph Contrastive Learning 会议论文
, Online, 2022-3
作者:  Zhu, Yanqiao;  Xu, Yichen;  Cui, Hejie;  Yang, Carl;  Liu, Qiang;  Wu, Shu
Adobe PDF(598Kb)  |  收藏  |  浏览/下载:232/66  |  提交时间:2022/06/13
Parallel Adaptive Critic Designs of Optimal Control for Ice-Storage Air Conditioning Systems 会议论文
, Xiamen, China, 2019-12
作者:  Liao, Zehua;  Wei, Qinglai;  Song, Ruizhuo
浏览  |  Adobe PDF(199Kb)  |  收藏  |  浏览/下载:317/90  |  提交时间:2020/06/26
Parallel adaptive critic design  Adaptive dynamic programming  Particle swarm optimization  Ice-storage air conditioning