CASIA OpenIR

浏览/检索结果: 共41条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Learning Superior Cooperative Policy in Competitive Multi-team Reinforcement Learning 会议论文
, Gold Coast, Australia, 2023-6
作者:  Qingxu Fu;  Tenghai Qiu;  Zhiqiang Pu;  Jianqiang Yi;  Xiaolin Ai;  Wanmai Yuan
Adobe PDF(25675Kb)  |  收藏  |  浏览/下载:10/0  |  提交时间:2024/06/05
EFCPose: End-to-End Multi-Person Pose Estimation with Fully Convolutional Heads 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 页码: early access
作者:  Wang Haixin;  Zhou Lu;  Chen Yingying;  Wang Jinqiao
Adobe PDF(4407Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/06/03
A Novel Coiled Cable-Conduit-Driven Hyper-Redundant Manipulator for Remote Operating in Narrow Spaces 会议论文
, Detroit, USA, 2023-10-1
作者:  Mingrui, Luo;  Yunong, Tian;  En, Li;  Minghao, Chen;  Cunfeng, Kang;  Guodong, Yang;  Min, Tan
Adobe PDF(5151Kb)  |  收藏  |  浏览/下载:11/3  |  提交时间:2024/05/31
Design methodology  Redundancy  Inspection  Manipulators  Control systems  Cleaning  Safety  
Reward Estimation with Scheduled Knowledge Distillation for Dialogue Policy Learning 期刊论文
Connection Science, 2023, 卷号: 35, 期号: 1, 页码: 2174078
作者:  Qiu JY(邱俊彦);  Haidong Zhang;  Yiping Yang
Adobe PDF(831Kb)  |  收藏  |  浏览/下载:10/2  |  提交时间:2024/05/29
reinforcement learning  dialogue policy learning  curriculum learning  knowledge distillation  
Cooperative Task Scheduling and Planning Considering Resource Conflicts and Precedence Constraints 期刊论文
International Journal of Precision Engineering and Manufacturing, 2023, 页码: 1503-1516
作者:  Li, Donghui;  Su, Hu;  Xu, Xinyi;  Wang, Qingbin;  Qin, Jie;  Zou, Wei
Adobe PDF(2513Kb)  |  收藏  |  浏览/下载:9/4  |  提交时间:2024/05/28
Dual Self-Awareness Value Decomposition Framework without Individual Global Max for Cooperative MARL 会议论文
, New Orleans, LA, USA, December 10-16, 2023
作者:  Zhiwei Xu;  Bin Zhang;  Dapeng Li;  Guangchong Zhou;  Zeren Zhang;  Guoliang Fan
Adobe PDF(8700Kb)  |  收藏  |  浏览/下载:12/2  |  提交时间:2024/05/28
HAVEN: Hierarchical Cooperative Multi-Agent Reinforcement Learning with Dual Coordination Mechanism 会议论文
, Washington, DC, USA, February 7-14, 2023
作者:  Zhiwei Xu;  Yunpeng Bai;  Bin Zhang;  Dapeng Li;  Guoliang Fan
Adobe PDF(3345Kb)  |  收藏  |  浏览/下载:9/1  |  提交时间:2024/05/28
未知非线性零和博弈最优跟踪的事件触发控制设计 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 91-101
作者:  王鼎;  胡凌治;  赵明明;  哈明鸣;  乔俊飞
Adobe PDF(1996Kb)  |  收藏  |  浏览/下载:25/12  |  提交时间:2024/05/09
自适应评判设计  事件触发控制  神经网络  最优跟踪控制  稳定性分析  零和博弈  
基于因果建模的强化学习控制:现状及展望 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 3, 页码: 661-677
作者:  孙悦雯;  柳文章;  孙长银
Adobe PDF(1926Kb)  |  收藏  |  浏览/下载:20/5  |  提交时间:2024/05/09
强化学习控制  因果发现  因果推理  迁移学习  表示学习  
Machine Learning Methods in Solving the Boolean Satisfiability Problem 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 5, 页码: 640-655
作者:  Wenxuan Guo;  Hui-Ling Zhen;  Xijun Li;  Wanqian Luo;  Mingxuan Yuan;  Yaohui Jin;  Junchi Yan
Adobe PDF(1518Kb)  |  收藏  |  浏览/下载:30/7  |  提交时间:2024/04/23
Machine learning (ML), Boolean satisfiability (SAT), deep learning, graph neural networks (GNNs), combinatorial optimization