CASIA OpenIR

浏览/检索结果: 共439条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Learning to Play Football from Sports Perspective: A Knowledge-embedded Deep Reinforcement Learning Framework 期刊论文
IEEE Transactions on Games, 2022, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(2957Kb)  |  收藏  |  浏览/下载:18/5  |  提交时间:2024/07/12
QFuture: Learning Future Expectation Cognition in Multi-Agent Reinforcement Learning 期刊论文
IEEE Transactions on Cognitive and Developmental Systems, 2024, 页码: 12
作者:  Liu BY(刘博寅)
Adobe PDF(6675Kb)  |  收藏  |  浏览/下载:15/2  |  提交时间:2024/07/12
Learning State-Specific Action Masks for Reinforcement Learning 期刊论文
Algorithms, 2024, 卷号: 17, 期号: 2, 页码: 60
作者:  Wang ZY(王梓薏);  Li XR(李欣然);  Sun LY(孙罗洋);  Zhang HF(张海峰);  Liu HL(刘华林);  Jun Wang
Adobe PDF(2976Kb)  |  收藏  |  浏览/下载:24/12  |  提交时间:2024/07/05
reinforcement learning  exploration efficiency  space reduction  
Structured Light-Based Underwater Collision-Free Navigation and Dense Mapping System for Refined Exploration in Unknown Dark Environments 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2024, 页码: 14
作者:  Ou, Yaming;  Fan, Junfeng;  Zhou, Chao;  Kang, Song;  Zhang, Zhuoliang;  Hou, Zeng-Guang;  Tan, Min
收藏  |  浏览/下载:8/0  |  提交时间:2024/07/03
Refined exploration  structured light vision  underwater collision-free navigation  underwater dense mapping  
Online Optimization of Normalized CPGs for a Multi-Joint Robotic Fish 会议论文
, 中国,上海, 2021年7月
作者:  Tong R(仝茹);  Wu ZX(吴正兴);  Wang J(王健);  Tan M(谭民);  Yu JZ(喻俊志)
Adobe PDF(456Kb)  |  收藏  |  浏览/下载:23/14  |  提交时间:2024/06/26
Visual Pencil: Design of Portable Human-Computer Interaction Based on 2D Visual Tracking 会议论文
, 北京, 2020年10月
作者:  Tong R(仝茹);  Wang TZ(王天柱);  Yu JZ(喻俊志)
Adobe PDF(185Kb)  |  收藏  |  浏览/下载:19/5  |  提交时间:2024/06/26
Balancing Exploration and Exploitation in Hierarchical Reinforcement Learning via Latent Landmark Graphs 会议论文
, 澳大利亚, 2023-6
作者:  Zhang Qingyang;  Yang Yiming;  Ruan Jingqing;  Xiong Xuantang;  Xing Dengpeng;  Xu Bo
Adobe PDF(7948Kb)  |  收藏  |  浏览/下载:31/13  |  提交时间:2024/06/25
强化学习,分层强化学习  
Latent Landmark Graph for Efficient Exploration-Exploitation Balance in Hierarchical Reinforcement Learning 期刊论文
Machine Intelligence Research, 2023, 页码: 158
作者:  Zhang Qingyang;  Zhang Hongming;  Xing Dengpeng;  Bo Xu
Adobe PDF(9639Kb)  |  收藏  |  浏览/下载:17/9  |  提交时间:2024/06/25
MULFE: A Multi-Level Benchmark for Free Text Model Editing 会议论文
, Bangkok, Thailand, 2024-08
作者:  Wang, Chenhao;  Cao, Pengfei;  Jin, Zhuoran;  Chen, Yubo;  Zeng, Daojian;  Liu, Kang;  Zhao, Jun
Adobe PDF(571Kb)  |  收藏  |  浏览/下载:18/8  |  提交时间:2024/06/25
LEGO: A Multi-agent Collaborative Framework with Role-playing and Iterative Feedback for Causality Explanation Generation 会议论文
, Singapore, 2023-12
作者:  Zhitao He;  Pengfei Cao;  Yubo Chen;  Kang Liu;  Jun Zhao
Adobe PDF(1153Kb)  |  收藏  |  浏览/下载:13/4  |  提交时间:2024/06/25