CASIA OpenIR

浏览/检索结果: 共100条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Gait Learning for 3D Bipedal Robots Based on a Combined Strategy of Hybrid Zero Dynamics Feedback Control and Periodic Reward 会议论文
, 中国湖南长沙, 2024-5-25
作者:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(690Kb)  |  收藏  |  浏览/下载:34/13  |  提交时间:2024/07/01
Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation 会议论文
, 新奥尔良, 2023-12-9 至 2023-12-15
作者:  Keji He;  Chenyang Si;  Zhihe Lu;  Yan Huang;  Liang Wang;  Xinchao Wang
Adobe PDF(2505Kb)  |  收藏  |  浏览/下载:42/16  |  提交时间:2024/06/26
Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning 会议论文
, 河南开封, 2024年5月17-19日
作者:  Huang, Shangjing;  Zhao, Zijie;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(15515Kb)  |  收藏  |  浏览/下载:29/10  |  提交时间:2024/06/26
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:21/7  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:37/16  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection 会议论文
, Seattle, United States, 2024-06-17至2024-06-21
作者:  Wang, Hanshi;  Zhang, Zhipeng;  Gao, Jin;  Hu, Weiming
Adobe PDF(2903Kb)  |  收藏  |  浏览/下载:40/7  |  提交时间:2024/06/21
ESTATE: Expert-Guided State Text Enhancement for Zero-Shot Industrial Anomaly Detection 会议论文
, Abu Dhabi, UAE, 2024.10.27-2024.10.30
作者:  Bingke Zhu;  Hao Li;  Changlin Chen;  Liujie Hua;  Jinqiao Wang
Adobe PDF(2871Kb)  |  收藏  |  浏览/下载:33/8  |  提交时间:2024/06/21
Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction 会议论文
, Greece, 2023-5
作者:  Xu YF(徐一凡);  Pu ZQ(蒲志强);  Cai QA(蔡奇昂);  Li FM(李非墨);  Chai XH(柴兴华)
Adobe PDF(7610Kb)  |  收藏  |  浏览/下载:25/11  |  提交时间:2024/06/21
Memory-based Error Label Suppression for Embodied Self-Improving Object Detection 会议论文
, 意大利巴里, 2024-8-28
作者:  Deng JR(邓杰仁);  Zhang HJ(张好剑);  Hu JH(胡建华);  Wang YK(王云宽)
Adobe PDF(2603Kb)  |  收藏  |  浏览/下载:55/21  |  提交时间:2024/06/20
Training Large Language Models to Follow System Prompt with Self-Supervised Fine-Tuning 会议论文
, YOKOHAMA, JAPAN, 2024-07
作者:  Junyan Qiu;  Haitao Wang;  Yiping Yang
Adobe PDF(1596Kb)  |  收藏  |  浏览/下载:44/19  |  提交时间:2024/06/17
large language models  supervised fine-tuning  instruct tuning  stylized generation