| Gait Learning for 3D Bipedal Robots Based on a Combined Strategy of Hybrid Zero Dynamics Feedback Control and Periodic Reward. Conference paper, Changsha, Hunan, China, 2024-5-25. Authors: Cui LZ(崔凌志); Tianqi Deng; Lihua Ma; Wenhao He. Adobe PDF(690Kb)  |  Views/Downloads: 34/13  |  Submitted: 2024/07/01 |
| Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation. Conference paper, New Orleans, 2023-12-9 to 2023-12-15. Authors: Keji He; Chenyang Si; Zhihe Lu; Yan Huang; Liang Wang; Xinchao Wang. Adobe PDF(2505Kb)  |  Views/Downloads: 42/16  |  Submitted: 2024/06/26 |
| Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning. Conference paper, Kaifeng, Henan, May 17-19, 2024. Authors: Huang, Shangjing; Zhao, Zijie; Zhu, Yuanheng; Zhao, Dongbin. Adobe PDF(15515Kb)  |  Views/Downloads: 29/10  |  Submitted: 2024/06/26 |
| Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination. Conference paper, Japan, 2024-6. Authors: Zhang Qingyang; Xu Bo. Adobe PDF(8862Kb)  |  Views/Downloads: 21/7  |  Submitted: 2024/06/25. Keywords: reinforcement learning; hierarchical reinforcement learning |
| User Response Modeling in Reinforcement Learning for Ads Allocation. Conference paper, Singapore, May 13 - 17, 2024. Authors: Zhang, Zhiyuan; Zhang, Qichao; Wu, Xiaoxu; Shi, Xiaowen; Liao, Guogang; Wang, Yongkong; Wang, Xingxing; Zhao, Dongbin. Adobe PDF(2077Kb)  |  Views/Downloads: 37/16  |  Submitted: 2024/06/25. Keywords: Ads Allocation; Reinforcement Learning; User Response Modeling |
| A-Teacher: Asymmetric Network for 3D Semi-Supervised Object Detection. Conference paper, Seattle, United States, 2024-06-17 to 2024-06-21. Authors: Wang, Hanshi; Zhang, Zhipeng; Gao, Jin; Hu, Weiming. Adobe PDF(2903Kb)  |  Views/Downloads: 40/7  |  Submitted: 2024/06/21 |
| ESTATE: Expert-Guided State Text Enhancement for Zero-Shot Industrial Anomaly Detection. Conference paper, Abu Dhabi, UAE, 2024.10.27-2024.10.30. Authors: Bingke Zhu; Hao Li; Changlin Chen; Liujie Hua; Jinqiao Wang. Adobe PDF(2871Kb)  |  Views/Downloads: 33/8  |  Submitted: 2024/06/21 |
| Improving Generalization of Multi-agent Reinforcement Learning through Domain-Invariant Feature Extraction. Conference paper, Greece, 2023-5. Authors: Xu YF(徐一凡); Pu ZQ(蒲志强); Cai QA(蔡奇昂); Li FM(李非墨); Chai XH(柴兴华). Adobe PDF(7610Kb)  |  Views/Downloads: 25/11  |  Submitted: 2024/06/21 |
| Memory-based Error Label Suppression for Embodied Self-Improving Object Detection. Conference paper, Bari, Italy, 2024-8-28. Authors: Deng JR(邓杰仁); Zhang HJ(张好剑); Hu JH(胡建华); Wang YK(王云宽). Adobe PDF(2603Kb)  |  Views/Downloads: 55/21  |  Submitted: 2024/06/20 |
| Training Large Language Models to Follow System Prompt with Self-Supervised Fine-Tuning. Conference paper, Yokohama, Japan, 2024-07. Authors: Junyan Qiu; Haitao Wang; Yiping Yang. Adobe PDF(1596Kb)  |  Views/Downloads: 44/19  |  Submitted: 2024/06/17. Keywords: large language models; supervised fine-tuning; instruct tuning; stylized generation |