CASIA OpenIR

浏览/检索结果: 共16条,第1-10条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Fuse & Calibrate: A bi-directional Vision-Language Guided Framework for Referring Image Segmentation 会议论文
, Tianjin, China, 2024/08/05
作者:  Yichen Yan;  Xingjian He;  Sihan Chen;  Shichen Lu;  Jing Liu
Adobe PDF(1978Kb)  |  收藏  |  浏览/下载:27/10  |  提交时间:2024/07/08
Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision  
Optimizing Reward Function Weights and Enhancing Control Mechanisms for Bipedal Robots Using LSTM and Attention Mechanisms 会议论文
, 河北保定, 2023-8-16
作者:  Cui LZ(崔凌志);  Tianqi Deng;  Lihua Ma;  Wenhao He
Adobe PDF(541Kb)  |  收藏  |  浏览/下载:36/14  |  提交时间:2024/07/01
Adaptive Multi-Agent Coordination among Different Team Attribute Tasks via Contextual Meta-Reinforcement Learning 会议论文
, 河南开封, 2024年5月17-19日
作者:  Huang, Shangjing;  Zhao, Zijie;  Zhu, Yuanheng;  Zhao, Dongbin
Adobe PDF(15515Kb)  |  收藏  |  浏览/下载:38/13  |  提交时间:2024/06/26
Towards Zero-Shot Generalization: Mutual Information-Guided Hierarchical Multi-Agent Coordination 会议论文
, 日本, 2024-6
作者:  Zhang Qingyang;  Xu Bo
Adobe PDF(8862Kb)  |  收藏  |  浏览/下载:35/10  |  提交时间:2024/06/25
强化学习,分层强化学习  
User Response Modeling in Reinforcement Learning for Ads Allocation 会议论文
, 新加坡, May 13 - 17, 2024
作者:  Zhang, Zhiyuan;  Zhang, Qichao;  Wu, Xiaoxu;  Shi, Xiaowen;  Liao, Guogang;  Wang, Yongkong;  Wang, xingxing;  Zhao, Dongbin
Adobe PDF(2077Kb)  |  收藏  |  浏览/下载:52/21  |  提交时间:2024/06/25
Ads Allocation  Reinforcement Learning  User Response Modeling  
Review on Peg-in-Hole Insertion Technology Based on Reinforcement Learning 会议论文
, Chongqing, China, 2023-11
作者:  Shen Liancheng;  Su Jianhua;  Zhang Xiaodong
Adobe PDF(254Kb)  |  收藏  |  浏览/下载:53/23  |  提交时间:2024/06/24
—Robot Peg-in-hole Insertion  Reinforcement Learning  Meta-Reinforcement Learning  
Memory-based Error Label Suppression for Embodied Self-Improving Object Detection 会议论文
, 意大利巴里, 2024-8-28
作者:  Deng JR(邓杰仁);  Zhang HJ(张好剑);  Hu JH(胡建华);  Wang YK(王云宽)
Adobe PDF(2603Kb)  |  收藏  |  浏览/下载:63/24  |  提交时间:2024/06/20
A New Pre-Training Paradigm for Offline Multi-Agent Reinforcement Learning with Suboptimal Data 会议论文
, Seoul, Korea, 2024.4.14-2024.4.19
作者:  Meng Linghui;  Zhang Xi;  Xing Dengpeng;  Xu Bo
Adobe PDF(964Kb)  |  收藏  |  浏览/下载:55/22  |  提交时间:2024/06/11
TIM: An Efficient Temporal Interaction Module for Spiking Transformer 会议论文
, Jeju, korea, 2024-08
作者:  Shen, Sicheng;  Zhao, Dongcheng;  Shen, Guobin;  Zeng, Yi
Adobe PDF(717Kb)  |  收藏  |  浏览/下载:43/8  |  提交时间:2024/06/06
Token-level Direct Preference Optimization 会议论文
, Vienna, Austria, 2024/7/21-27
作者:  Zeng,Yongcheng;  Liu,Guoqing;  Ma,Weiyu;  Yang,Ning;  Zhang,Haifeng;  Wang,Jun
Adobe PDF(883Kb)  |  收藏  |  浏览/下载:80/26  |  提交时间:2024/06/05