CASIA OpenIR

浏览/检索结果: 共35条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Social Relation Reasoning Based on Triangular Constraints 会议论文
, 美国华盛顿, 2023年2月7日-14日
作者:  Guo, Yunfei;  Yin, Fei;  Feng, Wei;  Yan, Xudong;  Xue, Tao;  Mei, Shuqi;  Liu, Cheng-Lin
Adobe PDF(977Kb)  |  收藏  |  浏览/下载:13/5  |  提交时间:2024/06/13
FreeSeg: Unified, Universal and Open-Vocabulary Image Segmentation 会议论文
, 加拿大温哥华市, 6.18-6.22
作者:  Jie Qin;  Jie Wu;  Pengxiang Yan;  Ming Li;  Ren Yuxi;  Xuefeng Xiao;  Yitong Wang;  Rui Wang;  Shilei Wen;  Xin Pan;  Xingang Wang
Adobe PDF(5688Kb)  |  收藏  |  浏览/下载:18/5  |  提交时间:2024/06/03
BEVBert: Multimodal Map Pre-training for Language-guided Navigation 会议论文
Proceedings of the IEEE International Conference on Computer Vision, Paris, France, 2023-10-2
作者:  Dong An;  Yuankai Qi;  Yangguang Li;  Yan Huang;  Liang Wang;  Tieniu Tan;  Jing Shao
Adobe PDF(1722Kb)  |  收藏  |  浏览/下载:23/6  |  提交时间:2024/05/28
Hierarchical Attention Network for Open-Set Fine-Grained Recognition 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 页码: 1-14
作者:  Jiayin, Sun;  Hong, Wang;  Qiulei, Dong
Adobe PDF(2596Kb)  |  收藏  |  浏览/下载:24/5  |  提交时间:2024/05/28
视觉语言导航研究进展 期刊论文
自动化学报, 2023, 卷号: 49, 期号: 1, 页码: 1-14
作者:  司马双霖;  黄岩;  何科技;  安东;  袁辉;  王亮
Adobe PDF(6272Kb)  |  收藏  |  浏览/下载:31/9  |  提交时间:2024/05/09
视觉语言导航  视觉语言理解  跨模态匹配  具身智能  
Cross-modal Contrastive Learning for Generalizable and Efficient Image-text Retrieval 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 569-582
作者:  Haoyu Lu;  Yuqi Huo;  Mingyu Ding;  Nanyi Fei;  Zhiwu Lu
Adobe PDF(2928Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/04/23
Image-text retrieval, multimodal modeling, contrastive learning, weak correlation, computer vision  
Large-scale Multi-modal Pre-trained Models: A Comprehensive Survey 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 4, 页码: 447-482
作者:  Xiao Wang;  Guangyao Chen;  Guangwu Qian;  Pengcheng Gao;  Xiao-Yong Wei;  Yaowei Wang;  Yonghong Tian;  Wen Gao
Adobe PDF(3540Kb)  |  收藏  |  浏览/下载:33/5  |  提交时间:2024/04/23
Multi-modal (MM), pre-trained model (PTM), information fusion, representation learning, deep learning  
VLP: A Survey on Vision-language Pre-training 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 38-56
作者:  Fei-Long Chen;  Du-Zhen Zhang;  Ming-Lun Han;  Xiu-Yi Chen;  Jing Shi;  Shuang Xu;  Bo Xu
Adobe PDF(1427Kb)  |  收藏  |  浏览/下载:27/7  |  提交时间:2024/04/23
Vision and language  pre-training  transformers  multimodal learning  representation learning  
Federated Learning with Privacy-preserving and Model IP-right-protection 期刊论文
Machine Intelligence Research, 2023, 卷号: 20, 期号: 1, 页码: 19-37
作者:  Qiang Yang;  Anbu Huang;  Lixin Fan;  Chee Seng Chan;  Jian Han Lim;  Kam Woh Ng;  Ding Sheng Ong;  Bowen Li
Adobe PDF(2634Kb)  |  收藏  |  浏览/下载:20/5  |  提交时间:2024/04/23
Federated learning  privacy-preserving machine learning  security  decentralized learning  intellectual property protection  
几何图形解析与解题 学位论文
, 2023
作者:  张明亮
Adobe PDF(6293Kb)  |  收藏  |  浏览/下载:98/2  |  提交时间:2024/04/03
几何图形  图例解析  几何解题  定理知识验证