CASIA OpenIR

Browse/Search results: 323 records in total, showing 1-10

TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training (Conference paper)
New Orleans Convention Center, USA, 2022-11-28 to 2022-12-9
Authors: Yulong Liu; Guibo Zhu; Bin Zhu; Qi Song; Guojing Ge; Haoran Chen; Guanhui Qiao; Ru Peng; Lingxiang Wu; Jinqiao Wang
Adobe PDF(2408Kb) | Views/Downloads: 8/1 | Submitted: 2024/06/06
GraphMLLM: A Graph-based Multi-level Layout Language-independent Model for Document Understanding (Conference paper)
Athens, Greece, 2024-09
Authors: He-Sen Dai; Xiao-Hui Li; Fei Yin; Xudong Yan; Shuqi Mei; Cheng-Lin Liu
Adobe PDF(967Kb) | Views/Downloads: 14/5 | Submitted: 2024/06/05
Visual information extraction  Self-supervised pre-training  Multi-level page layouts  
Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search (Conference paper)
Paris, France, 2023-10-02 to 2023-10-06
Authors: Benzhi Wang; Yang Yang; Jinlin Wu; Guo-jun Qi; Zhen Lei
Adobe PDF(6488Kb) | Views/Downloads: 8/1 | Submitted: 2024/06/04
Person search  Person re-identification  Weakly supervised learning  Metric learning  Pseudo-label prediction
High-Fidelity Clothed Avatar Reconstruction from a Single Image (Conference paper)
Vancouver, Canada, 2023-06-18 to 2023-06-22
Authors: Tingting Liao; Xiaomei Zhang; Yuliang Xiu; Hongwei Yi; Xudong Liu; Guo-Jun Qi; Yong Zhang; Xuan Wang; Xiangyu Zhu; Zhen Lei
Adobe PDF(9282Kb) | Views/Downloads: 8/3 | Submitted: 2024/06/03
Coarse-to-Fine Recurrently Aligned Transformer with Balance Tokens for Video Moment Retrieval and Highlight Detection (Conference paper)
Yokohama, Japan, 2024-6
Authors: Pan Yi; Zhang Yujia; Chang Hui; Shiying Sun; Zhou Feihu; Zhao Xiaoguang
Adobe PDF(1027Kb) | Views/Downloads: 19/8 | Submitted: 2024/05/31
ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning (Conference paper)
New Orleans, USA, 2023-12-10 to 2023-12-16
Authors: Mingyu Xu; Zheng Lian; Lei Feng; Bin Liu; Jianhua Tao
Adobe PDF(861Kb) | Views/Downloads: 14/5 | Submitted: 2024/05/31
PCEN: Potential Correlation-Enhanced Network for Multimodal Named Entity Recognition (Conference paper)
Charlotte, NC, USA, 02-03 October 2023
Authors: Jiakai Geng; Chenyang Zhang; Linjing Li; Qing Yang; Daniel Zeng
Adobe PDF(4985Kb) | Views/Downloads: 10/3 | Submitted: 2024/05/31
named entity recognition  multimodal learning  vision-language pre-trained model  inconsistency loss  
GaFET: Learning Geometry-aware Facial Expression Translation from In-The-Wild Images (Conference paper)
Paris, France, 10.2-10.6
Authors: Tianxiang Ma; Bingchuan Li; Qian He; Jing Dong; Tieniu Tan
Adobe PDF(7315Kb) | Views/Downloads: 6/0 | Submitted: 2024/05/29
Distance-Ranking-Based Weighted Triplet Loss for Visual Place Recognition (Conference paper)
Tianjin, China, 2023-12-8
Authors: Xiong Yu; Xu Shixiong; Meng Gaofeng
Adobe PDF(426Kb) | Views/Downloads: 18/4 | Submitted: 2024/05/28
Learning Video Localization on Segment-Level Video Copy Detection with Transformer (Conference paper)
Heraklion city, Crete, Greece, 2023-9-26
Authors: Chi, Zhang; Jie, Liu; Shuwu, Zhang; Zhi, Zeng; Ying, Huang
Adobe PDF(1152Kb) | Views/Downloads: 11/2 | Submitted: 2024/05/28
Video Copy Localization  Content Based Video Retrieval  Temporal Alignment