| TaiSu: A 166M Large-scale High-Quality Dataset for Chinese Vision-Language Pre-training. Conference paper, New Orleans Convention Center, USA, 2022-11-28 to 2022-12-9. Authors: Yulong Liu; Guibo Zhu; Bin Zhu; Qi Song; Guojing Ge; Haoran Chen; Guanhui Qiao; Ru Peng; Lingxiang Wu; Jinqiao Wang. Adobe PDF(2408Kb)  |  Views/Downloads: 8/1  |  Submitted: 2024/06/06 |
| GraphMLLM: A Graph-based Multi-level Layout Language-independent Model for Document Understanding. Conference paper, Athens, Greece, 2024-09. Authors: He-Sen Dai; Xiao-Hui Li; Fei Yin; Xudong Yan; Shuqi Mei; Cheng-Lin Liu. Adobe PDF(967Kb)  |  Views/Downloads: 14/5  |  Submitted: 2024/06/05. Keywords: Visual information extraction; Self-supervised pre-training; Multi-level page layouts |
| Self-similarity Driven Scale-invariant Learning for Weakly Supervised Person Search. Conference paper, Paris, France, October 2-6, 2023. Authors: Benzhi Wang; Yang Yang; Jinlin Wu; Guo-jun Qi; Zhen Lei. Adobe PDF(6488Kb)  |  Views/Downloads: 8/1  |  Submitted: 2024/06/04. Keywords: person search; person re-identification; weakly supervised learning; metric learning; pseudo-label prediction |
| High-Fidelity Clothed Avatar Reconstruction from a Single Image. Conference paper, Vancouver, Canada, June 18-22, 2023. Authors: Tingting Liao; Xiaomei Zhang; Yuliang Xiu; Hongwei Yi; Xudong Liu; Guo-Jun Qi; Yong Zhang; Xuan Wang; Xiangyu Zhu; Zhen Lei. Adobe PDF(9282Kb)  |  Views/Downloads: 8/3  |  Submitted: 2024/06/03 |
| Coarse-to-Fine Recurrently Aligned Transformer with Balance Tokens for Video Moment Retrieval and Highlight Detection. Conference paper, Yokohama, Japan, 2024-6. Authors: Pan Yi; Zhang Yujia; Chang Hui; Shiying Sun; Zhou Feihu; Zhao Xiaoguang. Adobe PDF(1027Kb)  |  Views/Downloads: 19/8  |  Submitted: 2024/05/31 |
| ALIM: Adjusting Label Importance Mechanism for Noisy Partial Label Learning. Conference paper, New Orleans, USA, December 10-16, 2023. Authors: Mingyu Xu; Zheng Lian; Lei Feng; Bin Liu; Jianhua Tao. Adobe PDF(861Kb)  |  Views/Downloads: 14/5  |  Submitted: 2024/05/31 |
| PCEN: Potential Correlation-Enhanced Network for Multimodal Named Entity Recognition. Conference paper, Charlotte, NC, USA, 02-03 October 2023. Authors: Jiakai Geng; Chenyang Zhang; Linjing Li; Qing Yang; Daniel Zeng. Adobe PDF(4985Kb)  |  Views/Downloads: 10/3  |  Submitted: 2024/05/31. Keywords: named entity recognition; multimodal learning; vision-language pre-trained model; inconsistency loss |
| GaFET: Learning Geometry-aware Facial Expression Translation from In-The-Wild Images. Conference paper, Paris, France, October 2-6. Authors: Tianxiang Ma; Bingchuan Li; Qian He; Jing Dong; Tieniu Tan. Adobe PDF(7315Kb)  |  Views/Downloads: 6/0  |  Submitted: 2024/05/29 |
| Distance-Ranking-Based Weighted Triplet Loss for Visual Place Recognition. Conference paper, Tianjin, China, 2023-12-8. Authors: Xiong Yu; Xu Shixiong; Meng Gaofeng. Adobe PDF(426Kb)  |  Views/Downloads: 18/4  |  Submitted: 2024/05/28 |
| Learning Video Localization on Segment-Level Video Copy Detection with Transformer. Conference paper, Heraklion city, Crete, Greece, 2023-9-26. Authors: Chi, Zhang; Jie, Liu; Shuwu, Zhang; Zhi, Zeng; Ying, Huang. Adobe PDF(1152Kb)  |  Views/Downloads: 11/2  |  Submitted: 2024/05/28. Keywords: Video Copy Localization; Content Based Video Retrieval; Temporal Alignment |