已选(0)清除
条数/页: 排序方式: |
| TextFormer: A Query-based End-to-end Text Spotter with Mixed Supervision 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 704-717 作者: Yukun Zhai; Xiaoqiang Zhang; Xiameng Qin; Sanyuan Zhao; Xingping Dong; Jianbing Shen Adobe PDF(2312Kb)  |  收藏  |  浏览/下载:8/5  |  提交时间:2024/07/18 End-to-end text spotting arbitrarily-shaped texts transformer mixed supervision multitask modeling |
| A Novel Divide and Conquer Solution for Long-term Video Salient Object Detection 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 684-703 作者: Yun-Xiao Li; Cheng-Li-Zhao Chen; Shuai Li; Ai-Min Hao; Hong Qin Adobe PDF(6454Kb)  |  收藏  |  浏览/下载:6/2  |  提交时间:2024/07/18 Video salient object detection background consistency analysis weakly supervised learning long-term information background shift |
| Vision Transformers with Hierarchical Attention 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 670-683 作者: Yun Liu; Yu-Huan Wu; Guolei Sun; Le Zhang; Ajad Chhatkuli; Luc Van Gool Adobe PDF(1358Kb)  |  收藏  |  浏览/下载:8/4  |  提交时间:2024/07/18 Vision transformer hierarchical attention global attention local attention scene understanding |
| Rethinking Global Context in Crowd Counting 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 640-651 作者: Guolei Sun; Yun Liu; Thomas Probst; Danda Pani Paudel; Nikola Popovic; Luc Van Gool Adobe PDF(2388Kb)  |  收藏  |  浏览/下载:6/3  |  提交时间:2024/07/18 Crowd counting vision transformer global context attention density map |
| Rethinking Polyp Segmentation from An Out-ofdistribution Perspective 期刊论文 Machine Intelligence Research, 2024, 卷号: 21, 期号: 4, 页码: 631-639 作者: Ge-Peng Ji; Jing Zhang; Dylan Campbell; Huan Xiong; Nick Barnes Adobe PDF(2420Kb)  |  收藏  |  浏览/下载:5/3  |  提交时间:2024/07/18 Polyp segmentation anomaly segmentation out-of-distribution segmentation masked autoencoder abdomen |
| Multi-Level Pixel-Wise Correspondence Learning for 6DoF Face Pose Estimation 期刊论文 IEEE Transactions on Multimedia, 2024, 页码: IEEE Xplore 作者: Xu M(徐淼); Xiangyu Zhu; Yueying Kao; Zhiwen Chen; Jiangjing Lyu; Zhen Lei Adobe PDF(7908Kb)  |  收藏  |  浏览/下载:32/4  |  提交时间:2024/07/16 |
| 面向视觉-语言的跨模态预训练与匹配方法研究 学位论文 , 2024 作者: chen yuxin Adobe PDF(46981Kb)  |  收藏  |  浏览/下载:20/1  |  提交时间:2024/07/11 视觉语言匹配 图像文本预训练 知识蒸馏 双向匹配评估 令牌合并 |
| CLIP-Driven hierarchical fusion for referring image segmentation 会议论文 , Kunming, China, 2024/03/08 作者: Yichen Yan; Xingjian He; Jing Liu Adobe PDF(5233Kb)  |  收藏  |  浏览/下载:35/10  |  提交时间:2024/07/08 Referring Image Segmentation, CLIP, Hierarchical Fusion, Computer Vision |
| 标注受限的光学遥感图像目标检测模型与算法研究 学位论文 , 2024 作者: 任至达 Adobe PDF(18136Kb)  |  收藏  |  浏览/下载:23/1  |  提交时间:2024/07/08 光学遥感图像目标检测 标注受限 弱监督学习 显著性检测 特征增强 |
| 面向多模态语义理解与推理的视觉问答研究 学位论文 , 2024 作者: 张熙 Adobe PDF(39126Kb)  |  收藏  |  浏览/下载:31/2  |  提交时间:2024/07/08 多模态 视觉问答 语义挖掘 可靠关联 推理泛化 |