CASIA OpenIR

浏览/检索结果: 共491条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 6906-6916
作者:  Wang, Wenxuan;  He, Xingjian;  Zhang, Yisi;  Guo, Longteng;  Shen, Jiachen;  Li, Jiangyun;  Liu, Jing
收藏  |  浏览/下载:5/0  |  提交时间:2024/07/03
Referring image segmentation  cross-modality guidance  masked self-distillation  vision and language  
Multi-Stage Image-Language Cross-Generative Fusion Network for Video-Based Referring Expression Comprehension 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 3256-3270
作者:  Zhang, Yujia;  Li, Qianzhong;  Pan, Yi;  Zhao, Xiaoguang;  Tan, Min
收藏  |  浏览/下载:9/0  |  提交时间:2024/07/03
Feature extraction  Visualization  Task analysis  Representation learning  Location awareness  Linguistics  Grounding  Video-based referring expression comprehension  multi-stage learning  image-language cross-generative fusion  consistency loss  
Comprehensive Attribute Prediction Learning for Person Search by Language 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1990-2003
作者:  Niu, Kai;  Huang, Linjiang;  Long, Yuzhou;  Huang, Yan;  Wang, Liang;  Zhang, Yanning
收藏  |  浏览/下载:5/0  |  提交时间:2024/07/03
Person search by language  cross-modal retrieval  smart video surveillance  attribute prediction  
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3469-3480
作者:  Peng, Fang;  Yang, Xiaoshan;  Xiao, Linhui;  Wang, Yaowei;  Xu, Changsheng
收藏  |  浏览/下载:7/0  |  提交时间:2024/07/03
Few-shot  image classification  vision-language models  
Memory-Adaptive Vision-and-Language Navigation 期刊论文
Pattern Recognition, 2024, 卷号: 153, 页码: 110511
作者:  Keji He;  Ya Jing;  Yan Huang;  Zhihe Lu;  Dong An;  Liang Wang
Adobe PDF(3831Kb)  |  收藏  |  浏览/下载:46/18  |  提交时间:2024/06/26
Vision-and-Language Navigation  Memory bank  History noises  Memory-Adaptive Model  
Modal Contrastive Learning Based End-to-End Text Image Machine Translation 期刊论文
IEEE/ACM Transactions on Audio, Speech, and Language Processing (IEEE/ACM TASLP), 2023, 卷号: 32, 期号: 32, 页码: 2153-2165
作者:  Ma, Cong;  Han, Xu;  Wu, Linghui;  Zhang, Yaping;  Zhao, Yang;  Zhou, Yu;  Zong, Chengqing
Adobe PDF(6551Kb)  |  收藏  |  浏览/下载:31/16  |  提交时间:2024/06/26
Transformers  Machine translation  Decoding  Semantics  Pipelines  Text recognition  Task analysis  Text image machine translation  contrastive learning  text image recognition  machine translation  
GFFNet: Global Feature Fusion Network for Semantic Segmentation of Large-Scale Remote Sensing Images 期刊论文
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2024, 卷号: 17, 期号: 2024, 页码: 4222 - 4234
作者:  Cao, Yong;  Huo, Chunlei;  Xiang, Shiming;  Pan, Chunhong
Adobe PDF(4340Kb)  |  收藏  |  浏览/下载:25/5  |  提交时间:2024/06/25
Cross feature fusion (CFF)  global context learning  group transformer  semantic segmentation  
An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition 期刊论文
Pattern Recognition, 2024, 页码: 110373
作者:  MingMing Yu(于明明);  Zhang H(张恒);  Fei Yin(殷飞);  Cheng-Lin Liu(刘成林)
Adobe PDF(5849Kb)  |  收藏  |  浏览/下载:46/16  |  提交时间:2024/06/24
Invisible Intruders: Label-Consistent Backdoor Attack using Re-parameterized Noise Trigger 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, 卷号: 14, 期号: 8, 页码: 1-13
作者:  Bo Wang;  Fei Yu;  Fei Wei;  Yi Li;  Wei Wang
Adobe PDF(1364Kb)  |  收藏  |  浏览/下载:46/15  |  提交时间:2024/06/21
A Simple and Strong Baseline for Universal Targeted Attacks on Siamese Visual Tracking 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2022, 卷号: 32, 期号: 6, 页码: 3880-3894
作者:  Li, Zhenbang;  Shi, Yaya;  Gao, Jin;  Wang, Shaoru;  Li, Bing;  Liang, Pengpeng;  Hu, Weiming
Adobe PDF(4397Kb)  |  收藏  |  浏览/下载:40/22  |  提交时间:2024/06/21