CASIA OpenIR

Browse/Search Results: 59 items in total, showing items 1-10

CM-MaskSD: Cross-Modality Masked Self-Distillation for Referring Image Segmentation (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 6906-6916
Authors: Wang, Wenxuan; He, Xingjian; Zhang, Yisi; Guo, Longteng; Shen, Jiachen; Li, Jiangyun; Liu, Jing
Views/Downloads: 3/0 | Submitted: 2024/07/03
Referring image segmentation  cross-modality guidance  masked self-distillation  vision and language  
Comment-Context Dual Collaborative Masked Transformer Network for Fake News Detection (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 5170-5180
Authors: Wang, Jinguang; Qian, Shengsheng; Hu, Jun; Hong, Richang
Views/Downloads: 14/0 | Submitted: 2024/07/03
Fake news detection  multi-modal learning  social media  
SgVA-CLIP: Semantic-Guided Visual Adapting of Vision-Language Models for Few-Shot Image Classification (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 3469-3480
Authors: Peng, Fang; Yang, Xiaoshan; Xiao, Linhui; Wang, Yaowei; Xu, Changsheng
Views/Downloads: 6/0 | Submitted: 2024/07/03
Few-shot  image classification  vision-language models  
Semantic Distance Adversarial Learning for Text-to-Image Synthesis (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 1255-1266
Authors: Yuan, Bowen; Sheng, Yefei; Bao, Bing-Kun; Chen, Yi-Ping Phoebe; Xu, Changsheng
Views/Downloads: 9/0 | Submitted: 2024/07/03
Text-to-image synthesis  adversarial learning  cycle consistency  
Invisible Intruders: Label-Consistent Backdoor Attack using Re-parameterized Noise Trigger (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 14, Issue: 8, Pages: 1-13
Authors: Bo Wang; Fei Yu; Fei Wei; Yi Li; Wei Wang
Adobe PDF (1364 KB) | Views/Downloads: 43/14 | Submitted: 2024/06/21
AnANet: Association and Alignment Network for Modeling Implicit Relevance in Cross-Modal Correlation Classification (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 7867-7880
Authors: Xu, Nan; Wang, Junyan; Tian, Yuan; Zhang, Ruike; Mao, Wenji
Views/Downloads: 48/0 | Submitted: 2024/03/26
Association and alignment network  classification scheme  cross-modal correlation  implicit relevance  
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 4830-4841
Authors: Chen, Zhuo; Yin, Fei; Yang, Qing; Liu, Cheng-Lin
Views/Downloads: 57/0 | Submitted: 2024/02/22
Cross-lingual text image recognition  cross-modal mimic  multihierarchy mimic  
Quality-Aware Network for Human Parsing (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 7128-7138
Authors: Yang, Lu; Song, Qing; Wang, Zhihui; Liu, Zhiwei; Xu, Songcen; Li, Zhihao
Views/Downloads: 40/0 | Submitted: 2024/02/22
Computer vision  image segmentation  multi-media computing  
A Two-Level Rectification Attention Network for Scene Text Recognition (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 2404-2414
Authors: Wu, Lintai; Xu, Yong; Hou, Junhui; Chen, C. L. Philip; Liu, Cheng-Lin
Views/Downloads: 71/0 | Submitted: 2023/11/17
Scene text recognition  text rectification  spatial transformer network  optical character recognition  
Explicit Cross-Modal Representation Learning for Visual Commonsense Reasoning (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2986-2997
Authors: Zhang, Xi; Zhang, Feifei; Xu, Changsheng
Adobe PDF (5681 KB) | Views/Downloads: 411/4 | Submitted: 2022/07/25
Cognition  Video recording  Syntactics  Visualization  Task analysis  Semantics  Linguistics  Visual Commonsense Reasoning  explicit reasoning  syntactic structure  interpretability