CASIA OpenIR

浏览/检索结果: 共148条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Positive Unlabeled Fake News Detection via Multi-Modal Masked Transformer Network 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 234-244
作者:  Wang, Jinguang;  Qian, Shengsheng;  Hu, Jun;  Hong, Richang
收藏  |  浏览/下载:7/0  |  提交时间:2024/03/26
Fake news detection  multi-modal learning  social media  
Recovering Generalization via Pre-training-like Knowledge Distillation for Out-of-Distribution Visual Question Answering 期刊论文
IEEE Transactions on Multimedia, 2023, 页码: 1-15
作者:  Song, Yaguang;  Yang, Xiaoshan;  Wang, Yaowei;  Xu, Changsheng
Adobe PDF(2397Kb)  |  收藏  |  浏览/下载:162/43  |  提交时间:2023/06/12
Multi-modal Foundation Model  Out-of-Distribution Generalization  Visual Question Answering  Knowledge Distillation  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:118/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Seeing Through Darkness: Visual Localization at Night via Weakly Supervised Learning of Domain Invariant Features 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 1713-1726
作者:  Fan, Bin;  Yang, Yuzhu;  Feng, Wensen;  Wu, Fuchao;  Lu, Jiwen;  Liu, Hongmin
收藏  |  浏览/下载:76/0  |  提交时间:2023/11/17
Domain invariant local features  image matching  long-term visual localization  weakly supervised learning  
BSTG-Trans: A Bayesian Spatial-Temporal Graph Transformer for Long-term Pose Forecasting 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: Early Access, 期号: Early Access, 页码: Early Access
作者:  Shentong Mo;  Xin M(辛淼)
Adobe PDF(2209Kb)  |  收藏  |  浏览/下载:86/14  |  提交时间:2023/04/25
long-term forecasting  spatial-temporal graph transformer  Bayesian transformer  uncertainty estimation  
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 4830-4841
作者:  Chen, Zhuo;  Yin, Fei;  Yang, Qing;  Liu, Cheng-Lin
收藏  |  浏览/下载:21/0  |  提交时间:2024/02/22
Cross-lingual text image recognition  cross-modal mimic  multihierarchy mimic  
Robust Video-Text Retrieval Via Noisy Pair Calibration 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 8632-8645
作者:  Zhang, Huaiwen;  Yang, Yang;  Qi, Fan;  Qian, Shengsheng;  Xu, Changsheng
收藏  |  浏览/下载:17/0  |  提交时间:2024/02/22
Noise calibration  uncertainty  video text retrieval  
ATF: An Alternating Training Framework for Weakly Supervised Face Alignment 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 1798-1809
作者:  Lan, Xing;  Hu, Qinghao;  Cheng, Jian
收藏  |  浏览/下载:50/0  |  提交时间:2023/11/17
Face alignment  multi-task learning  weakly supervised  
AnANet: Association and Alignment Network for Modeling Implicit Relevance in Cross-Modal Correlation Classification 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 7867-7880
作者:  Xu, Nan;  Wang, Junyan;  Tian, Yuan;  Zhang, Ruike;  Mao, Wenji
收藏  |  浏览/下载:6/0  |  提交时间:2024/03/26
Association and alignment network  classification scheme  cross-modal correlation  implicit relevance  
Illumination Guided Attentive Wavelet Network for Low-Light Image Enhancement 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 6258-6271
作者:  Xu, Jingzhao;  Yuan, Mengke;  Yan, Dong-Ming;  Wu, Tieru
收藏  |  浏览/下载:19/0  |  提交时间:2024/02/22
Lighting  Wavelet transforms  Image enhancement  Frequency modulation  Wavelet coefficients  Noise reduction  Discrete wavelet transforms  Attention mechanism  illumination guidance  low-light image enhancement  wavelet transform