CASIA OpenIR

浏览/检索结果: 共306条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Towards Prior Gap and Representation Gap for Long-tailed Recognition, Pattern Recognition 期刊论文
Pattern Recognition, 2023, 卷号: 133, 期号: 109012, 页码: 109012
作者:  Zhang Ming-Liang;  Zhang Xu-Yao;  Wang Chang;  Liu Cheng-Lin
Adobe PDF(2258Kb)  |  收藏  |  浏览/下载:61/13  |  提交时间:2024/04/03
Long-tailed learning  Prior gap  Representation gap  Image recognition  
Scene text recognition via dual character counting-aware visual and semantic modeling network 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 卷号: 67, 期号: 3, 页码: 2
作者:  Xiao, Ke;  Zhu, Anna;  Iwana, Brian Kenji;  Liu, Cheng-Lin
收藏  |  浏览/下载:53/0  |  提交时间:2024/03/13
A New Lightweight Script Independent Scene Text Style Transfer Network 期刊论文
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 页码: 29
作者:  Shivakumara, Palaiahnakote;  Roy, Ayush;  Nandanwar, Lokesh;  Pal, Umapada;  Lu, Yue;  Liu, Cheng-Lin
收藏  |  浏览/下载:22/0  |  提交时间:2024/02/22
Text detection  style transfer  CNN models  multi-lingual transfer  
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 4830-4841
作者:  Chen, Zhuo;  Yin, Fei;  Yang, Qing;  Liu, Cheng-Lin
收藏  |  浏览/下载:25/0  |  提交时间:2024/02/22
Cross-lingual text image recognition  cross-modal mimic  multihierarchy mimic  
TR-MISR: Multiimage super-resolution based on feature fusion with transformers 期刊论文
IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, 2022, 卷号: 15, 页码: 1373-1388
作者:  An T(安泰);  Zhang X(张鑫);  Huo CL(霍春雷);  Xue B(薛斌);  Wang LF(汪凌峰);  Pan CH(潘春洪)
Adobe PDF(6058Kb)  |  收藏  |  浏览/下载:111/9  |  提交时间:2024/01/17
Deep learning  end-to-end networks  feature extraction and fusion  multiimage super-resolution (MISR)  remote sensing  transformers  
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:113/12  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:81/0  |  提交时间:2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
A Two-Level Rectification Attention Network for Scene Text Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 2404-2414
作者:  Wu, Lintai;  Xu, Yong;  Hou, Junhui;  Chen, C. L. Philip;  Liu, Cheng-Lin
收藏  |  浏览/下载:50/0  |  提交时间:2023/11/17
Scene text recognition  text rectification  spatial transformer network  optical character recognition  
A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3552-3566
作者:  Shivakumara, Palaiahnakote;  Banerjee, Ayan;  Pal, Umapada;  Nandanwar, Lokesh;  Lu, Tong;  Liu, Cheng-Lin
收藏  |  浏览/下载:31/0  |  提交时间:2023/11/17
Text detection  style transfer  deep learning  EfficientNet  social media images  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:101/15  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection