CASIA OpenIR

Browse/search results: 283 items in total, showing items 1-10

Scene text recognition via dual character counting-aware visual and semantic modeling network (Journal article)
SCIENCE CHINA-INFORMATION SCIENCES, 2024, Volume: 67, Issue: 3, Pages: 2
Authors: Xiao, Ke; Zhu, Anna; Iwana, Brian Kenji; Liu, Cheng-Lin
Views/Downloads: 48/0 | Submitted: 2024/03/13
Reparameterizing and dynamically quantizing image features for image generation (Journal article)
PATTERN RECOGNITION, 2024, Volume: 146, Pages: 11
Authors: Sun, Mingzhen; Wang, Weining; Zhu, Xinxin; Liu, Jing
Adobe PDF (3612Kb) | Views/Downloads: 97/9 | Submitted: 2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
A New Lightweight Script Independent Scene Text Style Transfer Network (Journal article)
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, Pages: 29
Authors: Shivakumara, Palaiahnakote; Roy, Ayush; Nandanwar, Lokesh; Pal, Umapada; Lu, Yue; Liu, Cheng-Lin
Views/Downloads: 20/0 | Submitted: 2024/02/22
Text detection  style transfer  CNN models  multi-lingual transfer  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 9, Pages: 4616-4629
Authors: Ma, Chengcheng; Liu, Yang; Deng, Jiankang; Xie, Lingxi; Dong, Weiming; Xu, Changsheng
Adobe PDF (1644Kb) | Views/Downloads: 95/12 | Submitted: 2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
RSDet++: Point-based modulated loss for more accurate rotated object detection (Journal article)
IEEE Transactions on Circuits and Systems for Video Technology, 2023, Volume: 32, Issue: 11, Pages: 7869-7879
Authors: Wen Qian; Xue Yang; Silong Peng; Xiujuan Zhang; Junchi Yan
Adobe PDF (6998Kb) | Views/Downloads: 168/62 | Submitted: 2023/06/07
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation (Journal article)
IEEE Transactions on Multimedia, 2023, Volume: 26, Pages: 1-13
Authors: Liu, Jiawei; Wang, Weining; Chen, Sihan; Zhu, Xinxin; Liu, Jing
Adobe PDF (7741Kb) | Views/Downloads: 118/20 | Submitted: 2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Texts as points: Scene text detection with point supervision (Journal article)
Pattern Recognition Letters, 2023, Volume: 170, Pages: 1-8
Authors: Mengbiao Zhao; Wei Feng; Fei Yin; Cheng-Lin Liu
Adobe PDF (1670Kb) | Views/Downloads: 130/40 | Submitted: 2023/06/28
Scene text detection  Point supervision  Mixed-supervised learning  
Towards open-set text recognition via label-to-prototype learning (Journal article)
PATTERN RECOGNITION, 2023, Volume: 134, Pages: 13
Authors: Liu, Chang; Yang, Chun; Qin, Hai-Bo; Zhu, Xiaobin; Liu, Cheng-Lin; Yin, Xu-Cheng
Views/Downloads: 220/0 | Submitted: 2022/12/27
Open-set recognition  Scene text recognition  Low-shot recognition  
Learning to Adapt Across Dual Discrepancy for Cross-Domain Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 2, Pages: 1963-1980
Authors: Luo, Chuanchen; Song, Chunfeng; Zhang, Zhaoxiang
Adobe PDF (2539Kb) | Views/Downloads: 204/61 | Submitted: 2023/03/20
Person re-identification  domain adaptation  cross-domain mixup  camera-aware learning  self-paced learning  
Temporal sparse adversarial attack on sequence-based gait recognition (Journal article)
PATTERN RECOGNITION, 2023, Volume: 133, Pages: 11
Authors: He, Ziwen; Wang, Wei; Dong, Jing; Tan, Tieniu
Adobe PDF (1435Kb) | Views/Downloads: 288/53 | Submitted: 2022/11/21
Adversarial attack  Gait recognition  Temporal sparsity