CASIA OpenIR

浏览/检索结果: 共410条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Scene text recognition via dual character counting-aware visual and semantic modeling network 期刊论文
SCIENCE CHINA-INFORMATION SCIENCES, 2024, 卷号: 67, 期号: 3, 页码: 2
作者:  Xiao, Ke;  Zhu, Anna;  Iwana, Brian Kenji;  Liu, Cheng-Lin
收藏  |  浏览/下载:47/0  |  提交时间:2024/03/13
Reparameterizing and dynamically quantizing image features for image generation 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 11
作者:  Sun, Mingzhen;  Wang, Weining;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(3612Kb)  |  收藏  |  浏览/下载:85/7  |  提交时间:2023/12/21
Vector quantization  Variational auto-encoder  Unconditional image generation  Text-to-image generation  Autoregressive generation  
A New Lightweight Script Independent Scene Text Style Transfer Network 期刊论文
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 页码: 29
作者:  Shivakumara, Palaiahnakote;  Roy, Ayush;  Nandanwar, Lokesh;  Pal, Umapada;  Lu, Yue;  Liu, Cheng-Lin
收藏  |  浏览/下载:17/0  |  提交时间:2024/02/22
Text detection  style transfer  CNN models  multi-lingual transfer  
So Many Heads, So Many Wits: Multimodal Graph Reasoning for Text-Based Visual Question Answering 期刊论文
IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS, 2023, 页码: 12
作者:  Zheng, Wenbo;  Yan, Lan;  Wang, Fei-Yue
收藏  |  浏览/下载:58/0  |  提交时间:2023/12/21
Graph attention  graph reasoning  multimodal graph  self-attention  text-based visual question answering  
Could ChatGPT Imagine: Content Control for Artistic Painting Generation Via Large Language Models 期刊论文
JOURNAL OF INTELLIGENT & ROBOTIC SYSTEMS, 2023, 卷号: 109, 期号: 2, 页码: 15
作者:  Lu, Yue;  Guo, Chao;  Dou, Yong;  Dai, Xingyuan;  Wang, Fei-Yue
收藏  |  浏览/下载:70/0  |  提交时间:2023/11/15
Intelligent systems  Human-machine interactions  Artistic painting generation  Large language model  ChatGPT  Linguistic intelligence  
Understanding and Mitigating Overfitting in Prompt Tuning for Vision-Language Models 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 9, 页码: 4616-4629
作者:  Ma, Chengcheng;  Liu, Yang;  Deng, Jiankang;  Xie, Lingxi;  Dong, Weiming;  Xu, Changsheng
Adobe PDF(1644Kb)  |  收藏  |  浏览/下载:79/11  |  提交时间:2023/11/16
Vision-language model  prompt tuning  over-fitting  subspace learning  gradient projection  
RSDet++: Point-based modulated loss for more accurate rotated object detection 期刊论文
IEEE Transactions on Circuits and Systems for Video Technology, 2023, 卷号: 32, 期号: 11, 页码: 7869-7879
作者:  Wen Qian;  Xue Yang;  Silong Peng;  Xiujuan Zhang;  Junchi Yan
Adobe PDF(6998Kb)  |  收藏  |  浏览/下载:158/62  |  提交时间:2023/06/07
TinyNeRF: Towards 100 times Compression of Volume Radiance Fields 会议论文
, 线上, 2023-02
作者:  Zhao TL(赵天理);  Chen JY(陈嘉园);  Leng C(冷聪);  Cheng J(程健)
Adobe PDF(2855Kb)  |  收藏  |  浏览/下载:145/35  |  提交时间:2023/06/21
Neural Radiance Fields  Discrete Cosine Transformation  Frequency Domain  
Sounding Video Generator: A Unified Framework for Text-guided Sounding Video Generation 期刊论文
IEEE Transactions on Multimedia, 2023, 卷号: 26, 页码: 1 - 13
作者:  Liu, Jiawei;  Wang, Weining;  Chen, Sihan;  Zhu, Xinxin;  Liu, Jing
Adobe PDF(7741Kb)  |  收藏  |  浏览/下载:112/20  |  提交时间:2023/05/03
Text-guided sounding-video generation  Videoaudio representation  Contrastive learning  Transformer  
Texts as points: Scene text detection with point supervision 期刊论文
Pattern Recognition Letters, 2023, 卷号: 170, 页码: 1-8
作者:  Mengbiao Zhao;  Wei Feng;  Fei Yin;  Cheng-Lin Liu
Adobe PDF(1670Kb)  |  收藏  |  浏览/下载:119/37  |  提交时间:2023/06/28
Scene text detection  Point supervision  Mixed-supervised learning