CASIA OpenIR

浏览/检索结果: 共21条,第1-10条 帮助

限定条件                    
已选(0)清除 条数/页:   排序方式:
Diff-Writer: A Diffusion Model-Based Stylized Online Handwritten Chinese Character Generator 会议论文
, 湖南省 长沙市, 2023-11
作者:  Ren MS(任敏思);  Zhang YM(张燕明);  Wang QF(王秋锋);  Yin F(殷飞);  Liu CL(刘成林)
Adobe PDF(64745Kb)  |  收藏  |  浏览/下载:1/1  |  提交时间:2024/05/31
Generative model  
Towards Prior Gap and Representation Gap for Long-tailed Recognition, Pattern Recognition 期刊论文
Pattern Recognition, 2023, 卷号: 133, 期号: 109012, 页码: 109012
作者:  Zhang Ming-Liang;  Zhang Xu-Yao;  Wang Chang;  Liu Cheng-Lin
Adobe PDF(2258Kb)  |  收藏  |  浏览/下载:78/15  |  提交时间:2024/04/03
Long-tailed learning  Prior gap  Representation gap  Image recognition  
A Multi-Modal Neural Geometric Solver with Textual Clauses Parsed from Diagram 会议论文
, 中国 澳门, 2023-7-19
作者:  Zhang Ming-Liang;  Yin Fei;  Liu Cheng-Lin
Adobe PDF(1110Kb)  |  收藏  |  浏览/下载:66/15  |  提交时间:2024/04/03
VQAPT: A New visual question answering model for personality traits in social media images 期刊论文
PATTERN RECOGNITION LETTERS, 2023, 卷号: 175, 页码: 66-73
作者:  Biswas, Kunal;  Shivakumara, Palaiahnakote;  Pal, Umapada;  Liu, Cheng-Lin;  Lu, Yue
收藏  |  浏览/下载:44/0  |  提交时间:2024/02/22
Personality trait images  Multimodal concept  Text recognition  Social media images  Natural language processing  Visual question answering  
A New Lightweight Script Independent Scene Text Style Transfer Network 期刊论文
INTERNATIONAL JOURNAL OF PATTERN RECOGNITION AND ARTIFICIAL INTELLIGENCE, 2023, 页码: 29
作者:  Shivakumara, Palaiahnakote;  Roy, Ayush;  Nandanwar, Lokesh;  Pal, Umapada;  Lu, Yue;  Liu, Cheng-Lin
收藏  |  浏览/下载:34/0  |  提交时间:2024/02/22
Text detection  style transfer  CNN models  multi-lingual transfer  
Cross-Lingual Text Image Recognition via Multi-Hierarchy Cross-Modal Mimic 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 4830-4841
作者:  Chen, Zhuo;  Yin, Fei;  Yang, Qing;  Liu, Cheng-Lin
收藏  |  浏览/下载:30/0  |  提交时间:2024/02/22
Cross-lingual text image recognition  cross-modal mimic  multihierarchy mimic  
SignParser: An End-to-End Framework for Traffic Sign Understanding 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 页码: 17
作者:  Guo, Yunfei;  Feng, Wei;  Yin, Fei;  Liu, Cheng-Lin
收藏  |  浏览/下载:85/0  |  提交时间:2023/12/21
Traffic sign understanding  Content reasoning  Semantic description generation  
Deep representation learning for domain generalization with information bottleneck principle 期刊论文
PATTERN RECOGNITION, 2023, 卷号: 143, 页码: 12
作者:  Zhang, Jiao;  Zhang, Xu-Yao;  Wang, Chuang;  Liu, Cheng-Lin
收藏  |  浏览/下载:110/0  |  提交时间:2023/11/17
Domain generalization  Information bottleneck  Representation learning  
A Two-Level Rectification Attention Network for Scene Text Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 2404-2414
作者:  Wu, Lintai;  Xu, Yong;  Hou, Junhui;  Chen, C. L. Philip;  Liu, Cheng-Lin
收藏  |  浏览/下载:54/0  |  提交时间:2023/11/17
Scene text recognition  text rectification  spatial transformer network  optical character recognition  
A New Language-Independent Deep CNN for Scene Text Detection and Style Transfer in Social Media Images 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3552-3566
作者:  Shivakumara, Palaiahnakote;  Banerjee, Ayan;  Pal, Umapada;  Nandanwar, Lokesh;  Lu, Tong;  Liu, Cheng-Lin
收藏  |  浏览/下载:32/0  |  提交时间:2023/11/17
Text detection  style transfer  deep learning  EfficientNet  social media images