An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition
MingMing Yu(于明明); Zhang H(张恒); Fei Yin(殷飞); Cheng-Lin Liu(刘成林)
发表期刊Pattern Recognition
2024
页码110373
摘要

Text line recognition methods can be categorized into explicit segmentation based and implicit segmentation based ones. Explicit segmentation based methods require character-level annotation during training, while implicit segmentation based methods, trained on line-level annotated data, face alignment drift challenges. Though some methods have been proposed to address these challenges using weakly supervised object detection, they often rely on cumbersome pseudobox generation processes and complex decoding. In this paper, we propose a unified framework to overcome these challenges, achieving high accuracy in text recognition and character segmentation. To eliminate the need of character-level annotated real text line data in training, we introduce a novel training paradigm that utilizes character-level annotated synthetic data and line-level annotated real data jointly. For synthetic data, candidate characters are explicitly aligned with labeled characters to generate hard labels for supervising model training. For real data, implicit alignment is produced by Connectionist Temporal Classification (CTC) mapping to provide soft labels for weakly-supervised model training. And for inference, we propose two decoding strategies leveraging the advantages of Non-Maximum Suppression (NMS) and CTC decoding. Extensive experiments on benchmark datasets demonstrate the superior performance of our method in text recognition and character localization, even with minimal amounts of character-level annotated line data.

收录类别SCI
语种英语
七大方向——子方向分类图像视频处理与分析
国重实验室规划方向分类视觉信息处理
是否有论文关联数据集需要存交
文献类型期刊论文
条目标识符http://ir.ia.ac.cn/handle/173211/57526
专题多模态人工智能系统全国重点实验室_模式分析与学习
通讯作者Cheng-Lin Liu(刘成林)
推荐引用方式
GB/T 7714
MingMing Yu,Zhang H,Fei Yin,et al. An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition[J]. Pattern Recognition,2024:110373.
APA MingMing Yu,Zhang H,Fei Yin,&Cheng-Lin Liu.(2024).An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition.Pattern Recognition,110373.
MLA MingMing Yu,et al."An Approach for Handwritten Chinese Text Recognition Unifying Character Segmentation and Recognition".Pattern Recognition (2024):110373.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
HCTR_USR(1).pdf(5849KB)期刊论文作者接受稿开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[MingMing Yu(于明明)]的文章
[Zhang H(张恒)]的文章
[Fei Yin(殷飞)]的文章
百度学术
百度学术中相似的文章
[MingMing Yu(于明明)]的文章
[Zhang H(张恒)]的文章
[Fei Yin(殷飞)]的文章
必应学术
必应学术中相似的文章
[MingMing Yu(于明明)]的文章
[Zhang H(张恒)]的文章
[Fei Yin(殷飞)]的文章
相关权益政策
暂无数据
收藏/分享
文件名: HCTR_USR(1).pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。