TextEdge: Multi-oriented Scene Text Detection via Region Segmentation and Edge Classification
Du C(杜臣); Wang CH(王春恒)
2019
会议名称2019 International Conference on Document Analysis and Recognition (ICDAR)
会议日期2019-9
会议地点Sydney, NSW, Australia
摘要

The semantic-segmentation-based scene text detection algorithms always use the bounding-box regions or their shrinks to represent the text pixels. However, the nontext pixel information in these regions easily results in the poor performance of text detection, because these semantic segmentation methods need accurate pixel-level annotated training data to achieve approving performance and they are sensitive to noise and interference. In this work, we propose a fully convolutional network (FCN) based method termed TextEdge for multi-oriented scene text detection. Compared with previous methods simply using bounding-box regions as a segmentation mask, TextEdge introduces the text-region edge map as a new segmentation mask. Edge information is more representative for text areas and is proved to be effective in improving detection performance. TextEdge is optimized in an end-to-end way with multi-task outputs: text and nontext classification, text-edge prediction and the text boundaries regression. Experiments on standard datasets demonstrate that the proposed method achieves state-of-the-art performance in both accuracy and efficiency. Specifically, it achieves an F-score of 0.88 on ICDAR 2013 dataset and 0.86 on ICDAR 2015 dataset.

关键词scene text detection semantic segmentation text edge information multi-task learn
DOI10.1109/ICDAR.2019.00067
收录类别EI
语种英语
七大方向——子方向分类文字识别与文档分析
引用统计
文献类型会议论文
条目标识符http://ir.ia.ac.cn/handle/173211/46633
专题复杂系统管理与控制国家重点实验室_影像分析与机器视觉
通讯作者Wang CH(王春恒)
作者单位中国科学院自动化研究所
第一作者单位中国科学院自动化研究所
通讯作者单位中国科学院自动化研究所
推荐引用方式
GB/T 7714
Du C,Wang CH. TextEdge: Multi-oriented Scene Text Detection via Region Segmentation and Edge Classification[C],2019.
条目包含的文件 下载所有文件
文件名称/大小 文献类型 版本类型 开放类型 使用许可
TextEdge_Multi-orien(761KB)会议论文 开放获取CC BY-NC-SA浏览 下载
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Du C(杜臣)]的文章
[Wang CH(王春恒)]的文章
百度学术
百度学术中相似的文章
[Du C(杜臣)]的文章
[Wang CH(王春恒)]的文章
必应学术
必应学术中相似的文章
[Du C(杜臣)]的文章
[Wang CH(王春恒)]的文章
相关权益政策
暂无数据
收藏/分享
文件名: TextEdge_Multi-oriented_Scene_Text_Detection_via_Region_Segmentation_and_Edge_Classification.pdf
格式: Adobe PDF
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。