Knowledge Commons of Institute of Automation,CAS
Metric Rectification of Curved Document Images | |
Meng, Gaofeng1![]() ![]() ![]() | |
发表期刊 | IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
![]() |
2012-04-01 | |
卷号 | 34期号:4页码:707-722 |
文章类型 | Article |
摘要 | In this paper, we propose a metric rectification method to restore an image from a single camera-captured document image. The core idea is to construct an isometric image mesh by exploiting the geometry of page surface and camera. Our method uses a general cylindrical surface (GCS) to model the curved page shape. Under a few proper assumptions, the printed horizontal text lines are shown to be line convergent symmetric. This property is then used to constrain the estimation of various model parameters under perspective projection. We also introduce a paraperspective projection to approximate the nonlinear perspective projection. A set of close-form formulas is thus derived for the estimate of GCS directrix and document aspect ratio. Our method provides a straightforward framework for image metric rectification. It is insensitive to camera positions, viewing angles, and the shapes of document pages. To evaluate the proposed method, we implemented comprehensive experiments on both synthetic and real-captured images. The results demonstrate the efficiency of our method. We also carried out a comparative experiment on the public CBDAR2007 data set. The experimental results show that our method outperforms the state-of-the-art methods in terms of OCR accuracy and rectification errors. |
关键词 | Document Image Analysis Imaging Geometry Geometric Correction Shape-from-x Mesh Warping |
WOS标题词 | Science & Technology ; Technology |
关键词[WOS] | PERSPECTIVE IMAGES ; SHAPE ; RESTORATION |
收录类别 | SCI |
语种 | 英语 |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:000300581700007 |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://ir.ia.ac.cn/handle/173211/3713 |
专题 | 多模态人工智能系统全国重点实验室_先进时空数据分析与学习 |
作者单位 | 1.Chinese Acad Sci, Inst Automat, Natl Lab Pattern Recognit, Beijing 100190, Peoples R China 2.Xi An Jiao Tong Univ, Inst Artificial Intelligence & Robot, Xian 710049, Peoples R China |
第一作者单位 | 模式识别国家重点实验室 |
推荐引用方式 GB/T 7714 | Meng, Gaofeng,Pan, Chunhong,Xiang, Shiming,et al. Metric Rectification of Curved Document Images[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2012,34(4):707-722. |
APA | Meng, Gaofeng,Pan, Chunhong,Xiang, Shiming,Duan, Jiangyong,&Zheng, Nanning.(2012).Metric Rectification of Curved Document Images.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,34(4),707-722. |
MLA | Meng, Gaofeng,et al."Metric Rectification of Curved Document Images".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 34.4(2012):707-722. |
条目包含的文件 | 下载所有文件 | |||||
文件名称/大小 | 文献类型 | 版本类型 | 开放类型 | 使用许可 | ||
TPAMI2012.pdf(5220KB) | 期刊论文 | 作者接受稿 | 开放获取 | CC BY-NC-SA | 浏览 下载 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论