CASIA OpenIR  > 毕业生  > 硕士学位论文
Thesis Advisor杜清秀
Degree Grantor中国科学院大学
Place of Conferral北京
Keyword文档提取 文档图像预处理 文字识别 Android
Other AbstractCharacter recognition technology has been one of the most important research areas in the field of pattern recognition. With the development of the mobile internet, the number of mobiles and users is increasing faster and faster. But the technology has not been widely used in the mobile due to the bad hardware performance. Whether the document image processing, or text recognition algorithm may not be able to get satisfactory results. So, the document character recognition technology still has a lot of problems worthy of study.
This thesis analyzes the status of current character recognition technique in the world. And the key technologies in mobile character recognition process are researched. Finally, this paper realizes a text document recognition system on Android platform. It can extract text from photographs taken by mobile, and convert them into the format that electronic devices can use.
The main contributions of this thesis are summarized as follows:
1.      Research and improve the document image preprocessing technology
Image preprocessing has a direct impact on the recognition performance. So this article firstly improves the document capture algorithm on mobile and puts forward to an algorithm for distortion picture taken by mobile. Secondly, document images may have complex background in form of non-uniform illumination or may be corrupted by noise. this thesis proposes an improved algorithm based on Curvelet transform for image denoising and image binarization. What’s more, this thesis put forward to an algorithm for character segmentation mixed Chinese and English.
2.      Research the character recognition technology
This paper takes the traditional machine learning methods,This method based on the traditional character recognition model, including the segmentation of characters, image normalization, feature extraction, dimension reduction, classifier training. In addition, in order to identify many types of printing fronts, this thesis also presents a fast algorithm for the generation of printed samples.
3.      Develop the character recognition App On the Android Platform
This thesis develops the OCR App based on Android platform. The App integrates document preprocessing technology, character recognition technology and realizes the function of extracting and recogniting characters in the document images taken by mobile phone. Experiments show that the App can provide relevant services for users, and achieve satisfactory results.
Document Type学位论文
Recommended Citation
GB/T 7714
全远航. 基于Android平台文档文字识别技术的研究与实现[D]. 北京. 中国科学院大学,2016.
Files in This Item:
File Name/Size DocType Version Access License
2013E8014661099全远航.p(3744KB)学位论文 暂不开放CC BY-NC-SAApplication Full Text
Related Services
Recommend this item
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[全远航]'s Articles
Baidu academic
Similar articles in Baidu academic
[全远航]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[全远航]'s Articles
Terms of Use
No data!
Social Bookmark/Share
All comments (0)
No comment.

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.