英文摘要 | In recent years, with the rapid growth of Internet industry and the wide spread of electronic products like intelligent mobile phone and digital camera, multimedia information based on photos and videos is becoming the main way for information transmission. Photos and videos contain a large number of natural scene images. Those images contain lots of texts, which take the high level semantic information, that help to understand the content greatly. Natural scene text recognition has great value in many applications. It could be applied in the fields like real time translation, aided navigation, traffic monitoring, disabled service, etc. Therefore, detection and recognition of natural scene text has become an urgent need for everyday life. However, texts in natural scene images, which were collected by mobile devices, have many problems like complex background, uneven illumination, various fonts, etc, thus making the detection and recognition very difficult. Nowadays, natural scene text detection and recognition is becoming the research focus in the field of computer vision. It has become an important area in pattern recognition application. More and more scholars have been attracted to the research of it. The research of scene text recognition has achieved great improvement till now. However, more efforts need to be made in order to bring this technique to people’s everyday life. A complete text recognition system includes text detection and text recognition: Text detection aims to locate the position of text block and to extract character area; Text recognition uses the character block binary or color image for classification. This thesis researches on natural scene text detection and recognition systematically: For text detection, this thesis focuses more on application, taking mobile devices as the target platform; For text recognition, more deep theoretical analysis has been made, and an innovative algorithm is given. The main content of this thesis is summarized as follows: Firstly, this thesis proposes a connect component based and multi-information combined scene text detection method. This method is designed to work for mobile device application. On a mobile device-intelligent mobile phone-for example, user easily marks the target area for text analysis. This simple interaction can greatly reduce the difficulty and increase the speed for detection. Next, edge detection is adopted to located the text block area. The detected text area i... |
修改评论