CASIA OpenIR  > 09年以前成果
Recognition of pornographic web pages by classifying texts and images
Hu, Weiming; Wu, Ou; Chen, Zhouyao; Fu, Zhouyu; Maybank, Steve; Ou Wu
Source PublicationIEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE
2007-06-01
Volume29Issue:6Pages:1019-1034
SubtypeArticle
AbstractWith the rapid development of the World Wide Web, people benefit more and more from the sharing of information. However, Web pages with obscene, harmful, or illegal content can be easily accessed. It is important to recognize such unsuitable, offensive, or pornographic Web pages. In this paper, a novel framework for recognizing pornographic Web pages is described. A C4.5 decision tree is used to divide Web pages, according to content representations, into continuous text pages, discrete text pages, and image pages. These three categories of Web pages are handled, respectively, by a continuous text classifier, a discrete text classifier, and an algorithm that fuses the results from the image classifier and the discrete text classifier. In the continuous text classifier, statistical and semantic features are used to recognize pornographic texts. In the discrete text classifier, the naive Bayes rule is used to calculate the probability that a discrete text is pornographic. In the image classifier, the object's contour-based features are extracted to recognize pornographic images. In the text and image fusion algorithm, the Bayes theory is used to combine the recognition results from images and texts. Experimental results demonstrate that the continuous text classifier outperforms the traditional keyword-statistics-based classifier, the contour-based image classifier outperforms the traditional skin-region-based image classifier, the results obtained by our fusion algorithm outperform those by either of the individual classifiers, and our framework can be adapted to different categories of Web pages.
KeywordWeb Pages Pornographic Texts Pornographic Images Data Fusion Recognition
WOS HeadingsScience & Technology ; Technology
WOS KeywordENGINE
Indexed BySCI
Language英语
WOS Research AreaComputer Science ; Engineering
WOS SubjectComputer Science, Artificial Intelligence ; Engineering, Electrical & Electronic
WOS IDWOS:000245600800008
Citation statistics
Cited Times:84[WOS]   [WOS Record]     [Related Records in WOS]
Document Type期刊论文
Identifierhttp://ir.ia.ac.cn/handle/173211/9484
Collection09年以前成果
Corresponding AuthorOu Wu
Affiliation1.Chinese Acad Sci, Inst Automat, NLPR, Beijing 100080, Peoples R China
2.Univ London Birkbeck Coll, Sch Comp Sci & Informat Syst, London WC1E 7HT, England
Recommended Citation
GB/T 7714
Hu, Weiming,Wu, Ou,Chen, Zhouyao,et al. Recognition of pornographic web pages by classifying texts and images[J]. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,2007,29(6):1019-1034.
APA Hu, Weiming,Wu, Ou,Chen, Zhouyao,Fu, Zhouyu,Maybank, Steve,&Ou Wu.(2007).Recognition of pornographic web pages by classifying texts and images.IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE,29(6),1019-1034.
MLA Hu, Weiming,et al."Recognition of pornographic web pages by classifying texts and images".IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 29.6(2007):1019-1034.
Files in This Item: Download All
File Name/Size DocType Version Access License
tpami1.pdf(1559KB)期刊论文作者接受稿开放获取CC BY-NC-SAView Download
Related Services
Recommend this item
Bookmark
Usage statistics
Export to Endnote
Google Scholar
Similar articles in Google Scholar
[Hu, Weiming]'s Articles
[Wu, Ou]'s Articles
[Chen, Zhouyao]'s Articles
Baidu academic
Similar articles in Baidu academic
[Hu, Weiming]'s Articles
[Wu, Ou]'s Articles
[Chen, Zhouyao]'s Articles
Bing Scholar
Similar articles in Bing Scholar
[Hu, Weiming]'s Articles
[Wu, Ou]'s Articles
[Chen, Zhouyao]'s Articles
Terms of Use
No data!
Social Bookmark/Share
File name: tpami1.pdf
Format: Adobe PDF
All comments (0)
No comment.
 

Items in the repository are protected by copyright, with all rights reserved, unless otherwise indicated.