Classifying web pages by content
- Author(s): D. Smith ; R. Harvey ; Yi Chan ; J.A. Bangham
- DOI: 10.1049/ic:19990619
IEE European Workshop. Distributed Imaging
- Source: IEE European Workshop. Distributed Imaging, 1999
- Conference: IEE European Workshop. Distributed Imaging
This paper describes a classification strategy for multimedia documents and reviews the prospects for detecting and filtering documents, such as Web pages, that may be pornographic. We examine several colour filtering algorithms with a view to producing a reliable skin filter. The results, obtained using very simple features extracted from an image-only database containing around two thousand hand-labelled images, are surprisingly good. When the image results are combined with a simple text analysis scheme, we are able to achieve a very accurate classification. (7 pages)
Inspec keywords: multimedia communication; multimedia databases; information resources; visual databases; image classification; image segmentation
Subjects: Spatial and pictorial databases; Multimedia communications; Information networks; Multimedia databases; Computer vision and image processing techniques; Optical, image and video signal processing