New Rule-Based Approach for Classifying HTML Documents

Lieferzeit: Lieferbar innerhalb 14 Tagen

49,90 

ISBN: 6202063203
ISBN 13: 9786202063203
Autor: Flaih, Laith
Verlag: LAP LAMBERT Academic Publishing
Umfang: 100 S.
Erscheinungsdatum: 21.11.2017
Auflage: 1/2017
Format: 0.6 x 22 x 15
Gewicht: 167 g
Produktform: Kartoniert
Einband: KT
Artikelnummer: 3179314 Kategorie:

Beschreibung

The uncontrolled type of nature of web content presents additional challenges to web page classification as compared to the traditional text classification, but the interconnected nature of hypertext lead to s time-consuming and labor intensive for a human to read over and correctly categorize an article manually, web page categorization/classification is one of the essential techniques for web mining. Web page classification aims to determine whether a web page belongs to a Category or categories, without it the web content becomes a mixed noisy data source that wastes time and effort of web users trying access interesting information. This book proposed new method for HTML document classification based on rule-based technique; it was designed used PHP, Apache and MySQL software tools. The method works by analyzing the submitted URL address which is input for proposed system to be scanned. The result of URL classification could be successfully used by the user for accessing relevant web document based on their queries. The method is designed to provide the friendly user interface easy to use. It enables the users of all different levels to use it.

Autorenporträt

Head of Computer Science Department - Cihan University, Head of Twana Private Institute for Computer Science, General Secretary of Cihan University Council, more than 19 scientific papers 13 Postgraduate students, Reviewer for more than 8 International Journals, Representative of the Cihan University - Erbil in Association of Arab Universities.

Das könnte Ihnen auch gefallen …