Language Independent Content Extraction From Web Pages

Lieferzeit: Lieferbar innerhalb 14 Tagen

39,90 

ISBN: 6137328481
ISBN 13: 9786137328484
Autor: Chandramma, R/RaviTeja, Ravindranath R C
Verlag: LAP LAMBERT Academic Publishing
Umfang: 52 S.
Erscheinungsdatum: 30.01.2019
Auflage: 1/2019
Format: 0.4 x 22 x 15
Gewicht: 96 g
Produktform: Kartoniert
Einband: Kartoniert
Artikelnummer: 6308133 Kategorie:

Beschreibung

The rapid development of the internet and web publishing techniques create numerous information sources published as HTML pages on World Wide Web. However, there is lot of redundant and irrelevant information also on web pages. Navigation panels, Table of content (TOC), advertisements, copyright statements, service catalogs, privacy policies etc. on web pages are considered as relevant and irrelevant content. Such information makes various web mining tasks such as web page crawling, web page classification, link based ranking, topic distillation complex.

Autorenporträt

R Chandramma is working as Associate professor in VKIT BangaloreRavindranath R C is working as Assistant professor in VKIT Bangalore

Herstellerkennzeichnung:


BoD - Books on Demand
In de Tarpen 42
22848 Norderstedt
DE

E-Mail: info@bod.de

Das könnte Ihnen auch gefallen …