Combining a segmentation-like approach and a density-based approach in content extraction.

Density-based approaches in content extraction, whose task is to extract contents from Web pages, are commonly used to obtain page contents that are critical to many Web mining applications. However, traditional density-based approaches cannot effectively manage pages that contain short contents and...

Description complète

Détails bibliographiques
Publié dans:Tsinghua Science and Technology 17, 3 (June 2012).
Auteur principal: Lin, Shuang
Autres auteurs: Chen, Jie, Niu, Zhendong
Format: Article
Langue:English
Sujets: