Combining a segmentation-like approach and a density-based approach in content extraction.
Density-based approaches in content extraction, whose task is to extract contents from Web pages, are commonly used to obtain page contents that are critical to many Web mining applications. However, traditional density-based approaches cannot effectively manage pages that contain short contents and...
| Published in: | Tsinghua Science and Technology 17, 3 (June 2012). |
|---|---|
| Main Author: | |
| Other Authors: | , |
| Format: | Article |
| Language: | English |
| Subjects: |