JOURNAL ARTICLE

Data mining for hypertext

Soumen Chakrabarti

Year: 2000 Journal:   ACM SIGKDD Explorations Newsletter Vol: 1 (2)Pages: 1-11   Publisher: Association for Computing Machinery

Abstract

With over 800 million pages covering most areas of human endeavor, the World-wide Web is a fertile ground for data mining research to make a difference to the effectiveness of information search. Today, Web surfers access the Web through two dominant interfaces: clicking on hyperlinks and searching via keyword queries. This process is often tentative and unsatisfactory. Better support is needed for expressing one's information need and dealing with a search result in more structured ways than available now. Data mining and machine learning have significant roles to play towards this end.In this paper we will survey recent advances in learning and mining problems related to hypertext in general and the Web in particular. We will review the continuum of supervised to semi-supervised to unsupervised learning problems, highlight the specific challenges which distinguish data mining in the hypertext domain from data mining in the context of data warehouses, and summarize the key areas of recent and ongoing research.

Keywords:
Computer science Hyperlink Hypertext Web mining World Wide Web Context (archaeology) Data science Web intelligence Information retrieval Process (computing) Key (lock) Web page Web modeling

Metrics

288
Cited By
47.05
FWCI (Field Weighted Citation Impact)
75
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
Advanced Database Systems and Queries
Physical Sciences →  Computer Science →  Computer Networks and Communications
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems

Related Documents

JOURNAL ARTICLE

Hypertext databases and data mining

Soumen Chakrabarti

Year: 1999 Pages: 508-508
JOURNAL ARTICLE

Hypertext databases and data mining

Soumen Chakrabarti

Journal:   ACM SIGMOD Record Year: 1999 Vol: 28 (2)Pages: 508-508
JOURNAL ARTICLE

HYPERTEXT AND HYPERMEDIA IN DATA MINING

Safa NakadeT BedardJ MerrettHanS ChakrabartiC CheungA HwangJ FuHan

Journal:   International Research Journal of Modernization in Engineering Technology and Science Year: 2023
JOURNAL ARTICLE

Hypertext data mining (tutorial AM-1)

Soumen Chakrabarti

Year: 2000 Pages: 1-32
© 2026 ScienceGate Book Chapters — All rights reserved.