JOURNAL ARTICLE

Automatically combining ranking heuristics for HTML documents

Abstract

Current search engines use several criteria, or heuristics, to rank HTML documents. These heuristics must be combined into a ranking function that, given a text query, returns a ranked list of HTML documents. The standard approach is to build a weighted average by manually estimating the importance of each heuristic and assigning it a weight proportional to that estimate. In this paper we apply an automatic method for combining HTML ranking heuristics. We study the performance of the automatic method using recall/precision evaluations and, using collections of HTML documents with different characteristics, show that it finds weights tailored to the specific characteristics of each document collection.
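To make the weighted-average baseline and the recall/precision evaluation concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the method from the paper: the heuristic names (`title_match`, `body_tf`, `inlinks`), the scores, the weights, and the relevance judgments are all hypothetical, and the abstract does not say which heuristics or which automatic weighting procedure the authors use.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical data layout: doc id -> heuristic name -> score in [0, 1].
Scores = Dict[str, Dict[str, float]]


def combine(scores: Scores, weights: Dict[str, float]) -> List[str]:
    """Rank documents by the weighted average of their heuristic scores."""
    total = sum(weights.values())

    def combined(doc_id: str) -> float:
        doc = scores[doc_id]
        # A heuristic missing for a document contributes a score of 0.
        return sum(w * doc.get(h, 0.0) for h, w in weights.items()) / total

    # Highest combined score first; ties are broken by document id.
    return sorted(scores, key=lambda d: (-combined(d), d))


def precision_recall_at_k(
    ranking: List[str], relevant: Set[str], k: int
) -> Tuple[float, float]:
    """Precision and recall over the top k documents of a ranking."""
    hits = sum(1 for d in ranking[:k] if d in relevant)
    precision = hits / k if k else 0.0
    recall = hits / len(relevant) if relevant else 0.0
    return precision, recall


# Assumed example: three documents scored by three made-up heuristics.
scores: Scores = {
    "a.html": {"title_match": 1.0, "body_tf": 0.2, "inlinks": 0.1},
    "b.html": {"title_match": 0.0, "body_tf": 0.9, "inlinks": 0.8},
    "c.html": {"title_match": 0.0, "body_tf": 0.3, "inlinks": 0.2},
}
manual_weights = {"title_match": 0.5, "body_tf": 0.3, "inlinks": 0.2}
relevant = {"b.html", "c.html"}  # assumed relevance judgments for one query

ranking = combine(scores, manual_weights)
precision, recall = precision_recall_at_k(ranking, relevant, k=2)
print(ranking, precision, recall)
```

In this sketch the weights are fixed by hand, which corresponds to the manual approach the abstract describes. An automatic method would instead choose the weights by optimizing an evaluation measure such as the one computed here over a document collection.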

Keywords:
Heuristics; Ranking (information retrieval); Computer science; Information retrieval; Heuristic; Rank (graph theory); Precision and recall; HTML element; Data mining; Learning to rank; Artificial intelligence; Web page; World Wide Web; Mathematics

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.00
Refs: 0
Citation Normalized Percentile: 0.74

Topics

Web Data Mining and Analysis
Physical Sciences → Computer Science → Information Systems
Information Retrieval and Search Behavior
Physical Sciences → Computer Science → Information Systems
Data Management and Algorithms
Physical Sciences → Computer Science → Signal Processing

Related Documents

JOURNAL ARTICLE

Automatically Converting HTML Documents with Similar Pattern into XML Documents

O Geum-Yong, Injun Hwang

Journal: The KIPS Transactions PartD Year: 2002 Vol: 9D (3) Pages: 355-364

BOOK-CHAPTER

Automatically Created Heuristics

Stefan Edelkamp, Stefan Schrödl

Elsevier eBooks Year: 2011 Pages: 161-192

BOOK-CHAPTER

Creating HTML Documents

Adam Freeman

Apress eBooks Year: 2011 Pages: 117-150

BOOK-CHAPTER

Authoring HTML Documents

Bebo White

Electronic publishing series Year: 1996 Pages: 151-184