JOURNAL ARTICLE

Document structure-driven investigative information retrieval

Tuomas Ketola, Thomas Roelleke

Year: 2023 Journal: Information Systems Vol: 121 Pages: 102315-102315 Publisher: Elsevier BV

Abstract

Data-driven investigations are increasingly dealing with non-moderated, non-standard and even manipulated information. Whether the field in question is journalism, law enforcement, or insurance fraud, it is becoming more and more difficult for investigators to verify the outcomes of various black-box systems. To address this need for discovery methods that can be used for verification, we introduce a methodology for document structure-driven investigative information retrieval (InvIR). InvIR is defined as a subtask of exploratory IR, where transparency and reasoning take centre stage. The aim of InvIR is to facilitate the verification and discovery of facts from data and the communication of those facts to others. From a technical perspective, the methodology applies recent work from structured document retrieval (SDR) concerned with formal retrieval constraints and information content-based field weighting (ICFW). Using ICFW, the paper establishes the concept of relevance structures to describe the document structure-based relevance of documents. These contexts are then used to help the user navigate during their discovery process and to rank entities of interest. The proposed methodology is evaluated using a prototype search system called Relevance Structure-based Entity Ranker (RSER) in order to demonstrate its feasibility. This methodology represents an interesting and important research direction in a world where transparency is becoming more vital than ever.

Keywords:
Computer science, Relevance (law), Information retrieval, Transparency (behavior), Field (mathematics), Exploratory search, Document retrieval, Rank (graph theory), Weighting, Data science

Metrics

Cited By: 1
FWCI (Field Weighted Citation Impact): 0.26
Refs: 32
Citation Normalized Percentile: 0.58

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Web Data Mining and Analysis
Physical Sciences →  Computer Science →  Information Systems
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
