JOURNAL ARTICLE

Opening scholarly documents through text analytics

Abstract

Vast amounts of scholarly knowledge are buried in electronic theses and dissertations (ETDs). ETDs are valuable documents that have been developed at great cost but largely remain unknown and unused. We aim for digital libraries to open up these long documents using computerized text mining and analytics. We add value to the existing systems by providing chapter-level labels and summaries. This allows readers to easily find chapters of interest. We use ETDs to fine-tune language models like BERT and SciBERT, to help better capture the specialized vocabulary present in such documents.

Keywords:
Computer science Analytics Vocabulary Digital library Value (mathematics) World Wide Web Data science Information retrieval Linguistics

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
2
Refs
0.05
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Semantic Web and Ontologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Open access to scholarly full‐text documents

Péter Jacsó

Journal:   Online Information Review Year: 2006 Vol: 30 (5)Pages: 587-594
JOURNAL ARTICLE

Quotations in scholarly text: Converting existing documents to hypertext

Stephanie W. Haas

Journal:   Computers and the Humanities Year: 1994 Vol: 28 (3)Pages: 165-175
BOOK-CHAPTER

Scholarly Documents

Charles HaltonSaana Svärd

Cambridge University Press eBooks Year: 2017
JOURNAL ARTICLE

Efficient Exploration of Algorithms in Scholarly Documents Using Big Data Analytics

B. Tamil Bharathi

Journal:   International Journal Of Engineering And Computer Science Year: 2016
© 2026 ScienceGate Book Chapters — All rights reserved.