JOURNAL ARTICLE

Multi-label Associative Classification of Medical Documents from MEDLINE

Abstract

Ability to provide convenient access to scientific documents becomes a difficult problem due to large and constantly increasing number of incoming documents and extensive manual work associated with their storage, description and classification. This requires intelligent search and classification capabilities for users to find required information. It is especially true for repositories of scientific medical articles due to their extensive use, large size and number of new documents, and well maintained structure. This research aims to provide an automated method for classification of articles into the structure of medical document repositories, which would support currently performed extensive manual work. The proposed method classifies articles from the largest medical repository, MEDLINE, using state of the art data mining technology. The method is based on a novel associative classification technique which considers recurrent items and most importantly multi-label characteristic of the MEDLINE data. Based on large scale experiments that utilize 350,000 documents several different classification algorithms have been compared including both recurrent and non-recurrent associative classification. The algorithms are capable of assigning each medical document to several classes (multi-label classification) and are characterized by relatively high accuracy. We also investigate different measures of classification quality and point out pros and cons of each. Based on experimental result we show that recurrent item based associative classification demonstrates superior performance and propose three alternative setups that allow the user to obtain different desired classification qualities.

Keywords:
Computer science Associative property Information retrieval Document classification Point (geometry) Multi-label classification Data mining Artificial intelligence Machine learning

Metrics

28
Cited By
2.75
FWCI (Field Weighted Citation Impact)
28
Refs
0.91
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems

Related Documents

BOOK-CHAPTER

Multi-Label Associative Classification

Adriano VelosoWagner Meira

SpringerBriefs in computer science Year: 2011 Pages: 53-59
BOOK-CHAPTER

Multi-label Lazy Associative Classification

Adriano VelosoWagner MeiraMarcos André GonçalvesMohammed J. Zaki

Lecture notes in computer science Year: 2007 Pages: 605-612
JOURNAL ARTICLE

Study on Multi-Label Classification of Medical Dispute Documents

Baili ZhangZhou ShanLe YangJianhua LvMingjun Zhong

Journal:   Computers, materials & continua/Computers, materials & continua (Print) Year: 2020 Vol: 65 (3)Pages: 1975-1986
JOURNAL ARTICLE

Multi-Label Rules Algorithm Based Associative Classification

Neda AbdelhamidAladdin AyeshWael Hadi

Journal:   Parallel Processing Letters Year: 2014 Vol: 24 (01)Pages: 1450001-1450001
JOURNAL ARTICLE

Associative Classification in Multi-label Classification: an Investigative Study

Raed AlazaidahMohammed Amin AlmaiahMo’ath Alluwaici

Journal:   Jordanian Journal of Computers and Information Technology Year: 2021 Pages: 1-1
© 2026 ScienceGate Book Chapters — All rights reserved.