JOURNAL ARTICLE

IEEE ICHI Healthcare Data Analytics Challenge

Abstract

There are several publicly accessible patient forums where patients can post questions related to their health conditions. The objective of this study was to develop a query-retrieval system that can mine such forums and identify existing questions most similar to the provided question. This pilot study based on a bag-of-words model with latent semantic analysis and cosine similarity suggests that text similarity-based mining holds promise for identification of diabetes-related questions from patient forums and informing self-care management. Further studies involving advance natural language processing tools can be used to reduce false positives and uncover semantically related questions.

Keywords:
Cosine similarity Computer science Latent semantic analysis Similarity (geometry) Identification (biology) Analytics Health care Information retrieval Data science Topic model Semantics (computer science) False positive paradox Artificial intelligence Cluster analysis

Metrics

0
Cited By
0.00
FWCI (Field Weighted Citation Impact)
6
Refs
0.09
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Sentiment Analysis and Opinion Mining
Physical Sciences →  Computer Science →  Artificial Intelligence
Biomedical Text Mining and Ontologies
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology

Related Documents

JOURNAL ARTICLE

Healthcare data analytics challenge

Year: 2016 Pages: xli-xli
JOURNAL ARTICLE

Healthcare Data Analytics Challenge

Zhiguo YuByron WallaceTodd R. Johnson

Year: 2015 Vol: 26 Pages: 515-516
JOURNAL ARTICLE

Healthcare Data Analytics

Ivana Ognjanović

Journal:   Studies in health technology and informatics Year: 2020 Vol: 274 Pages: 122-135
© 2026 ScienceGate Book Chapters — All rights reserved.