JOURNAL ARTICLE

Semi-Supervised Collective Matrix Factorization for Topic Detection and Document Clustering

Abstract

Topic detection and tracking (TDT) under modern media circumstances has been dramatically innovated with the ever-changing social network and inconspicuous connections among participants in the internet communities. Apart from the inherent word features of analysing materials, such as news articles and personal or professional comments, incidental information attracts increasing attention from the research community. Meanwhile, numerous interrelations hiding in the propagated articles and network participants also promote the transfer and evolvement of topics, not only apparent connections, for example having the same tags and belonging to the same party, but also weak connections which are complicated and with little causal relations. Therefore, answering the question how to exploit and use this hidden information in the social network will extend the landscape of research on TDT. In this paper, we employ the followers' groups extracted from Twitter as the social context that accompanied the corresponding news articles and explore the interior links among them to develop a non-negative factorization methods with semi-supervised information derived from the original data. Furthermore, experiments are conducted on real and semi-synthetic data sets to test the performance of topic detection and documents clustering. The results demonstrate that the proposed method outperforms several state-of-the-art methods.

Keywords:
Computer science Exploit Cluster analysis Context (archaeology) Matrix decomposition Information retrieval Social media Data science The Internet Non-negative matrix factorization Social network (sociolinguistics) Artificial intelligence World Wide Web

Metrics

5
Cited By
0.57
FWCI (Field Weighted Citation Impact)
46
Refs
0.64
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics
Opinion Dynamics and Social Influence
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.