JOURNAL ARTICLE

Unsupervised feature selection for linked social media data

Abstract

The prevalent use of social media produces mountains of unlabeled, high-dimensional data. Feature selection has been shown effective in dealing with high-dimensional data for efficient data mining. Feature selection for unlabeled data remains a challenging task due to the absence of label information by which the feature relevance can be assessed. The unique characteristics of social media data further complicate the already challenging problem of unsupervised feature selection, (e.g., part of social media data is linked, which makes invalid the independent and identically distributed assumption), bringing about new challenges to traditional unsupervised feature selection algorithms. In this paper, we study the differences between social media data and traditional attribute-value data, investigate if the relations revealed in linked data can be used to help select relevant features, and propose a novel unsupervised feature selection framework, LUFS, for linked social media data. We perform experiments with real-world social media datasets to evaluate the effectiveness of the proposed framework and probe the working of its key components.

Keywords:
Feature selection Computer science Social media Feature (linguistics) Selection (genetic algorithm) Artificial intelligence Data mining Relevance (law) Machine learning Pattern recognition (psychology) World Wide Web

Metrics

197
Cited By
31.94
FWCI (Field Weighted Citation Impact)
40
Refs
1.00
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Data Mining Algorithms and Applications
Physical Sciences →  Computer Science →  Information Systems
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Complex Network Analysis Techniques
Physical Sciences →  Physics and Astronomy →  Statistical and Nonlinear Physics

Related Documents

JOURNAL ARTICLE

An Unsupervised Feature Selection Framework for Social Media Data

Jiliang TangHuan Liu

Journal:   IEEE Transactions on Knowledge and Data Engineering Year: 2014 Vol: 26 (12)Pages: 2914-2927
JOURNAL ARTICLE

Unsupervised feature selection in linked biological data

Elham HoseiniEghbal G. Mansoori

Journal:   Pattern Analysis and Applications Year: 2018 Vol: 22 (3)Pages: 999-1013
© 2026 ScienceGate Book Chapters — All rights reserved.