JOURNAL ARTICLE

Deep variational auto-encoder for text classification

Abstract

Dimensionality reduction is an important technique in machine learning and data mining, which makes the processing of high dimensional data faster. An efficient method for dimensionality reduction can find a low-dimension feature subset extracting the most relevant information. The dimensionality reduction methods based on neural network are applied to all kinds of data, especially computer vision data. In this paper, we focus on the text data with high sparse and high dimension, then reduce its dimension by using the variational auto-encoder. The performance of variational auto-encoder in dimensionality reduction is observed by comparison test. First, unstructured text data is converted to computer-processable vectors using term frequencyCinverse document frequency. Then variational auto-encoder is used to reduce the dimensionality. Finally, the experiment verifies the efficiency of variational auto-encoder by comparing seven commonly used dimensionality reduction methods.

Keywords:
Dimensionality reduction Autoencoder Curse of dimensionality Computer science Encoder Dimension (graph theory) Artificial intelligence Artificial neural network Pattern recognition (psychology) Intrinsic dimension Focus (optics) Reduction (mathematics) Clustering high-dimensional data Algorithm Data mining Mathematics Cluster analysis

Metrics

6
Cited By
0.21
FWCI (Field Weighted Citation Impact)
30
Refs
0.53
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Generative Adversarial Networks and Image Synthesis
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Retrieval and Classification Techniques
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition
Image Processing and 3D Reconstruction
Physical Sciences →  Computer Science →  Computer Vision and Pattern Recognition

Related Documents

© 2026 ScienceGate Book Chapters — All rights reserved.