Longhui Wang, Yong Wang, Yudong Xie
Word embedding models treat all words as having equal status, which neglects the hierarchical semantic relationships between words (e.g., ‘green’ – ‘color’ and ‘cat’ – ‘mammal’). To build a hierarchical structure of words from raw text data, we propose an unsupervised model that learns word hierarchical representations (WHRs), an extension of ordinary word representations. Globally, a WHR describes a word in terms of several other words that represent its basic attributes. The WHR model extends the continuous bag-of-words (CBOW) neural language model with perceptual grouping and attention mechanisms. We further use WHRs to generate document representations that are more expressive than widely used document models, such as latent topic and deep learning models. Experimental results demonstrate that our model outperforms state-of-the-art baselines on document retrieval, document classification, and sentiment analysis.
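The abstract characterizes the WHR model as a CBOW extension with attention. As a rough illustration only, not the authors' implementation, the following minimal PyTorch sketch shows a CBOW-style predictor in which the usual mean-pooling of context embeddings is replaced by attention-weighted pooling; all names, layers, and dimensions here are hypothetical, and the paper's perceptual grouping component is not modeled:

```python
# Hypothetical sketch: CBOW with attention over context words.
# The class name, layers, and sizes are illustrative assumptions,
# not the WHR architecture from the paper.
import torch
import torch.nn as nn

class AttentiveCBOW(nn.Module):
    def __init__(self, vocab_size, embed_dim):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim)  # input embeddings
        self.attn = nn.Linear(embed_dim, 1)               # scores each context word
        self.out = nn.Linear(embed_dim, vocab_size)       # predicts the target word

    def forward(self, context):                  # context: (batch, window)
        e = self.embed(context)                  # (batch, window, dim)
        # Plain CBOW would use e.mean(dim=1); here attention weights
        # the context words before pooling.
        w = torch.softmax(self.attn(e), dim=1)   # (batch, window, 1)
        pooled = (w * e).sum(dim=1)              # (batch, dim)
        return self.out(pooled)                  # logits over the vocabulary

model = AttentiveCBOW(vocab_size=10_000, embed_dim=128)
ctx = torch.randint(0, 10_000, (4, 6))           # 4 examples, window of 6
print(model(ctx).shape)                          # torch.Size([4, 10000])
```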
Enes Burak Dündar, Ethem Alpaydın
Jakub Nowak, Marcin Korytkowski, Slava Voloshynovskiy, Rafał Scherer