JOURNAL ARTICLE

Learning word hierarchical representations with neural networks for document modeling

Longhui WangYong WangYudong Xie

Year: 2019 Journal:   Journal of Experimental & Theoretical Artificial Intelligence Vol: 32 (3)Pages: 515-531   Publisher: Taylor & Francis

Abstract

Word embedding models treat words with equal status, which leads to the neglect of hierarchical semantic relationships between words (e.g., ‘green’ – ‘color’ and ‘cat’ – ‘mammal’). To build a hierarchical structure of words from raw text data, we propose an unsupervised model to learn word hierarchical representations (WHR), which are extended from word representations. Globally, WHRs can describe a word with several other words representing the basic attributes. The WHR model is an extended continuous bag-of-words (CBOW) neural language model with perceptual grouping and attention mechanisms. We further use WHRs to generate document representations, that are more expressive than some widely used document models, such as latent topic and deep learning models. Experimental results demonstrated that our model outperforms state-of-the-art baselines in terms of document retrieval, document classification, and sentiment analysis.

Keywords:
Computer science Artificial intelligence Natural language processing Word (group theory) Hierarchical database model Word embedding Embedding Linguistics Data mining

Metrics

1
Cited By
0.15
FWCI (Field Weighted Citation Impact)
40
Refs
0.56
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Modeling Word Learning and Processing with Recurrent Neural Networks

Claudia Marzi

Journal:   Information Year: 2020 Vol: 11 (6)Pages: 320-320
JOURNAL ARTICLE

Learning attribute representations with deep convolutional neural networks for word spotting

Sudholt, Sebastian

Journal:   Technische Universität Dortmund Eldorado (Technische Universität Dortmund) Year: 2018
JOURNAL ARTICLE

Learning Word Vectors Efficiently Using Shared Representations and Document Representations

Qun LuoWeiran Xu

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2015 Vol: 29 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.