JOURNAL ARTICLE

Improving Word Representations with Document Labels

Liner YangXinxiong ChenZhiyuan LiuMaosong Sun

Year: 2017 Journal:   IEEE/ACM Transactions on Audio Speech and Language Processing Vol: 25 (4)Pages: 863-870   Publisher: Institute of Electrical and Electronics Engineers

Abstract

Word representations, aiming to build vectors for each word, have been successfully used in a variety of applications. Most word representations are learned from large amounts of documents ignoring other information. This is rather suboptimal because the side information of the documents, such as document labels, is not used in learning word representations. In this paper, we focus on how to exploit these side information to improve word representations. We propose to incorporate document labels into the learning process of word representations in two frameworks: neural network and matrix factorization. The experimental results on word analogy and word similarity task show that our models can better capture the semantic and syntactic information than the original models. Our models also improve the performance of word representations on text classification task.

Keywords:
Computer science Word (group theory) Natural language processing Artificial intelligence Variety (cybernetics) Analogy Task (project management) Similarity (geometry) Focus (optics) Exploit Process (computing) Information retrieval Linguistics

Metrics

13
Cited By
1.38
FWCI (Field Weighted Citation Impact)
61
Refs
0.83
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence

Related Documents

JOURNAL ARTICLE

Generating word and document matrix representations for document classification

Shun GuoNianmin Yao

Journal:   Neural Computing and Applications Year: 2019 Vol: 32 (14)Pages: 10087-10108
JOURNAL ARTICLE

Improving Topic Models with Latent Feature Word Representations

Dat Quoc NguyenRichard BillingsleyLan DuMark Johnson

Journal:   Transactions of the Association for Computational Linguistics Year: 2015 Vol: 3 Pages: 299-313
JOURNAL ARTICLE

Learning Word Vectors Efficiently Using Shared Representations and Document Representations

Qun LuoWeiran Xu

Journal:   Proceedings of the AAAI Conference on Artificial Intelligence Year: 2015 Vol: 29 (1)
© 2026 ScienceGate Book Chapters — All rights reserved.