JOURNAL ARTICLE

Semi-supervised Sequence Learning

Andrew M. Dai, Quoc V. Le

Year: 2015 · Journal: arXiv (Cornell University) · Vol: 28 · Pages: 3079–3087 · Publisher: Cornell University

Abstract

We present two approaches that use unlabeled data to improve sequence learning with recurrent networks. The first approach is to predict what comes next in a sequence, which is a conventional language model in natural language processing. The second approach is to use a sequence autoencoder, which reads the input sequence into a vector and predicts the input sequence again. These two algorithms can be used as a "pretraining" step for a later supervised sequence learning algorithm. In other words, the parameters obtained from the unsupervised step can be used as a starting point for other supervised training models. In our experiments, we find that long short-term memory recurrent networks, after being pretrained with the two approaches, are more stable and generalize better. With pretraining, we are able to train long short-term memory recurrent networks up to a few hundred timesteps, thereby achieving strong performance in many text classification tasks, such as IMDB, DBpedia, and 20 Newsgroups.
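To make the second approach concrete, the sketch below illustrates the sequence-autoencoder idea described in the abstract: an LSTM encoder compresses the input sequence into a vector, an LSTM decoder is trained to reconstruct the same sequence, and the pretrained embedding and encoder weights are then reused to initialize a supervised text classifier. This is a minimal PyTorch illustration, not the authors' implementation; the module names, layer sizes, and the zero-vector stand-in for a start-of-sequence token are assumptions made for clarity.

```python
# Minimal sketch (assumed, not the authors' code) of sequence-autoencoder
# pretraining followed by classifier initialization.
import torch
import torch.nn as nn

VOCAB, EMBED, HIDDEN, NUM_CLASSES = 10_000, 128, 256, 2  # illustrative sizes


class SeqAutoencoder(nn.Module):
    def __init__(self):
        super().__init__()
        self.embed = nn.Embedding(VOCAB, EMBED)
        self.encoder = nn.LSTM(EMBED, HIDDEN, batch_first=True)
        self.decoder = nn.LSTM(EMBED, HIDDEN, batch_first=True)
        self.out = nn.Linear(HIDDEN, VOCAB)

    def forward(self, tokens):                       # tokens: (batch, time)
        emb = self.embed(tokens)
        _, state = self.encoder(emb)                 # whole sequence -> (h, c) vector
        # Teacher forcing: decoder sees the sequence shifted right by one;
        # position 0 gets a zero vector standing in for a <go> token (assumption).
        dec_in = torch.cat([torch.zeros_like(emb[:, :1]), emb[:, :-1]], dim=1)
        dec_out, _ = self.decoder(dec_in, state)
        return self.out(dec_out)                     # logits over the vocabulary


class Classifier(nn.Module):
    def __init__(self, pretrained: SeqAutoencoder):
        super().__init__()
        # Reuse the parameters from the unsupervised step as a starting point.
        self.embed, self.encoder = pretrained.embed, pretrained.encoder
        self.head = nn.Linear(HIDDEN, NUM_CLASSES)

    def forward(self, tokens):
        _, (h, _) = self.encoder(self.embed(tokens))
        return self.head(h[-1])                      # classify from final hidden state


# Usage sketch: pretrain by minimizing reconstruction cross-entropy on
# unlabeled text, then fine-tune Classifier(sa) on labeled examples.
sa = SeqAutoencoder()
tokens = torch.randint(0, VOCAB, (4, 50))            # toy batch, 50 timesteps
recon_loss = nn.functional.cross_entropy(
    sa(tokens).reshape(-1, VOCAB), tokens.reshape(-1))
clf_logits = Classifier(sa)(tokens)                  # shape (4, NUM_CLASSES)
```

The language-model variant differs only in that the encoder itself is trained to predict the next token (no separate decoder); in both cases the fine-tuning step starts from the pretrained weights rather than a random initialization.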

Keywords:
Sequence learning, Recurrent neural network, Autoencoder, Sequence labeling, Artificial neural network, Deep learning, Machine learning, Natural language processing, Artificial intelligence, Computer science

Metrics

Cited By: 487
References: 38
FWCI (Field Weighted Citation Impact): 0.00


Topics

Topic Modeling (Physical Sciences → Computer Science → Artificial Intelligence)
Natural Language Processing Techniques (Physical Sciences → Computer Science → Artificial Intelligence)
Text and Document Classification Technologies (Physical Sciences → Computer Science → Artificial Intelligence)