JOURNAL ARTICLE

Efficient Transfer Learning for Neural Network Language Models

Abstract

We apply transfer learning techniques to create topically and/or stylistically biased natural language models from small data samples, given generic long short-term memory (LSTM) language models trained on larger data sets. Although LSTM language models are powerful tools with wide-ranging applications, they require enormous amounts of data and time to train. We therefore build general-purpose language models in advance, taking advantage of large standing corpora and computational resources, so that more specialized analytical tools can be built on demand from smaller data sets. We show that it is possible to construct a language model from a small, focused corpus by first training an LSTM language model on a large corpus (e.g., the text of English Wikipedia) and then retraining only the internal transition model parameters on the smaller corpus. We also show that a single general language model can be reused through transfer learning to create many distinct special-purpose language models quickly with modest amounts of data.
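The retraining scheme described above (train a general model on a large corpus, then update only the internal transition parameters on a small, focused corpus) can be sketched in a few lines of Python. The toy parameter groups, the fake gradients, and the function names below are illustrative assumptions for exposition, not the authors' implementation.

```python
# Sketch of the transfer-learning scheme: after pretraining, only the
# internal (LSTM transition) parameters are updated on the small corpus,
# while the embedding and output layers stay frozen.
# All names and values here are illustrative, not the paper's code.

import random

def make_model(vocab_size=100, hidden=8, seed=0):
    """Toy parameter groups standing in for a pretrained LSTM language model."""
    rng = random.Random(seed)
    return {
        "embedding": [rng.gauss(0, 1) for _ in range(vocab_size)],
        "lstm": [rng.gauss(0, 1) for _ in range(hidden * hidden)],
        "output": [rng.gauss(0, 1) for _ in range(vocab_size)],
    }

def retrain(model, grads, lr=0.1, trainable=("lstm",)):
    """One SGD-style update that touches only the trainable groups."""
    for name, params in model.items():
        if name not in trainable:
            continue  # frozen: embedding and output layers keep their values
        model[name] = [p - lr * g for p, g in zip(params, grads[name])]
    return model

model = make_model()
frozen_embedding = list(model["embedding"])
orig_lstm = list(model["lstm"])
# Fake gradients, standing in for gradients from the small, focused corpus.
grads = {k: [1.0] * len(v) for k, v in model.items()}
retrain(model, grads)
assert model["embedding"] == frozen_embedding  # unchanged by retraining
assert model["lstm"] != orig_lstm              # transition weights updated
```

Because only the `lstm` group is updated, one pretrained general model can be cheaply specialized many times, once per small corpus, which is the reuse property the abstract claims.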

Keywords:
Computer science, Language model, Retraining, Artificial intelligence, Natural language processing, Transfer learning, Natural language, Artificial neural network, Recurrent neural network

Metrics

Cited by: 2
FWCI (Field-Weighted Citation Impact): 0.20
References: 52
Citation Normalized Percentile: 0.60


Topics (Physical Sciences → Computer Science → Artificial Intelligence)

Topic Modeling
Natural Language Processing Techniques
Speech Recognition and Synthesis