JOURNAL ARTICLE

Weakly Supervised Domain Detection

Yumo XuMirella Lapata

Year: 2019 Journal:   Transactions of the Association for Computational Linguistics Vol: 7 Pages: 581-596   Publisher: Association for Computational Linguistics

Abstract

In this paper we introduce domain detection as a new natural language processing task. We argue that the ability to detect textual segments that are domain-heavy (i.e., sentences or phrases that are representative of and provide evidence for a given domain) could enhance the robustness and portability of various text classification applications. We propose an encoder-detector framework for domain detection and bootstrap classifiers with multiple instance learning. The model is hierarchically organized and suited to multilabel classification. We demonstrate that despite learning with minimal supervision, our model can be applied to text spans of different granularities, languages, and genres. We also showcase the potential of domain detection for text summarization.

Keywords:
Computer science Automatic summarization Software portability Artificial intelligence Domain (mathematical analysis) Robustness (evolution) Natural language processing Encoder Task (project management) Machine learning Programming language

Metrics

6
Cited By
0.46
FWCI (Field Weighted Citation Impact)
76
Refs
0.71
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Text and Document Classification Technologies
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Advanced Text Analysis Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.