JOURNAL ARTICLE

Multi-Candidate Word Segmentation using Bi-directional LSTM Neural Networks

Abstract

Most existing word segmentation methods output one single segmentation solution. This paper provides an analysis of word segmentation performance when more than one solutions are taken into account. Towards this investigation, a deep neural network with multiple thresholds is applied to generate multiple candidates for segmentation. As a test-bed, the well-known bidirectional long short-term memory (BiLSTM) units are used with eleven contexts in a deep neural network. As performance indices, three measures; recall, precision and f-measure, are plotted with respect to various thresholds for both boundary level and word level evaluation. By a number of experiments, the result shows that the multi-candidate word segmentation can help us increase the recalls while maintaining the precisions.

Keywords:
Segmentation Computer science Word (group theory) Artificial intelligence Artificial neural network Text segmentation Pattern recognition (psychology) Measure (data warehouse) Deep learning Precision and recall Term (time) Speech recognition Data mining Mathematics

Metrics

15
Cited By
1.79
FWCI (Field Weighted Citation Impact)
35
Refs
0.87
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

Natural Language Processing Techniques
Physical Sciences →  Computer Science →  Artificial Intelligence
Topic Modeling
Physical Sciences →  Computer Science →  Artificial Intelligence
Speech Recognition and Synthesis
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.