JOURNAL ARTICLE

DNA Barcoding through Quaternary LDPC Codes

Elizabeth TapiaFlavio E. SpetaleFlávia KrsticevicLaura AngelonePilar Bulacio

Year: 2015 Journal:   PLoS ONE Vol: 10 (10)Pages: e0140459-e0140459   Publisher: Public Library of Science

Abstract

For many parallel applications of Next-Generation Sequencing (NGS) technologies short barcodes able to accurately multiplex a large number of samples are demanded. To address these competitive requirements, the use of error-correcting codes is advised. Current barcoding systems are mostly built from short random error-correcting codes, a feature that strongly limits their multiplexing accuracy and experimental scalability. To overcome these problems on sequencing systems impaired by mismatch errors, the alternative use of binary BCH and pseudo-quaternary Hamming codes has been proposed. However, these codes either fail to provide a fine-scale with regard to size of barcodes (BCH) or have intrinsic poor error correcting abilities (Hamming). Here, the design of barcodes from shortened binary BCH codes and quaternary Low Density Parity Check (LDPC) codes is introduced. Simulation results show that although accurate barcoding systems of high multiplexing capacity can be obtained with any of these codes, using quaternary LDPC codes may be particularly advantageous due to the lower rates of read losses and undetected sample misidentification errors. Even at mismatch error rates of 10(-2) per base, 24-nt LDPC barcodes can be used to multiplex roughly 2000 samples with a sample misidentification error rate in the order of 10(-9) at the expense of a rate of read losses just in the order of 10(-6).

Keywords:
Low-density parity-check code BCH code Error detection and correction Hamming distance Computer science Hamming code Forward error correction Barcode Algorithm Word error rate Multiplex ligation-dependent probe amplification Turbo code Concatenated error correction code Multiplex Block code Decoding methods Biology Genetics Speech recognition

Metrics

4
Cited By
0.29
FWCI (Field Weighted Citation Impact)
79
Refs
0.63
Citation Normalized Percentile
Is in top 1%
Is in top 10%

Citation History

Topics

DNA and Biological Computing
Life Sciences →  Biochemistry, Genetics and Molecular Biology →  Molecular Biology
Error Correcting Code Techniques
Physical Sciences →  Computer Science →  Computer Networks and Communications
Algorithms and Data Compression
Physical Sciences →  Computer Science →  Artificial Intelligence
© 2026 ScienceGate Book Chapters — All rights reserved.