Error-Correcting Codes for Labeled DNA Sequences
By: Dganit Hanania, Eitan Yaakobi
Potential Business Impact:
Fixes mistakes when reading DNA labels.
Labeling of DNA molecules is a fundamental technique for DNA visualization and analysis. This process was mathematically modeled in [1], where the received sequence indicates the positions of the labels used. In this work, we develop error-correcting codes for labeled DNA sequences, establishing bounds and constructing explicit systematic encoders for single substitution, insertion, and deletion errors. We focus on two cases: (1) using the complete set of length-two labels and (2) using the minimal set of length-two labels that ensures the recovery of DNA sequences from their labeling for 'almost' all DNA sequences.
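To make the idea of a received sequence that "indicates the positions of the used labels" concrete, here is a minimal Python sketch. It is an illustration under stated assumptions, not the model of [1]: it assumes that each position simply reports which length-two label (if any) starts there, and the names `label_positions` and `ALL_LABELS` are hypothetical. The exact output alphabet and the treatment of overlapping labels in [1] may differ.

```python
from itertools import product

# Hypothetical helper: the complete set of 16 length-two labels over {A, C, G, T}.
ALL_LABELS = {a + b for a, b in product("ACGT", repeat=2)}


def label_positions(seq, labels):
    """Return a list whose entry i is the label starting at position i, or None.

    Assumption (not taken from the paper): a label is reported at the
    position where it starts, and overlapping occurrences are all reported.
    """
    out = []
    for i in range(len(seq)):
        pair = seq[i:i + 2]
        # The final position has no full length-two window, so it is never labeled.
        out.append(pair if len(pair) == 2 and pair in labels else None)
    return out


# A single label marks only the places where it occurs.
single = label_positions("ACGTAC", {"AC"})

# With the complete label set, every length-two window is reported.
complete = label_positions("ACG", ALL_LABELS)
```

Under this simplified view, a small label set leaves most positions unmarked, while the complete set of length-two labels reports every adjacent pair of bases. That contrast is why the two cases in the abstract are treated separately.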
Similar Papers
Constrained Error-Correcting Codes for Efficient DNA Synthesis
Information Theory
Makes storing information in DNA cheaper and more reliable.
LOCO Codes Can Correct as Well: Error-Correction Constrained Coding for DNA Data Storage
Information Theory
Stores more data in DNA and fixes errors.
Function-Correcting Codes for Insertion-Deletion Channel
Information Theory
Saves space when storing computer information.