The Random Variables of the DNA Coverage Depth Problem
By: Şeyma Bodur , Stefano Lia , Hiram H. López and more
Potential Business Impact:
Stores more computer information in tiny DNA.
DNA data storage systems encode digital data into DNA strands, enabling dense and durable storage. Efficient data retrieval depends on coverage depth, a key performance metric. We study the random access coverage depth problem and focus on minimizing the expected number of reads needed to recover information strands encoded via a linear code. We compute the asymptotic performance of a recently proposed code construction, establishing and refining a conjecture in the field by giving two independent proofs. We also analyze a geometric code construction based on balanced quasi-arcs and optimize its parameters. Finally, we investigate the full distribution of the random variables that arise in the coverage depth problem, of which the traditionally studied expectation is just the first moment. This allows us to distinguish between code constructions that, at first glance, may appear to behave identically.
Similar Papers
The Coverage Depth Problem in DNA Storage Over Small Alphabets
Information Theory
Stores more computer data in tiny DNA.
Random Access in DNA Storage: Algorithms, Constructions, and Bounds
Information Theory
Stores more computer data in tiny DNA strands.
Optimizing Sequencing Coverage Depth in DNA Storage: Insights From DNA Storage Data
Information Theory
Stores more computer data in tiny DNA.