On the Number of Subsequences in the Nonbinary Deletion Channel
By: Han Li, Xiang Wang, Fang-Wei Fu
Potential Business Impact:
Counts how many ways words change after removing letters.
In the deletion channel, an important problem is to determine the number of subsequences derived from a string $U$ of length $n$ when subjected to $t$ deletions. It is well-known that the number of subsequences in the setting exhibits a strong dependence on the number of runs in the string $U$, where a run is defined as a maximal substring of identical characters. In this paper we study the number of subsequences of a non-binary string in this scenario, and propose some improved bounds on the number of subsequences of $r$-run non-binary strings. Specifically, we characterize a family of $r$-run non-binary strings with the maximum number of subsequences under any $t$ deletions, and show that this number can be computed in polynomial time.
Similar Papers
Sequence Reconstruction over the Deletion Channel
Information Theory
Recovers lost computer code even with missing parts.
Sequence Reconstruction for the Single-Deletion Single-Substitution Channel
Information Theory
Finds shortest codes to fix errors in messages.
Multiset Deletion-Correcting Codes: Bounds and Constructions
Information Theory
Fixes errors when information is lost.