Expected Length of the Longest Common Subsequence of Multiple Strings
By: Ray Li, William Ren, Yiran Wen
Potential Business Impact:
Finds patterns in random text faster.
We study the generalized Chv\'atal-Sankoff constant $\gamma_{k,d}$, which represents the normalized expected length of the longest common subsequence (LCS) of $d$ independent uniformly random strings over an alphabet of size $k$. We derive asymptotically tight bounds for $\gamma_{2,d}$, establishing that $\gamma_{2,d} = \frac{1}{2} + \Theta\left(\frac{1}{\sqrt{d}}\right)$. We also derive asymptotically near-optimal bounds on $\gamma_{k,d}$ for $d\ge \Omega(\log k)$.
Similar Papers
Online computation of normalized substring complexity
Data Structures and Algorithms
Finds patterns in text super fast.
Linear-space LCS enumeration with quadratic-time delay for two strings
Data Structures and Algorithms
Finds shared patterns in long texts faster.
The Complexity of Maximal Common Subsequence Enumeration
Data Structures and Algorithms
Finds important patterns in data faster.