Score: 0

Expected Length of the Longest Common Subsequence of Multiple Strings

Published: April 14, 2025 | arXiv ID: 2504.10425v1

By: Ray Li, William Ren, Yiran Wen

Potential Business Impact:

Finds patterns in random text faster.

Business Areas:

A/B Testing Data and Analytics

We study the generalized Chv\'atal-Sankoff constant $\gamma_{k,d}$, which represents the normalized expected length of the longest common subsequence (LCS) of $d$ independent uniformly random strings over an alphabet of size $k$. We derive asymptotically tight bounds for $\gamma_{2,d}$, establishing that $\gamma_{2,d} = \frac{1}{2} + \Theta\left(\frac{1}{\sqrt{d}}\right)$. We also derive asymptotically near-optimal bounds on $\gamma_{k,d}$ for $d\ge \Omega(\log k)$.