Score: 0

Kernel Density Balancing

Published: June 14, 2025 | arXiv ID: 2506.12626v1

By: John Park , Ning Hao , Yue Selena Niu and more

Potential Business Impact:

Improves how we see inside cells.

Business Areas:
A/B Testing Data and Analytics

High-throughput chromatin conformation capture (Hi-C) data provide insights into the 3D structure of chromosomes, with normalization being a crucial pre-processing step. A common technique for normalization is matrix balancing, which rescales rows and columns of a Hi-C matrix to equalize their sums. Despite its popularity and convenience, matrix balancing lacks statistical justification. In this paper, we introduce a statistical model to analyze matrix balancing methods and propose a kernel-based estimator that leverages spatial structure. Under mild assumptions, we demonstrate that the kernel-based method is consistent, converges faster, and is more robust to data sparsity compared to existing approaches.

Country of Origin
🇺🇸 United States

Page Count
26 pages

Category
Statistics:
Applications