A clustering algorithm for the single cell analysis of mixtures
By: Robert G. Cowell
Potential Business Impact:
Groups DNA from crime scenes to find suspects.
A probabilistic clustering algorithm is proposed for the analysis of forensic DNA mixtures in which individual cells are isolated and short tandem repeats are amplified using the polymerase chain reaction to generate single cell electropherograms. The task of the algorithm is to use the peak height information in the electropherograms to group the cells according to their contributors. Using a recently developed experimental set of individual cell electropherograms, a large set of simulations shows that the proposed clustering algorithm has excellent performance in correctly grouping single cells, and for assigning likelihood ratios for persons of interest (of known genotype).
Similar Papers
Robust fuzzy clustering with cellwise outliers
Methodology
Finds hidden groups and weird data points.
Random Matrix Theory-guided sparse PCA for single-cell RNA-seq data
Machine Learning (CS)
Cleans up messy cell data for better science.
Bayesian Forensic DNA Mixture Deconvolution Using a Novel String Similarity Measure
Methodology
Finds a suspect's DNA in mixed samples.