A Semiparametric Gaussian Mixture Model with Spatial Dependence and Its Application to Whole-Slide Image Clustering Analysis
By: Baichen Yu, Jin Liu, Hansheng Wang
Potential Business Impact:
Finds cancer in pictures by grouping similar spots.
We develop here a semiparametric Gaussian mixture model (SGMM) for unsupervised learning with valuable spatial information taken into consideration. Specifically, we assume for each instance a random location. Then, conditional on this random location, we assume for the feature vector a standard Gaussian mixture model (GMM). The proposed SGMM allows the mixing probability to be nonparametrically related to the spatial location. Compared with a classical GMM, SGMM is considerably more flexible and allows the instances from the same class to be spatially clustered. To estimate the SGMM, novel EM algorithms are developed and rigorous asymptotic theories are established. Extensive numerical simulations are conducted to demonstrate our finite sample performance. For a real application, we apply our SGMM method to the CAMELYON16 dataset of whole-slide images (WSIs) for breast cancer detection. The SGMM method demonstrates outstanding clustering performance.
Similar Papers
SiamMM: A Mixture Model Perspective on Deep Unsupervised Learning
Machine Learning (CS)
Finds hidden patterns in data, improving learning.
A Scalable Variational Bayes Approach for Fitting Non-Conjugate Spatial Generalized Linear Mixed Models via Basis Expansions
Methodology
Lets computers quickly learn from big, messy data.
Scalable Bayesian inference for high-dimensional mixed-type multivariate spatial data
Methodology
Models different kinds of data together in places.