Future Directions

Overfitting in GMM

The current way that we avoid overfitting is by taking a standard deviation cutoff so only some pixels are evaluated, and the peak separation and silhouette score plateauing checks were found by trial by error. There are other methods prescribed to avoid overfitting in a gaussian mixture model. One such resource is this article by J. Andrews et al 2018 which dicusses using a bootstrap sampling algorithm to find the optimal number of components for the fit.


Without changing the algorithm, Python 3.10 has implemented a match case algorithm which should speed up the overfitting checks as they stand now.

Error Propagation

There are other ways to extract the read noise from the mixture model like in Zhang et al 2019, which may warrant exploration if the current method is not found rigorous enough.