Application of Deep Learning in Biological Data Compression
By: Chunyu Zou
Cryogenic electron microscopy (Cryo-EM) has become an essential tool for capturing high-resolution biological structures. Despite its advantages for visualization, the large size of Cryo-EM data files poses significant challenges for researchers and educators. This paper investigates the application of deep learning, specifically implicit neural representation (INR), to compress Cryo-EM biological data. The proposed approach first extracts a binary map from each file according to a density threshold. This map is highly repetitive, so it can be compressed effectively with GZIP. A neural network is then trained to encode the spatial density information, so that only the network parameters and learnable latent vectors need to be stored. To improve reconstruction accuracy, I further incorporate positional encoding to enhance the spatial representation and a weighted Mean Squared Error (MSE) loss function to balance variations in the density distribution. With this approach, my aim is to provide a practical and efficient biological data compression solution for educational and research purposes, while maintaining a reasonable compression ratio and reconstruction quality from file to file.
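The three ingredients named in the abstract (a thresholded binary map compressed with GZIP, positional encoding of coordinates, and a weighted MSE loss) can be sketched in a few lines of NumPy. This is only an illustrative sketch under stated assumptions, not the paper's implementation: the function names, the bit-packing step, the octave frequency schedule, and the way the weights are supplied are all hypothetical choices, since the abstract does not specify them.

```python
import gzip
import numpy as np

# All names below are hypothetical; the abstract does not give an implementation.

def compress_binary_map(volume, threshold):
    """Threshold a density volume and GZIP the bit-packed binary map."""
    mask = volume > threshold                   # True where density exceeds the threshold
    packed = np.packbits(mask.ravel())          # 8 voxels per byte (assumed storage layout)
    return gzip.compress(packed.tobytes()), mask.shape

def decompress_binary_map(blob, shape):
    """Invert compress_binary_map and return a boolean array of the given shape."""
    packed = np.frombuffer(gzip.decompress(blob), dtype=np.uint8)
    n_voxels = int(np.prod(shape))
    bits = np.unpackbits(packed, count=n_voxels)  # drop the padding bits added by packbits
    return bits.astype(bool).reshape(shape)

def positional_encoding(coords, num_freqs=4):
    """Map (N, D) coordinates to sin/cos features at octave-spaced frequencies.

    The schedule 2^k * pi is a common choice for INRs and is an assumption here.
    """
    freqs = (2.0 ** np.arange(num_freqs)) * np.pi        # shape (F,)
    angles = coords[..., None] * freqs                   # shape (N, D, F)
    feats = np.concatenate([np.sin(angles), np.cos(angles)], axis=-1)
    return feats.reshape(coords.shape[0], -1)            # shape (N, D * 2F)

def weighted_mse(pred, target, weights):
    """Weighted mean squared error, normalized by the sum of the weights."""
    return float(np.sum(weights * (pred - target) ** 2) / np.sum(weights))
```

In a full pipeline, the encoded coordinates (together with a latent vector) would feed the network, and the weights would presumably be chosen so that rare density values are not dominated by common ones; how the weights are derived is left open here because the abstract does not say.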