Fusing Multi- and Hyperspectral Satellite Data for Harmful Algal Bloom Monitoring with Self-Supervised and Hierarchical Deep Learning
By: Nicholas LaHaye, Kelly M. Luis, Michelle M. Gierach
Potential Business Impact:
Spots harmful ocean blooms from space.
We present a self-supervised machine learning framework for detecting and mapping harmful algal bloom (HAB) severity and speciation using multi-sensor satellite data. By fusing reflectance data from operational instruments (VIIRS, MODIS, Sentinel-3, PACE) with TROPOMI solar-induced fluorescence (SIF), our framework, called SIT-FUSE, generates HAB severity and speciation products without requiring per-instrument labeled datasets. The framework employs self-supervised representation learning, hierarchical deep clustering to segment phytoplankton concentrations and speciations into interpretable classes, validated against in-situ data from the Gulf of Mexico and Southern California (2018-2025). Results show strong agreement with total phytoplankton, Karenia brevis, Alexandrium spp., and Pseudo-nitzschia spp. measurements. This work advances scalable HAB monitoring in label-scarce environments while enabling exploratory analysis via hierarchical embeddings: a critical step toward operationalizing self-supervised learning for global aquatic biogeochemistry.
Similar Papers
AI-driven multi-source data fusion for algal bloom severity classification in small inland water bodies: Leveraging Sentinel-2, DEM, and NOAA climate data
Machine Learning (CS)
Spots harmful algae blooms in water using satellites.
Seg the HAB: Language-Guided Geospatial Algae Bloom Reasoning and Segmentation
Artificial Intelligence
Spots and measures harmful algae blooms automatically.
Synthetic Data Augmentation for Enhancing Harmful Algal Bloom Detection with Machine Learning
Machine Learning (CS)
Find harmful algae blooms faster with fake data.