Score: 0

Fourier analysis of the physics of transfer learning for data-driven subgrid-scale models of ocean turbulence

Published: April 21, 2025 | arXiv ID: 2504.15487v1

By: Moein Darman , Pedram Hassanzadeh , Laure Zanna and more

Potential Business Impact:

Helps computer weather forecasts learn new tricks.

Business Areas:
Machine Learning Artificial Intelligence, Data and Analytics, Software

Transfer learning (TL) is a powerful tool for enhancing the performance of neural networks (NNs) in applications such as weather and climate prediction and turbulence modeling. TL enables models to generalize to out-of-distribution data with minimal training data from the new system. In this study, we employ a 9-layer convolutional NN to predict the subgrid forcing in a two-layer ocean quasi-geostrophic system and examine which metrics best describe its performance and generalizability to unseen dynamical regimes. Fourier analysis of the NN kernels reveals that they learn low-pass, Gabor, and high-pass filters, regardless of whether the training data are isotropic or anisotropic. By analyzing the activation spectra, we identify why NNs fail to generalize without TL and how TL can overcome these limitations: the learned weights and biases from one dataset underestimate the out-of-distribution sample spectra as they pass through the network, leading to an underestimation of output spectra. By re-training only one layer with data from the target system, this underestimation is corrected, enabling the NN to produce predictions that match the target spectra. These findings are broadly applicable to data-driven parameterization of dynamical systems.

Country of Origin
🇺🇸 United States

Page Count
22 pages

Category
Computer Science:
Machine Learning (CS)