Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data

Published in Journal of Geophysical Research: Machine Learning and Computation, 2024

Zhang, H., J. Finkel, D. S. Abbot, E. P. Gerber, and J. Weare, 2024: Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational data, J. Geophys. Res.: Machine Learning Comp., in press.

Official Version to appear

Key points

  • Given sufficient training data, convolutional neural networks can predict the maintenance of Atlantic blocking from an initial blocked state.
  • Transfer learning from an idealized model to reanalysis data enables predictive skill in the low data regime of the observational record.
  • Feature importance analysis reveals the influence of upstream flow on blocking persistence and quantifies biases in the idealized model.

Blocking events are an important cause of extreme weather, especially long-lasting blocking events that trap weather systems in place. The duration of blocking events is, however, underestimated in climate models. Explainable Artificial Intelligence are a class of data analysis methods that can help identify physical causes of prolonged blocking events and diagnose model deficiencies. We demonstrate this approach on an idealized quasigeostrophic model developed by Marshall and Molteni (1993). We train a convolutional neural network (CNN), and subsequently, build a sparse predictive model for the persistence of Atlantic blocking, conditioned on an initial high-pressure anomaly. Shapley Additive ExPlanation (SHAP) analysis reveals that high-pressure anomalies in the American Southeast and North Atlantic, separated by a trough over Atlantic Canada, contribute significantly to prediction of sustained blocking events in the Atlantic region. This agrees with previous work that identified precursors in the same regions via wave train analysis. When we apply the same CNN to blockings in the ERA5 atmospheric reanalysis, there is insufficient data to accurately predict persistent blocks. We partially overcome this limitation by pre-training the CNN on the plentiful data of the Marshall-Molteni model, and then using Transfer Learning to achieve better predictions than direct training. SHAP analysis before and after transfer learning allows a comparison between the predictive features in the reanalysis and the quasigeostrophic model, quantifying dynamical biases in the idealized model. This work demonstrates the potential for machine learning methods to extract meaningful precursors of extreme weather events and achieve better prediction using limited observational data.