We propose a nonparametric approach for probabilistic prediction of the AL index trained with AL and solar wind ($v B_z$) data. Our framework relies on the diffusion forecasting technique, which views AL and $ v B_z $ data as observables of an autonomous, ergodic, stochastic dynamical system operating on a manifold. Diffusion forecasting builds a data-driven representation of the Markov semigroup governing the evolution of probability measures of the dynamical system. In particular, the Markov semigroup operator is represented in an orthonormal basis acquired from data using the diffusion maps algorithm and Takens delay embeddings. This representation of the evolution semigroup is used in conjunction with a Bayesian filtering algorithm for forecast initialization to predict the probability that the AL index is less than a user-selected threshold over arbitrary lead times and without requiring exogenous inputs. We find that the model produces skillful forecasts out to at least two-hour leads despite gaps in the training data.
This document includes the Supplementary Information