Description of neural network model (version 3.5)
A neural network (NN) model has been used by our group to predict the
SST anomalies in the Nino3.4 region in the equatorial Pacific (Tang et
al. 2000). By version 3.1, the NN model has been extended to forecast
the SST anomalies over the whole tropical Pacific. In version 3.2, the
subsurface temperature anomalies in the tropical Pacific ocean were
included as predictors. In version 3.3, we have
removed the subsurface temperature anomalies as predictors, since their
record is relatively short (available only since 1980), and they are not
always available on time for our monthly forecasts. We have, however,
introduced a Bayesian NN in version 3.3 (see
Wu et al. 2006
for details).
The data used in version 3.3 came from two datasets: (a) the
monthly sea level pressure (SLP) on 2.5° × 2.5° grids
from the NCEP/NCAR reanalysis (Kalnay et al. 1996; downloadable from
ftp.cdc.noaa.gov/Datasets/ncep.reanalysis.derived/surface);
(b) the monthly extended reconstructed sea surface temperature on 2°
× 2° grids (ERSST version 2; Smith and Reynolds 2004).
By Jan. 2010, ERSST version 2 was no longer available. We therefore had
to retrain our Bayesian NN model (version 3.4) using ERSST version 3 (downloadable from http://www.ncdc.noaa.gov/oa/climate/research/sst/ersstv3.php; Smith et al. 2008).
Relative to version 3.3 (Wu et al. 2006), the most current version
(version 3.5) has several changes:
(1) The data used to train and
validate models has been extended (Jan. 1948- Dec. 2009).
(2) A more
rigorous cross-validation method has been implemented to verify forecast
skills -- namely, data from five consecutive calendar years were held
for validation/verification, while data from the remaining years were
used as training data. Only the 3rd year in the 5-year window was
forecasted by the trained model. The 5-year window was moved forward by
one year each time until independent forecast verification was done for
all years.
(3) Among the training data, only 2/3 of the data were
randomly selected (in one-year blocks) and used to train the NN
models. The remaining 1/3 data were used as testing data for model
selection. NNs with 1, 2 and 3 hidden neurons were trained separately
and validated on the testing data, and the one model with highest correlation
skill was selected. This procedure was repeated 100 times and the
ensemble average from these 100 models was used to give the final
forecast.
Cross-validation forecast correlation skills by Bayesian NN model (with
corresponding value from multiple linear regression given in parenthesis)
for the Nino4, Nino3.4, Nino3 and Nino1+2 regions and for lead times of
3, 6, 9, and 12 months (with definition of lead time same as in
Wu et al. 2006) are given in the table below:
| Nino4 | Nino3.4 | Nino3 | Nino1+2 |
3 month | 0.9109 (0.9100) | 0.9051 (0.9048) | 0.8734
(0.8754) | 0.7997 (0.8022) |
6 month | 0.7657 (0.7615) | 0.7387 (0.7366) | 0.6810
(0.6799) | 0.5524 (0.5457) |
9 month | 0.6320 (0.6283) | 0.5960 (0.5921) | 0.5364
(0.5322) | 0.3500 (0.3377) |
12 month | 0.4899 (0.4882) | 0.4737 (0.4703) | 0.4413
(0.4367) | 0.2583 (0.2486) |
References:
Kalnay, E., et al., 1996.
The NCEP/NCAR 40 year reanalysis project. Bull. Amer. Meteorol. Soc.,
77, 437-471.
Smith, T.M., and R.W. Reynolds, 2004. Improved extended reconstruction
of SST [1854-1997]. Journal of Climate, 17, 2466-2477.
Smith, T.M., R.W. Reynolds, Thomas C. Peterson, and Jay Lawrimore, 2008.
Improvements to NOAA's historical merged land-ocean surface temperature
analysis (1880-2006). Journal of Climate, 21, 2283-2296.
Tang, B., W.W. Hsieh, A.H. Monahan and F.T. Tangang, 2000. Skill
comparisons between neural networks and canonical correlation analysis
in predicting the equatorial Pacific sea surface
temperatures. J.Climate, 13: 287-293.
Wu, A., W.W. Hsieh and B. Tang, 2006. Neural network forecasts of the
tropical Pacific sea surface temperatures. Neural Networks. 19: 145-154.
doi:10.1016/j.neunet.2006.01.004.
(preprint
in PDF).