Constructed Analogue Prediction of the East Central Tropical Pacific SST for 1999 into 2000



contributed by Huug van den Dool



Climate Prediction Center, NOAA, Camp Springs, Maryland



Because natural analogues are highly unlikely to occur in high degree-of-freedom processes, we may benefit from constructing an analogue having greater similarity than the best natural analogue. As described in Van den Dool (1994), the construction is a linear combination of past observed anomaly patterns in the predictor fields such that the combination is as close as desired to the base. Here, we forecast the future SST anomaly in the Niño 3.4 region (5N-5S, 120-170W) of the tropical Pacific. We use as our predictor (the analogue selection criterion) the first five EOFs of the global SST field at four consecutive 3-month periods prior to forecast time. Predictor and predictand data extending from 1955 to the present are used for a priori skill evaluation.
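As an illustration of how such a predictor could be assembled, the following Python sketch computes the leading EOFs of a seasonal anomaly field by singular value decomposition and concatenates the first five principal components of four seasons into one 20-element predictor vector per year. It is a minimal sketch under stated assumptions, not the operational procedure: the function name leading_pcs, the random stand-in data, the grid size, and the choice to compute EOFs separately for each season without area weighting are all hypothetical.

```python
import numpy as np

def leading_pcs(anomalies, n_modes=5):
    """Return the first n_modes principal components and EOFs.

    anomalies: array of shape (n_years, n_gridpoints).
    Assumption: no area weighting is applied (a real analysis likely would).
    """
    centered = anomalies - anomalies.mean(axis=0)      # remove the time mean
    _, _, vt = np.linalg.svd(centered, full_matrices=False)
    eofs = vt[:n_modes]                                # (n_modes, n_gridpoints)
    pcs = centered @ eofs.T                            # (n_years, n_modes)
    return pcs, eofs

# Hypothetical stand-in data: 43 years, 200 grid points, 4 consecutive seasons.
rng = np.random.default_rng(1)
n_years, n_grid = 43, 200
seasons = [rng.standard_normal((n_years, n_grid)) for _ in range(4)]

pcs0, eofs0 = leading_pcs(seasons[0])
# One row per year: 5 EOF amplitudes for each of the 4 seasons = 20 numbers.
predictors = np.hstack([leading_pcs(field)[0] for field in seasons])
```

Each row of predictors then plays the role of one year's "SST state" in the analogue construction.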

For a given base time (previous ones extending back to 1956, or the current real forecast ending with DJF 1998/99), a linear combination is made of the first five EOFs of global SST from all 42 years (excluding the base year), so as to match the SST pattern of the base time. This is done using multiple regression, with each year's SST state as a predictor to which a weight is assigned; the weights are determined by inverting the 42 x 42 (available years) covariance matrix. These weights are then applied to the Niño 3.4 SST that subsequently occurred in the predictand period of each of those past years, forming the forecast for the base year's predictand period. Note that the predictand is not involved in the construction process. The constructed analogue is the same linear combination for all leads, i.e., the weights are persisted, and it can be applied to predictands other than Niño 3.4.
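The construction step can be sketched in Python as follows. This is an illustrative sketch, not the operational code: the function names, the random stand-in data, and the number of leads are hypothetical. The small ridge term is also an assumption added here, because with a 20-element predictor (five EOFs at four seasons) the 42 x 42 matrix of inner products between years has rank of at most 20 and cannot be inverted without some regularization.

```python
import numpy as np

def constructed_analogue_weights(past, base, ridge=1e-3):
    """Weights w such that past.T @ w approximates base.

    past: (n_years, n_features) predictor states of the past years.
    base: (n_features,) predictor state of the base time.
    ridge: assumed relative regularization (not specified in the text).
    """
    gram = past @ past.T                          # n_years x n_years inner products
    rhs = past @ base                             # similarity of each year to the base
    lam = ridge * np.trace(gram) / gram.shape[0]  # scale ridge by the mean diagonal
    return np.linalg.solve(gram + lam * np.eye(gram.shape[0]), rhs)

def constructed_analogue_forecast(weights, past_predictand):
    """Apply the same (persisted) weights to what followed in each past year."""
    return weights @ past_predictand

# Hypothetical stand-in data: 42 past years, 20 predictor elements, 6 leads.
rng = np.random.default_rng(0)
past = rng.standard_normal((42, 20))
base = rng.standard_normal(20)
nino34_after = rng.standard_normal((42, 6))       # predictand at each lead

w = constructed_analogue_weights(past, base)
reconstruction = past.T @ w                       # the constructed analogue itself
forecast = constructed_analogue_forecast(w, nino34_after)
```

Note that nino34_after does not enter the computation of w, in line with the statement that the predictand is not involved in the construction, and that the same w is used for every lead.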

Additional detail about the constructed analogue method is given in Van den Dool (1994), which shows that constructed analogues usually outperform natural analogues (such as they are) in specification mode (i.e. "forecasting" one meteorological variable from another, contemporaneously). This advantage may also be expected to occur in real forecasting, as long as the (linear) construction does not compromise the physics of the system too much. A constructed analogue yields a single linear operator, derived from data, by which the system can be propagated forward in time. This is methodologically related to POP analysis and linear inverse modeling. The skill of the constructed analogue method in forecasting SST is discussed in Van den Dool and Barnston (1995).

The current constructed analogue forecasts for Niño 3.4 out to 1.5 years lead are shown in Fig. 1, using data through Feb 1999. The expected cross-validated skill is also shown (dashed; right-hand scale). The SST anomaly observed during DJF 1998/99 is plotted as the earliest "forecast" value. For the early leads JFM and FMA, the observed SST for DJF enters into the plotted forecast with weights of 2/3 and 1/3, respectively, providing continuity with the known initial condition (DJF).
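The blending of the observed DJF value into the first two leads can be written out as a short Python sketch. The 2/3 and 1/3 weights on the observation are from the text; giving the remaining 1/3 and 2/3 to the constructed analogue value is an assumption, and the function name and the numbers in the example are hypothetical, not the actual forecast values.

```python
def blend_with_initial_condition(observed_djf, ca_forecasts):
    """Blend the observed DJF anomaly into the earliest leads.

    ca_forecasts: constructed analogue values in lead order (JFM, FMA, MAM, ...).
    Assumption: the plotted value is obs_wt * observation + (1 - obs_wt) * CA.
    """
    obs_weights = [2.0 / 3.0, 1.0 / 3.0]          # JFM, FMA (from the text)
    plotted = list(ca_forecasts)
    for lead, obs_wt in enumerate(obs_weights):
        if lead < len(plotted):
            plotted[lead] = obs_wt * observed_djf + (1.0 - obs_wt) * ca_forecasts[lead]
    return plotted

# Hypothetical numbers for illustration only.
plotted = blend_with_initial_condition(-1.5, [-1.2, -0.9, -0.6])
```

With these made-up inputs the JFM and FMA values are pulled toward the observation, while MAM and later leads are left unchanged.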

A closer look at the skill of the constructed analogue method is provided by Fig. 2 in the June 1996 issue of this Bulletin (p. 73). The skill is competitive with that of other empirical as well as dynamical methods (Barnston et al. 1994). An evaluation over 1996-98 (Barnston et al. 1999) shows CA, CCA and CLIPER to be the clear frontrunners among the empirical methods, and they continue to be competitive with the dynamical NCEP and COLA models. Forecasts for late fall through winter tend to be most skillful, while summer forecasts have lower skill. While skill (dashed line in Fig. 1) generally decreases with lead time, the dependence on the target season is sometimes a stronger factor.
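A cross-validated skill estimate of this general kind could be computed along the following lines: each year in turn is treated as the base, an analogue is constructed from the remaining years, and the resulting hindcasts are correlated with what was observed. This is a simplified sketch. The exact cross-validation design is not described here, so withholding a single year is an assumption, as are the function names, the ridge term, and the synthetic data (in which the predictand is made a noisy linear function of the predictor so that the example has skill to find).

```python
import numpy as np

def ca_weights(past, base, ridge=1e-3):
    """Constructed analogue weights with an assumed small ridge term."""
    gram = past @ past.T
    lam = ridge * np.trace(gram) / gram.shape[0]
    return np.linalg.solve(gram + lam * np.eye(gram.shape[0]), past @ base)

def leave_one_out_skill(predictors, predictand):
    """Correlation between hindcasts and observations, one year withheld at a time."""
    n_years = predictors.shape[0]
    hindcasts = np.empty(n_years)
    for i in range(n_years):
        keep = np.arange(n_years) != i            # exclude the base year
        w = ca_weights(predictors[keep], predictors[i])
        hindcasts[i] = w @ predictand[keep]
    return np.corrcoef(hindcasts, predictand)[0, 1], hindcasts

# Synthetic stand-in data: 43 years, 20 predictor elements, one target season.
rng = np.random.default_rng(2)
predictors = rng.standard_normal((43, 20))
beta = rng.standard_normal(20)                    # hypothetical linear link
predictand = predictors @ beta + 0.1 * rng.standard_normal(43)

skill, hindcasts = leave_one_out_skill(predictors, predictand)
```

Repeating this for each lead and target season would give a skill curve of the type plotted as the dashed line; with real, autocorrelated data more than one year may need to be withheld.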

The current strong cold event (La Niña) is forecast to decay and end by AMJ99, but no transition to positive Niño 3.4 anomalies is forecast for any time in 1999. Values close to -1.0C persist from JJA99 through winter 99/00, and one could debate whether next winter will be another cold event. At this point in the annual cycle, skill is at its lowest.

Although the forecast is for negative Niño 3.4 anomalies, it is too simplistic to say that the forecast is for the opposite of a warm event. Inspection of the climate state in terms of SST EOFs shows that winter 97/98 was extreme in EOF#1, while winter 98/99 appears to have peaked in EOF#2, 3 and 4. Put another way: Niño 3.4 is a compromise index that must describe both warm events, with SST anomalies from the dateline to the S. American coast, and cold events, which have SST anomalies both east and west of the dateline. Perhaps next winter, if it is a cold event, will be different still, in which case a single index would be insufficient.



Table 1 provides information about the role of each of the past years in the construction process for the current forecasts. The inner product (IP) shows the degree of similarity (or, if negative, similarity to the opposite) of this year's predictor periods to those of the other years on the global domain. The weights (Wt), on the other hand, show the contribution of each year's pattern to the constructed analogue. The inner products and the weights, while similar, are not proportional, because co-linearity among years is accounted for. For example, when two past years have the same kind of similarity to the base, both are not needed; one of them may be assigned an appropriately high weight, leaving the other with little to contribute. The weights have changed somewhat, but not dramatically, from 3 months ago (see the December issue), as should be the case if CA is to be skillful in making forecasts. Indeed, CA has been above average in accuracy on both the onset and the maturing of the current cold event.
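The difference between inner products and weights can be seen in a toy Python example with three made-up "years", two of which are identical. The library, the base state and all variable names are hypothetical; the scaling of the inner products to a sum of absolute values of 100 follows the caption of Table 1, and the use of a minimum-norm least-squares solution is an assumption made here to handle the exact duplication.

```python
import numpy as np

# Hypothetical toy library: years A and B are identical, year C is independent.
past = np.array([
    [1.0, 0.0, 0.0],   # year A
    [1.0, 0.0, 0.0],   # year B (duplicate of A)
    [0.0, 1.0, 0.0],   # year C
])
base = np.array([1.0, 1.0, 0.0])

# Inner products: similarity of each year to the base, scaled to sum(|IP|) = 100.
inner = past @ base
ip_scaled = 100.0 * inner / np.abs(inner).sum()

# Regression weights: solve past.T @ w = base (minimum-norm least squares).
weights, *_ = np.linalg.lstsq(past.T, base, rcond=None)
```

All three years have the same inner product with the base, but the two duplicate years must share the weight that year C receives alone. The minimum-norm solution used here splits the shared weight evenly; another solver could just as well give it all to one of the two, which is the situation described in the text.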

The most important positive (+) and negative (-) contributors to the description of the global SST over the last four seasons (MAM98 to DJF98/99; denoted as 1999) are, in chronological order, 1957(-), 1959(+), 1963(+), 1966(--), 1968(--), 1969(-), 1981(-), 1984(+), 1987(-), 1988(++), 1989(++) and 1996(+). Interdecadal variability (e.g. negatives before 1980, positives in the 1980s and 1990s) is suggested in the time series of the weights and appears more clearly in the inner products. The current global SST has a positive correlation with the global SST fields in all years since 1980 (except 1987). Although ENSO clearly dominates the interdecadal variability at this time, the trends are still potent. The cluster 1966, 1968 (denoting the MAM67-DJF67/8 period), and 1969 is very heavily negatively weighted. While the ENSO situation definitely enters into the analogue selection (more strongly so at the moment than generally), non-ENSO processes (remember, global SST EOFs are used) also determine the weighting process and the resulting forecast. The weights for 1984, 1988 and 1989 are so high and positive that, at present, to some good approximation the CA system can be reduced to a natural analogue system.

All anomalies are relative to the 1961-90 base period.

Barnston, A.G., H.M. van den Dool, S.E. Zebiak, T.P. Barnett, M. Ji, D.R. Rodenhuis, M.A. Cane, A. Leetmaa, N.E. Graham, C.F. Ropelewski, V.E. Kousky, E.A. O'Lenic and R.E. Livezey, 1994: Long-lead seasonal forecasts-Where do we stand? Bull. Amer. Meteor. Soc., 75, 2097-2114.

Barnston, A. G., M. H. Glantz and Yuxiang He, 1999: Predictive skill of statistical and dynamical climate models in SST forecasts during the 1997/98 El Niño episode and the 1998 La Niña onset. Bull. Amer. Meteor. Soc., 80, 217-243.

van den Dool, H.M., 1994: Searching for analogues, how long must we wait? Tellus, 46A, 314-324.

van den Dool, H.M. and A.G. Barnston, 1995: Forecasts of global sea surface temperature out to a year using the constructed analogue method. Proceedings of the 19th Annual Climate Diagnostics Workshop, Nov. 14-18, 1994, College Park, Maryland, 416-419.

Table 1. Inner products (IP; scaled such that sum of absolute values is 100) and weights (Wt; from multiple regression) of each of the years to construct an analogue to the sequence of 4 consecutive 3-month periods defined as the base (currently the string MAM98, JJA98, SON98 and DJF98/99). Years are labeled by the middle month of the last of the four consecutive predictor seasons. 1998 is not yet used as a candidate analogue because long lead forecasts are not possible beyond the latest observations.
Year   IP   Wt    Year   IP   Wt    Year   IP   Wt    Year   IP   Wt
  56   -1    0      67   -3   -9      78   -6  -12      89    5   37
  57   -3  -15      68   -5  -26      79   -2   10      90    3   10
  58   -2    1      69   -5  -19      80    0  -14      91    3    4
  59    0   16      70    3   11      81    2  -18      92    1    4
  60   -3   -1      71    2   14      82    4    7      93    0   -2
  61    0   14      72   -2   11      83    0   -8      94    1  -14
  62    0    0      73   -2   -3      84    3   29      95    0  -10
  63    2   18      74    2    6      85    2   -8      96    4   18
  64   -1   -8      75   -4   -6      86    0  -14      97    3    5
  65   -3   -5      76   -1   -3      87   -1  -18
  66   -6  -23      77   -4   -9      88    3   38



Fig. 1. Time series of constructed analogue forecasts (solid line) for Niño 3.4 SST based on the sequence of four consecutive 3-month periods ending in Feb 1999. The dashed line indicates the expected skill (correlation) based on historical performance for 1956-96. The x-axis represents the target period. The left y-axis (solid line) shows the SST forecast; the right y-axis (thin dashed line) shows the skill. The observation is shown instead of the constructed analogue specification for the initial state DJF 1998/99, and this observation also contributes by decreasing amounts to the JFM and FMA99 plotted values (see text).