Multiple Regression and Discriminant Analysis to Predict
Mar-Apr-May-Jun 1999 Rainfall in Northeast Brazil
contributed by Larry Greischar and Stefan Hastenrath
University of Wisconsin, Madison, Wisconsin
In the approach used at the University of Wisconsin to forecast March-to-June precipitation in the Nordeste, several predictors are used with stepwise multiple linear regression, linear discriminant analysis, and neural networking. The predictand includes 27 selected stations in the Nordeste (Hastenrath and Greischar, 1993), shown in Fig. 1. The forecasts shown here are made at one month lead-i.e., using data no later than January 1999. The potential predictors of March-June Nordeste rainfall (not all of which are necessarily used in a given prediction model) are listed in Table 1 along with their correlations with the predictand over the training period, 1921-57. Correlation coefficients are in hundredths, with one or two asterisks indicating significance at the 5% and 1% levels, respectively.
As shown above and described in Hastenrath and Greischar (1993), March-to-June rainfall in the Nordeste is correlated positively with the first and negatively correlated with all the other predictors. This pre-season, predictor values have been: (1) near average pre-season rain, (2) negative (i.e. northerly wind anomaly), (3) negative Pacific SST, (4) negative tropical Atlantic SST index (i.e. cold/warm SST anomalies to the North/South of the Atlantic equator), and (5) positive tropical Atlantic SST index for Nov-Dec-Jan. The meridional gradient of Atlantic SST, meridional component of surface wind, and Pacific SST indicate a wetter than normal March-June 1999, while the Atlantic SST index for Nov-Dec-Jan points to dry conditions.
Table 2 shows skill evaluations for eight prediction models, each using a different combination of predictors listed above. The eighth model uses neural networking rather than stepwise multiple linear regression.
The models indicate above average March-June Nordeste precipitation for 1999. The 1912-56 historical average mean and standard deviation for the 27 stations are 500 and 200 mm, respectively.
Using the same sets of predictors, Nordeste rainfall was also predicted using linear discriminant analysis, in which five equiprobable categories of rainfall amount are defined and associated with predictor values using Bayes' theorem. In an individual forecast each of the categories is assigned a probability, given the pre-season values of the predictors. Table 11-2 in the March 1995 issue of this Bulletin shows, for predictor models 5 and 7, the five-by-five verification matrices obtained over the 1958-89 period using the earlier years to develop the models. The hit rates are 0.34 and 0.38, corresponding to Heidke skills of 0.18 and 0.22 and expected correlation skills of about 0.55 and 0.65. The quintile probability forecasts for Mar-Apr-May-Jun 1999 using each of these predictor models are listed in Table 3. Both modes 5 and 7 show a maximum likelihood of a wetter than average 1999.
In summary, the pre-season Nordeste rainfall, the meridional gradient of Atlantic SST, the meridional component of surface wind, and the Pacific SST point to above average March-June 1999 precipitation, while the Atlantic SST index for Nov-Dec-Jan favors drier conditions. Quantitatively, based on the various SMR, NN and LDA models, we predict above average precipitation for the 1999 rainy season (MJ index +0.1 to +0.4, comparable to the years 1961, 63, 69, 78, 87).
Acknowledgments: This prediction exercise by LG and SH at the University of Wisconsin relied on various real-time data sources. The pre-season rainfall data in the Nordeste were provided by the Staff of FUNCEME; David Cullum, U.K. Meteorological Office, Bracknell, dispatched November-December 1998 and January 1999 SST data of the tropical Atlantic; NMC wind data of the tropical Atlantic and SST data of the equatorial Pacific for January 1999 were obtained from NOAA-CDC, Boulder, Colorado; and OLR data was obtained from the NOAA-CPC, Camp Springs Maryland. All of these contributions were crucial to the timely issue of the forecast.
References
Hastenrath, S. and L. Greischar, 1993: Further work on the prediction of northeast Brazil rainfall anomalies. J. Climate, 6, 743-758.
| (1) Oct-Jan precipitation at the 27 predictand stations | +55** |
| (2) An index of Jan meridiondal surface wind component over the tropical Atlantic, 30oN-30oS. | -35* |
| (3) An index of Jan SST in the equatorial Pacific. | -11 |
| (4) An index of Jan SST in the tropical Atlantic, 30oN-30oS | -57** |
| (5) An index of Nov-Dec-Jan SST in the tropical Atlantic, 30oN-30oS. | -70** |
Table 2 Skill (% Variance Explained)
#/Type Used 1921-57 1958-89 1968-89 Mar-Apr-May-Jun
1999 Table 2. Skill of eight prediction models for Mar-Apr-May-Jun Nordeste rainfall (expressed as percentage of predictand variance explained),
followed by the forecast standardized Nordeste rainfall anomaly for Mar-Apr-May-Jun 1999. The model type is SMR (stepwise multiple
regression) or NN (neural network), and predictors numbers (1)-(5) are as shown above. Table 3: Probability for each of the five equiprobable categories.
Model
Predictors
Training Period
Forecast Period 1
Forecast Period 2
Rainfall Forecast
1 SMR
(1)
30
35
49
+0.16 2 SMR
(1),(2)
38
49
69
+0.24 3 SMR
(1),(4)
49
52
66
+0.29 4 SMR
(1),(2),(4)
44
58
74
+0.22 5 SMR
(1),(2),(3),(4)
50
61
74
+0.39 6 SMR
(1),(3),(5)
62
61
71
+0.07 7 SMR
(3),(5)
56
58
62
+0.05 8 NN
(1),(2),(3),(4)
55
66
81
+0.43
Model
Predictors
Q1
Q2
Q3
Q4
Q5 5
(1),(2),(3),(4)
.05
.18
.19
.42
.16 7
(3),(5)
.09
.24
.21
.38
.08