2000 West African June to September Rainfall :

Experimental Statistical Forecasts Based On April

Values of Regional Predictors

contributed by Nathalie Philippon et Bernard Fontaine

Centre de Recherche de Climatologie, 21000 Dijon - FRANCE

Forecasts of July-September 2000 West African rainfall have been performed at the Centre de Recherche de Climatologie (CNRS, Dijon, France), using only the information available by the end of April, through three complementary statistical techniques : the Multiple Linear Regression (MLR) and the Linear Discriminant Analysis (LDA) as in Folland et al. (1991), and neural networks (Hagan et al, 1996).

Data:

The predictands (June-September cumulated rainfall) refer to a large West African region (17.5N-5N; 17.5W-17.5E) using the 2.5 latitude., 3.75 longitude grid box rainfall database developed by Hulme and Jones (1993). They consist on one hand, of a regional Sahel index computed over the domain bounded by the latitudes 17.5N-10N and longitudes 15W-15E, and, on the other hand, in 41 local indexes covering all the West African region mentioned above. The Sahelian region suffering from a significant reduction in total rainfall since the end of the 1960's, the statistical models have only been trained over the years 1968-1998. For convenience we have expressed our forecasts as percentages of the 1961-1990 mean.

In order to be able to propose a >useful= forecast (and not the best one), the predictors document only the near-surface information available on the web by the end of April, 2 months before the beginning of the season to predict. April data have been arranged into 2 pools : the first set includes 27 potential tropical (30N-30S) SST indexes using the GISST6 observed database analysed on a 5x5 degrees grid (Bottomley et al, 1990); the second group is composed of 50 potential regional (15W-15E; 25N-5S) atmospheric indexes (near-surface humidity, moist static energy, and geopotential values) for better documenting the land/sea surface conditions in the West African/eastern Atlantic sector just at the beginning of the northward excursion of the rainbelt into the continent. The atmospheric data has been extracted from the NCEP/NCAR reanalysis (Kalnay et al, 1996) and the selected indexes describe the atmospheric field in terms of local values and of zonal and meridional gradients. This approach was retained as recent works of the authors (Fontaine et al, 1999; Fontaine et Philippon, 2000) underline the existence of significant signals in the NCEP/NCAR air mass energy content in northern spring : the hindcast skills are significantly improved when selected predictors of regional scale describing the monsoon air mass energy are included.

Methods:

Concerning the MLR model, the predictand value R is given in the regression equation

where bn are the regression coefficients and Xn the predictors. Thanks to a stepwise procedure maximizing the explained variance, we limited the number of predictors entering the MLR model to four. Besides, the discriminant model and the neural networks were fitted using the same four MLR selected predictors to assess the robustness of the forecasts. The LDA model is based on three rainfall categories Qi (Dry, Normal and Wet) each of which has a prior probability pi of 0.33. Using Bayes= probability theorem, the LDA computes the posterior probability of observing a category Qi when a predictor vector x (here composed of 4 variables x) is given. These probabilities are noted:

where fi(x) is the probability density of x when Qi is known. The denominator being the same for all the i groups, the model has to find pifi(x) maximal. As regards the neural method, we use a single hidden layer feed-forward backpropagation network. The activation functions for the two hidden and the single output units are respectively log-sigmod w/ï and linear. The weights initial value was randomly chosen whereas the biases were set to one. The three models have been trained using the jack-knife procedure which is particularly useful for small data sets and allows assess to the real skill of a forecast model before it is actually applied.

Results:

1- The regional Sahelian index:

Over the period 1968-1998, the correlation coefficient obtained between the observed and predicted rainfall rises 0.84 (table 1) with the MLR model (RMSE of 0.53). Among the four retained predictors, 3 indexes document the continental surface state: AHUM@ designates the specific humidity content at 17.5N, over the longitudes 5E-15E; AMSE1" and AMSE2" refer to Moist Static Energy gradients along the 10W meridian, between the Guinean shore (1) or the Sahel band (2) and the Sahara margins. They reflect the preseason meridional arrangement of energy over West Africa which is of prime importance for the monsoon excursion onto the continent. The last selected index, AATL@, documents a surface temperature gradient over the tropical Atlantic southeastern basin. This conforms to the fact that April-June SST anomalies in the south tropical Atlantic are associated with July-September west African rainfall (see, among others, Zheng et al., 1999).

The coefficients (underlined when R>0.75) between local West African observed and predicted rainfall using 4 predictors selected among the 2 different sets detailed in columns:

The forty-one local (2.5 lat., 3.75 long.) multiple regression forecasts are presented in figure 1, using 4 April predictors selected among the second set (table 3). On the whole, the JAS 2000 season is predicted as normal (which is in agreement with the global Sahel index forecast); we see however the northeastern parts would know drier conditions.

In April 2000, the above mentioned predictors show the following anomalies: ATL=-0.015, HUM=0.142, MSE1=0.13, MSE2=0.99. Except MSE2, no predictor exhibits a strong anomaly, so, the models forecast a normal rainfall season for the Sahel (table 2).

2- West African more local indexes :

Table 3 presents, in terms of correlation coefficients between the observed and predicted rainfall (over the training 1968-1998 period), some improvements obtained by selecting the most 4 efficient April predictors for different latitudinal bands and predictor sets. Comparing the columns, we can see that the forecasts become more confident when the near-global sea-surface temperatures and regional land-surface predictors are merged : in this context (last column) more than 55% of the observed July-September rainfall variance can be explained using only the April data. The improvements are particularly notable for the Sahel area, where the rainfall amounts are less well depicted by the SST anomalies. These scores are of course improved when the May-June information is added (more than 70% of variance is explained, Philippon and Fontaine, 2000).

Acknowledgements. The authors thank Mike Hulme (University of Norwich, England) for providing the global analyzed monthly precipitation database, the United Kingdom Meteorological Office and the National Center for Atmospheric Research (USA) for providing respectively the global analyzed monthly SST database and the NCEP/NCAR outputs. They also thank Isabelle Poccard, CRC, for downloading the 2000 April values.

References:

Bottomley M, Folland CK, Hsiung J, Parker DE, 1990: Global ocean surface temperature atlas, Meteorological Office and Massachusetts Institute of Technology, 313 plates.

Folland CK, Owen J, Ward N, Colman A, 1991: Prediction of seasonal rainfall in the Sahel region using empirical and dynamical methods, Journal of forecasting, 10, p. 21-56.

Fontaine B, Philippon N, Camberlin P, 1999: An improvement of June-September rainfall forecasting in the Sahel based upon region April-May moist static energy content (1968-1997), Geophysical Research Letters, 26, p 2041-2044.

Fontaine B, Philippon N, 2000: seasonal evolution of boundary layer content in the west African monsoon from the NCEP/NCAR reanalysis (1968-1998), accepted for publication in Int. Journ. of Climatology.

Hagan MT, Demuth HB, Beale MH, 1996: Neural network design, PWS Publishing Company, Boston.

Hulme M, Jones PD, 1993: A historical monthly precipitation data set for global land areas: application for climate monitoring and climate model evaluation. Analysis methods of precipitation on a global scale, report of a GEWEX Workshop, 14-17 September 1992, Koblenz, WMO/TD 558, Geneva, pp. A/14-A/17.

Kalnay E, Kanamitsu M, Kistler R, Collins W, Deaven D, Gandin L, Iredell M, Saha S, White G, Woollen J, Zhu Y, Chelliah M, Ebisuzaki W, Higgins W, Janowiak J, Mo KC, Ropelewski C, Wang J, Leetmaa A, Reynolds R, Jenne R, Joseph D, 1996 : The NCEP/NCAR 40-year reanalysis project, Bull. of the American Met. Society, 77, p 437-471.

Philippon N, Fontaine B, 1999: A new statistical predictability scheme for July-September Sahel rainfall (1968-1994), Comptes Rendus de Acadmie des Sciences, 329, p 1-6.

Philippon N, Fontaine B, 2000: statistical predictability study for July-September Sahelian rainfall (1968-1998) and application to the 1999 summer rainfall forecast, XXVth Assembly of the European Geophysical Society, Nice, 24-27 April 2000.

Zheng X, Eltahir EAB, Emanuel KA, 1999: A mechanism relating tropical Atlantic spring sea surface temperature and west African rainfall, Q.J.R. Meteorol. Soc., 125, P 1129-1164.

Table 1: Total (Ctotal) and partial (Cpartial) correlation, and regression coefficients (R) between the standardized values of predictors and the global Sahel index (17.5N-10N/15W-15E) over the period 1968-1998.

 

 

 

ATL

HUM

MSE1

MSE2

All

Ctotal

0.56

0.35

0.57

-0.03

0.84

Cpartial

0.6

0.58

0.7

-0.46

###

R

0.41

0.42

0.56

-0.31

###

Table 2: 2000 July-September rainfall forecasts of the Sahelian region; MLR and NN forecasts are expressed as percentages of the mean rainfall values registered over the period 1961-1990.

MLR models with the 4 predictors above mentioned (a)

and NN Model (b)

% of 1961-1990 mean

(mean = 337,6 mm)

(a) 97

(b) 96

 

 

LDA

 

 

 

 

 

 

Dry

Normal

Wet

Probability in each category

0.39

0.46

0.15

Table 3: Over each latitudinal band, averaged correlation coefficients (underlined when R>0.75) between local West African observed and predicted rainfall using 4 predictors selected among the 2 different sets detailed in columns:

 

 

 

 

17,5oN

Set 1:

27 April

SST indexes

(30N-30S)

Set 2:

April air mass

energy content

indexes (25N-5S;

15W-15E) + Set1

15o

0.66

0.77

12,5o

0.67

0.79

10o

0.67

0.77

7,5o

0.71

0.79

5oN

0.70

0.74

Figure 1: 2000 JAS rainfall forecasts for each 2.5*3.75 West African box using the four best predictors in Set 2 (forecasts are expressed in % of 1961-1990 mean). VW for Very Wet, W for Wet, N for Normal, D for Dry and VD for Very Dry, we have adopted the same scale as the Agrhymet Center. The crosshatched areas indicate missing rainfall information; non underlined forecasts must be considered carefully because of models explaining less than 56% of the rainfall variance or of forecasts not consistent with the surrounding regions.