A Dynamic Panel Data Approach to the Forecasting of the GDP of German Länder
ABSTRACT Abstract In this paper, we make multi-step forecasts of the annual growth rates of the real GDP for each of the 16 German Länder simultaneously. We apply dynamic panel models accounting for spatial dependence between regional GDP. We find that both pooling and accounting for spatial effects help to improve the forecast performance substantially. We demonstrate that the effect of accounting for spatial dependence is more pronounced for longer forecasting horizons (the forecast accuracy gain is about 9% for a 1-year horizon and exceeds 40% for a 5-year horizon). We recommend incorporating a spatial dependence structure into regional forecasting models, especially when long-term forecasts are made.
-
Citations (0)
-
Cited In (0)
Page 1
A dynamic panel data approach to the forecasting of the GDP of German L¨ ander¶
Konstantin A. Kholodilin∗
Boriss Siliverstovs∗∗
Stefan Kooths§
December 18, 2007
Abstract
In this paper, we make multi-step forecasts of the annual growth rates of the real GDP for each of the 16
German L¨ ander (states) simultaneously. Beside the usual panel data models, such as pooled and fixed-effects
models, we apply panel models that explicitly account for spatial dependence between regional GDP. We
find that both pooling and accounting for spatial effects helps substantially improve the forecast performance
compared to the individual autoregressive models estimated for each of the L¨ ander separately. More impor-
tantly, we have demonstrated that effect of accounting for spatial dependence is even more pronounced at
longer forecasting horizons (the forecast accuracy gain as measured by the root mean squared forecast error is
about 9% at 1-year horizon and exceeds 40% at 5-year horizon). Hence, we strongly recommend incorporating
spatial dependence structure into regional forecasting models, especially, when long-term forecasts are made.
Keywords: German L¨ ander; forecasting; dynamic panel model; spatial autocorrelation.
JEL classification: C21; C23; C53.
¶We are very grateful to J. P. Elhorst for providing his Matlab code and for patiently explaining us all the necessary details
concerning his model. We also are grateful to C. Dreger and to the three anonymous referees whose comments on the earlier draft
of the paper led to substantial improvements.
∗DIW Berlin, Mohrenstr. 58, 10117 Berlin, Germany,kkholodilin@diw.de
∗∗DIW Berlin, Mohrenstr. 58, 10117 Berlin, Germany,bsiliverstovs@diw.de
§DIW Berlin, Mohrenstr. 58, 10117 Berlin, Germany,skooths@diw.de
Page 2
Contents
1Introduction4
2Data properties6
3 Dynamic panel data models7
4Estimation results 11
5Forecasting performance 13
6Summary15
References 16
Appendix 20
2
Page 3
List of Tables
1 Descriptive statistics of the growth rates of real GDP of the German L¨ ander (%) . . . . . . . . .20
2Estimation results 1993 - 2006. . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 21
3 Forecasting performance, all L¨ ander 2002-2006. . . . . . . . . . . . . . . . . . . . . . . . . . . . 22
3
Page 4
1Introduction
The current political and economic situation in Europe can be characterized by two major trends. On the
one hand, political and especially economic integration on the international level is taking place. On the other
hand, the regions, of which nations are comprised, are gaining in importance and autonomy. Hence, there is an
increased need for reliable forecasts in order to support decision-making processes at the regional level.
This is particularly true for a federal country like Germany, where regional heterogeneity primarily manifests
itself in the distinction between the Eastern and Western L¨ ander (singular Land) due to the legacy of the past
as well as in substantial differences in the economic structure within each group. This implies that regional
forecasts might diverge from the forecasts made for the whole country, which hence cannot serve as a meaningful
guide for decision making at the regional level.
In this paper, we forecast the annual growth rates of GDP for each of the 16 German L¨ ander. To the
best of our knowledge, this is the first attempt in the literature that addresses this question for all German
L¨ ander simultaneously as most of the studies attempt to forecast German GDP on the aggregate level. These
studies include Langmantel (1999), Hinze (2003), Dreger and Schumacher (2004), Mittnik and Zadrozny (2004),
Kholodilin and Siliverstovs (2005), and Schumacher (2005), among others, who use several variants of the fore-
casting methodology of Stock and Watson (2002) based on diffusion indices in order to predict the developments
in the German GDP. At the same time, there are two studies that construct forecasts for the individual German
L¨ ander, Bandholz and Funke (2003) and Dreger and Kholodilin (2006) who forecast the GDP of Hamburg and
of Berlin, respectively, again, using the diffusion indices.
The fact that GDP data for individual German L¨ ander are not available on a quarterly basis severely reduces
the post-re-unification data base to 16 annual observations for the period from 1991 to 2006. This may explain
the small number of studies aiming at forecasting German GDP on the L¨ ander level.
In this paper, we circumvent the problem of data collection for each regional entity by pooling the annual
growth rates of GDP into a panel and correspondingly utilizing panel data models for forecasting. The advan-
tages of such a pooling approach for forecasting have been widely demonstrated in a series of articles for diverse
data sets such as Baltagi and Griffin (1997); Baltagi et al. (2003) — for gasoline demand, Baltagi et al. (2000)
— for cigarette demand, Baltagi et al. (2002) — for electricity and natural gas consumption, Baltagi et al.
4
Page 5
(2004) — for Tobin’s q estimation, and Br¨ ucker and Siliverstovs (2006) — for international migration, among
others.
In addition to pooling, accounting for spatial interdependence between regions may prove beneficial for the
purposes of forecasting. Spatial dependence implies that due to spillover effects (e.g., commuter labor and trade
flows). Neighboring regions may have similar economic performance and hence the location matters. However,
the number of studies that illustrate the usefulness of accounting for (possible) spatial dependence effects across
cross sections in the forecasting exercise is still limited. For example, Elhorst (2005), Baltagi and Li (2006),
and Longhi and Nijkamp (2007) demonstrate forecast superiority of models accounting for spatial dependence
across regions using data on demand for cigarettes from states of the USA, demand for liquor in the American
states, and German regional labor markets, respectively. However, only Longhi and Nijkamp (2007) conduct
quasi real-time forecasts for period t + h (h > 0) based on the information available in period t. On the other
hand, the forecasts made in Elhorst (2005) and Baltagi and Li (2006) are not real-time forecasts, since they
take advantage of the whole information set that is available in the forecast period, t + h.
Applications of panel data models accounting for spatial effects for the forecasting of regional GDP are even
more limited. To our knowledge, there is only one paper treating this issue, namely that of Polasek et al. (2007),
who make long-term forecasts of the GDP of 99 Austrian regions, but do not evaluate their accuracy in a formal
way.
Thus, the main contribution of this paper is the construction of GDP forecasts for all German L¨ ander
simultaneously. Our additional contribution to the literature is that in order to make forecasts of regional GDP
we employ panel data models that allow not only for temporal interdependence in the regional growth rates,
but also take into account their spatial interdependence. The advantage of our approach is that it is suited to
conduct forecasts in the real time. We also demonstrate the usefulness of our approach by formal methods.
The paper is structured in the following way. In section 2 the data are described. Section 3 presents different
econometric forecasting models. In section 4 the estimation results are reported, whereas section 5 evaluates
the forecasting performance of alternative models. Finally, section 6 concludes.
5
Page 6
2Data properties
For our estimation and forecasting we use annual real GDP data of the 16 German L¨ ander. The data cover the
period 1991-2006 and can be downloaded from the webpage of the Statistical Office of Baden-W¨ urtemberg (Ar-
beitskreis Volkswirtschaftliche Gesamtrechnungen der L¨ ander). The data are seasonally adjusted and expressed
in terms of chain indices with the base year 2000.
Before we estimate the models described in Section 3, we take a look at the descriptive statistics of the data
under consideration displayed in Table 1. In this table, the basic descriptive statistics of the growth rates of
real GDP in form of the mean, maximum, minimum, and the standard deviation are summarized at three levels
of aggregation: for all German L¨ ander, separately for the Western L¨ ander group and for the Eastern L¨ ander
group as well as for each of the L¨ ander individually.
The specific economic dynamics of the Eastern L¨ ander in the first half of the 1990s reflect the re-unification
growth effect that was mainly driven by expansionary government interventions (see Vesper (1998) and Bach and
Vesper (2000) for a detailed analysis of fiscal policies during this period). The market-oriented transformation
of the formerly centrally planned economy in Eastern Germany and the rebuilding of the infrastructure in the
Eastern L¨ ander implied public per capita spending that was far above the Western level from the start and
rising until the mid-1990s (from 128% in 1992 to 145% in 1995). This expansionary government program was
fuelled by both extensive transfers from West to East (starting at 65 billion DM in 1991 and peaking at 118
billion DM in 1996) and deficit spending in the Eastern L¨ ander whose per capita debt quickly approached the
Western levels (within the first 5 years the ratio rose from 11% to 90%). Furthermore, due to tax privileges
(like special depreciation allowances) the construction sector boomed in the first decade with the East-West
ratio of per capita investment more than doubled from 67% in 1991 to 180% in 1996 (residential construction
more than tripled in the same period). The special factors that heavily influenced the catching-up process lost
momentum after 1995 (no further increase, but stagnation or even decrease of the indicators). Therefore, we
have chosen to split the whole period 1992-2006 into two sub-periods: from 1992 till 1995 and from 1996 till
2006.
In the first sub-period the growth rates of the Eastern L¨ ander were much higher than those of the Western
L¨ ander. After 1995, this difference has vanished such that in the second sub-period real GDP growth rates in
6
Page 7
both groups became very similar. Precise figures on the magnitude of the difference in growth rates of Eastern
and Western L¨ ander for the period from 1992 till 1995 can be found in Table 1. In this period, in all Eastern
L¨ ander, excluding Berlin, the mean growth rates of real GDP was about 10% per annum, such that the average
growth rate computed for all Eastern L¨ ander is 8.7, which is about 17 times higher than the average growth
rate of 0.5 reported for the Western L¨ ander in this sub-period. Furthermore, from 1992 till 1995 there were
no negative growth rates in any of the Eastern L¨ ander, whereas the Western L¨ ander experienced the negative
growth rates of real GDP during this sub-period.
In the second sub-period (from 1996 till 2006), the growth rates of real GDP become more or less similar in
both L¨ ander groups. The mean growth rates are of about the same magnitude (0.9 for the Eastern L¨ ander vs
1.4 for the Western L¨ ander), with virtually the same standard deviation of 1.4 and 1.6, respectively.
This marked difference between the magnitude of real GDP growth rates in the Eastern and Western L¨ ander
in the first sub-period and the fact that it vanished in the second sub-period prompts us to introduce a step
dummy variable in our regression models that takes the value of one in the period from 1992 up to and including
1995 and the value of zero otherwise. Observe that this step dummy is applicable only for the six Eastern L¨ ander
as namely for those L¨ ander the properties of real GDP growth rates are drastically different in both sub-periods.
As far as the properties of the real GDP growth rates in the Western L¨ ander are concerned, we assume that
those did not change over the whole period.
We have chosen to interact this step dummy with the autoregressive coefficient in our regression models in
order to account for the fact that the persistence in real GDP growth rates in the Eastern L¨ ander in the first
period seems to be much more pronounced than in the second sub-period.
3Dynamic panel data models
In this section we describe the econometric models that we are using for forecasting the growth rates of real
GDP of the German L¨ ander. In these otherwise standard models we include the re-unification boom dummy
that takes into account specific macroeconomic dynamics of the Eastern L¨ ander in the first half of the 1990s.
In this paper, we examine a standard set of dynamic panel data (DPD) models starting with individual
autoregressive (AR) models, which can be considered as a particular case of DPD models with unrestricted
7
Page 8
parameters, through fixed-effects models, which impose homogeneity restrictions on the slope parameters, to
pooled models, which impose homogeneity restrictions on both intercept and slope parameters. In addition
to standard fixed-effects and pooled models, we also consider fixed-effects and pooled models that account for
spatial dependence.
As a benchmark model, with which all other models will be compared, we use a linear individual AR(1)
model (IOLS) and estimate it for each L¨ ander separately:
yit= αi+ βi1yit−1+ βi2Iityit−1+ εit
εit∼ N.I.D.(0,σ2
i)(1)
where yitis the annual growth rate of real GDP of i-th Land; Iitis a step dummy, which from now on will be
referred to as a re-unification boom dummy. The dummy is defined as follows:
1992-19951996-2006
Eastern L¨ ander
Iit= 1
Iit= 0
Western L¨ ander
Iit= 0
Iit= 0
In addition, given the short time dimension of our data, it should be noted that the OLS estimator of
the parameters of individual AR(1) models is biased due to insufficient degrees of freedom as pointed out in
Ramanathan (1995).
The next model we consider is the pooled panel (POLS or PGMM depending on the estimation method)
model:
yit= α + β1yit−1+ β2Iityit−1+ εit
εit∼ N.I.D.(0,σ2)(2)
which imposes the homogeneity restriction on both intercept and slope coefficients across all the L¨ ander.
An alternative model is the fixed-effects (FEOLSor FEGMM) model that allows for region-specific intercepts:
yit= αi+ β1yit−1+ β2Iityit−1+ εit
εit∼ N.I.D.(0,σ2)(3)
The fixed-effects model represents an intermediate case between the individual (IOLS) and pooled panel (POLS
and PGMM) models. It is not too restrictive as the pooled model, which assumes equal average growth rates in
8
Page 9
all L¨ ander, and yet allows to take advantage of panel dimension. From the economic point of view, fixed effects
capture differences in growth rates between L¨ ander related to their heterogeneous economic structure.
Additionally, we consider the following two types of models that account for spatial correlation that might
exist between the L¨ ander. One may expect to find the dynamic (stagnating) L¨ ander being the neighbors of
dynamic (stagnating) L¨ ander due to cross-border spillovers (commuter labor and trade flows).
The spatial dependence is accounted for using an N × N matrix of spatial weights W, which is based on
the existence of common borders between the L¨ ander1. The typical element of this matrix, wij, is equal to 1 if
two corresponding L¨ ander have a common border, and 0 otherwise. All the elements on the main diagonal of
matrix W are equal to zero. The constructed weights matrix is normalized such that all the elements in each
row sum up to one.
First, we model the spatial dependence by means of spatial lags of the dependent variable. We examine both
pooled and fixed-effects versions of this model. The pooled spatial Durbin model (PSDM
MLE) can be written
as follows:
yit= α + β1yit−1+ β2Iityit−1+ ρ
N
?
j=1
wijyjt+ εit
εit∼ N.I.D.(0,σ2)(4)
The fixed-effects spatial Durbin model (FESDM
MLE) is
yit= αi+ β1yit−1+ β2Iityit−1+ ρ
N
?
j=1
wijyjt+ εit
εit∼ N.I.D.(0,σ2)(5)
where ρ is the spatial autoregressive parameter and N is the number of L¨ ander.
The second type of models addresses spatial correlation through a spatial autoregressive error structure,
as suggested by Elhorst (2005). Again, we distinguish between pooled and fixed-effects models. Due to their
specific nature, those models are estimated by the Maximum Likelihood method. The pooled spatial error model
1We also have considered a matrix of spatial weights based on the distance between the capitals of the L¨ ander. Following
Baumont et al. (2002) we constructed four distance-decay weights matrices depending on four different distance cutoff values: first
quartile, median, second quartile, and maximum distance. However, the forecast accuracy of the models based on these weights
matrices was generally inferior to that of the models with a weights matrix based on common borders. Therefore, in order to save
space we chose not to report the corresponding results but we make them available upon request.
9
Page 10
(PSEM
MLE) has the following form:
yit= α + β1yit−1+ β2Iityit−1+ uit
uit= λ?N
j=1wijujt+ εit
εit∼ N.I.D.(0,σ2)(6)
The fixed-effects spatial error model (FESEM
MLE) can be expressed as:
yit= αi+ β1yit−1+ β2Iityit−1+ uit
uit= λ?N
j=1wijujt+ εit
εit∼ N.I.D.(0,σ2)(7)
where λ is the coefficient of spatial error autoregression.
We have estimated IOLS, POLS, and FEOLSusing the OLS method. It is known from the literature that
in the context of dynamic panel data models the OLS estimator is subject to simultaneous equation bias. In
order to address this problem we have used the GMM estimator of Arellano and Bond (1991) to estimate the
fixed-effects model without spatial autoregressive lags. Notice that the GMM estimator uses the first-difference
transformation, which omits the time-invariant variables (in our case, the L¨ ander-specific intercepts). These
were recovered using the following two-step procedure. In the first step, the slope parameters are estimated
using the first differences of the data. In the second step, the estimated parameters are plugged into the equation
for the levels of data and the fitted values are calculated. The fixed effects for the FEGMMmodel are obtained
as the L¨ ander-specific averages of difference between actual and fitted values.
Although from the theoretical perspective, the GMM estimators should be preferred to the OLS estimators
when applied to dynamic panels with small time dimension, in what follows we use the OLS estimators, since in
the forecasting context a biased but stable estimator may still deliver a more accurate forecasting performance
than an unbiased but unstable one2.
The remaining PSDM
MLE, FESDM
MLE, PSEM
MLEand FESEM
MLEmodels were estimated using the Maximum Likelihood
method as implemented in the Matlab codes of Paul Elhorst.
2We thank an anonymous referee for pointing this out.
10
Page 11
4 Estimation results
The estimates of the temporal and spatial autoregressive coefficients of all the models are presented in Table
2. At first, we report a summary of the estimates of the temporal autoregressive coefficient ? α1obtained for a
model estimated for each Land separately. The results of this exercise reveal quite large heterogeneity in the
obtained values. For all 16 L¨ ander considered, the minimum value of the autoregressive coefficient estimate is
−0.186 and the maximum is 0.230, while the median value is 0.052.
We also report a summary of the estimates of the interaction between the re-unification boom dummy and
the growth rates in Eastern regions,?
the corresponding estimates lie in the interval from 0.379 to 0.780 with a median value of 0.647. Such values
β2, computed in the regression for each Eastern Land. The magnitudes of
of the estimated coefficient support our observation — made earlier in Section 2 — that the persistence in real
GDP growth rates in the Eastern L¨ ander was much higher during the period from 1992 till 1995 than that
during the period from 1996 till 2006.
Note that the individual autoregressive models seem to provide a rather poor fit to the data as the values of
the R2often lie near zero. The corresponding median is 0.038. This is most likely due to the rather short period
used in the estimation as well as the rather low persistence in the real GDP growth rates for the majority of
observations.
The next two columns of Table 2 contain the estimation results obtained for the pooled model (equation
2) and for the fixed-effects model (equation 3) using OLS. For the POLS model, the estimated value of the
autoregressive coefficient is 0.162, which is significant at the 1% level, whereas for the FEOLS model the
corresponding value is 0.064 and is significant at the 5% level.
The value of the estimate of the interaction term between re-unification boom dummy and growth rates in
Eastern L¨ ander,?
even at the 1% level in both cases. Thus, our estimation results are concordant with those obtained from the
β2, is 0.589 and 0.653 for the pooled and the fixed-effects models, respectively. It is significant
individual autoregressions that the persistence in growth rates of real GDP in the Eastern L¨ ander was much
higher during 1992-1995 than during 1996-2006.
In the fixed-effects FEGMM model estimated by GMM, the autoregressive coefficient is much lower in
absolute magnitude and is very close to zero. In contrast, the coefficient of the re-unification boom dummy is
11
Page 12
of similar magnitude compared to the models estimated by OLS. The appropriateness of this GMM estimator
is illustrated by the following specification tests. The Sargan test has a value of 15.44 with a p-value of 1.000,
which implies that the null of the instruments’ validity cannot be rejected. The significance of the AR(1) test
(test statistic -2.482, p-value 0.013) and insignificance of the AR(2) test (test statistic -1.487, p-value 0.137)
suggest that the assumption of serially uncorrelated errors is not violated.
Finally, the last four columns of Table 2 contain the parameter estimates of the models that allow for spatial
effects. The first two models are spatial Durbin models. The estimates of the autoregressive parameters and
the interaction term between re-unification boom dummy and growth rates in Eastern regions for these models
are quite similar to those obtained for the models without spatial effects. In addition, the estimates of spatial
autoregressive coefficients, ? ρ, of the pooled and fixed-effects models are 0.097 (p=0.110) and 0.119 (p=0.062),
respectively. Nevertheless, for the forecasting purposes, Elhorst (2005) recommends using the models that
account for spatial dependence even if spatial autocorrelation coefficient is not significantly different from zero.
The last two models, PSEM
MLEand FESEM
MLE, are those with spatially correlated errors. The estimates of the
autoregressive and interaction term between re-unification boom dummy and growth rates in Eastern L¨ ander
parameters are close to those obtained in the corresponding panel models with and without spatial effects
estimated by OLS. In contrast to the spatially autoregressive models, the spatial error models point to a strong
positive and statistically significant spatial correlation as measured by the coefficient λ. Indeed,?λ takes values
of 0.607 and 0.577, respectively, depending on whether we allow for fixed effects or not. Therefore, one would
expect that accounting for spatial effects using this type of model will result in an increased forecasting accuracy
compared to the models without spatial effects.
To summarize, on the basis of our estimation results we conclude the following. First, the growth rates
of real GDP of the German L¨ ander exhibit rather low temporal dependence, except for the period from 1992
till 1995, when the Eastern L¨ ander enjoyed exceptionally high growth rates. Our interaction term between
re-unification boom dummy and growth rates in Eastern regions introduced to capture this effect turns out to
be positive and highly significant in all models that we considered in this paper. Second, the growth rates of
real GDP exhibit substantial spatial dependence in the current period. Hence, it remains to check whether
allowance for this spatial dependence will result in improved forecasts of regional GDP growth rates.
12
Page 13
5Forecasting performance
For each model we forecast recursively the h-year growth rates of real GDP, ∆hyi,t+h= yi,t+h− yitfor h =
1,2,...,5 for all 16 L¨ ander over the forecasting period encompassing 5 years from 2002 up to 2006. This
procedure gives us (5 − (h − 1)) × N forecasts for the h-year growth rate.
For each model, the parameter estimates were obtained using an expanding window of observations. Thus,
the first estimation period is 1993-2001, based on which the forecasts of ∆1yi,2002,∆2yi,2003,...,∆5yi,2006are
made. Next, the model is re-estimated for the period 1993-2002 and the forecasts ∆1yi,2003,∆2yi,2004,...,∆4yi,2006
are computed, etc.
For all models, except spatial Durbin models, the forecasts were made in a standard way. The forecasts of
the spatial Durbin models are conducted using the two-step procedure3. In order to illustrate this procedure,
it is worthwhile re-writing the spatial Durbin models (4) and (5) in the following matrix form for the pooled:
y = αıNT+ β1y−1+ β2(D ? y−1) + ρWy + ε
(8)
for the fixed-effects versions:
y = (ıT⊗ IN)α + β1y−1+ β2(D ? y−1) + ρWy + ε
(9)
where y is a NT ×1 vector of the yitstacked by year and region such that the first N observations refer to the
first year, etc. Correspondingly, y−1is a NT × 1 vector of the yi,t−1stacked by year and region. The matrix
D is a NT × 1 matrix which structure corresponds to the re-unification boom dummy Iitreported in Table 3.
Then, (D ? y−1) denotes the interaction term between the re-unification boom dummy and the growth rates in
Eastern regions, where ? is the Hadamar product, or element-by-element multiplication operator. IN, IT, and
INT are the unit matrices with dimensions N × N, T × T, and NT × NT, respectively. The NT × NT matrix
W = IT⊗ W is the block-diagonal matrix with the N × N matrix W of spatial weights on its main diagonal,
where is ⊗ a Kronecker product. ıNT and ıT are the NT and T unit vectors, respectively, such that α and α
are correspondingly a common intercept and an N × 1 vector of cross-section specific intercepts in the pooled
3The authors thank two anonymous referees for drawing our attention to this forecasting procedure.
13
Page 14
and the fixed-effects spatial Durbin models.
The models (8) and (9) can be re-written in the following reduced form:
(INT− ρW)y
=
αıNT+ β1y−1+ β2(D ? y−1) + ε
y
=(INT− ρW)−1[αıNT+ β1y−1+ β2(D ? y−1)] + (INT− ρW)−1ε
(10)
(INT− ρW)y
=(ıT⊗ IN)α + β1y−1+ β2(D ? y−1) + ε
y
=(INT− ρW)−1[(ıT⊗ IN)α + β1y−1+ β2(D ? y−1)] + (INT− ρW)−1ε
(11)
where only the past values of y appear on the right-hand side of the equations.
The multi-step ahead forecasts from the spatial Durbin models can now be obtained as follows. First, we
estimate the parameters of the models (8) and (9), as outlined above. Second, we use the reduced form equations
(10) and (11) in order to generate the forecasts.
The results of our forecasting exercise are reported in Table 3. The forecasting performance is measured by
the total root mean square forecast error (Total RMSFE) calculated for all years and over all regions for each
forecasting horizon, h = 1,2,...,5. The individual autoregressive model serves as a benchmark, to which the
forecasting performance of all other models is compared. Hence, the relative total RMSFE measures the gains
in forecasting accuracy from pooling and from accounting for spatial dependence.
The results of our forecasting exercise further strengthen the evidence previously reported in a number of
studies such as Baltagi and Griffin (1997); Baltagi et al. (2003), Baltagi et al. (2000), Baltagi et al. (2002),
Baltagi et al. (2004), and Br¨ ucker and Siliverstovs (2006), among others, that pooling helps to improve forecast
accuracy. As seen, the pooled OLS model produces an RMSFE that is about 9% lower than that reported
for the individual AR(1) models. Allowing for the presence of fixed effects, however, does not lead to further
improvements in forecast accuracy. The likely reason is that the relatively short time span of our data impedes
on a precise estimation of region-specific intercepts. An F-test for the absence of fixed effects does not reject
the null hypothesis (F(15,208) = 1.091 with p-value equal 0.366).
Despite their theoretical appeal, the fixed-effects model estimated by GMM produces even poorer forecasting
14
Page 15
performance. Possible reason is that the GMM has poor finite sample properties, as documented in a number
of Monte Carlo studies (see Arellano and Bond (1991), Kiviet (1995), Ziliak (1997), and Alonso-Borrego and
Arellano (1999)).
As expected, the application of pooled and fixed-effects models accounting for spatial effects results in a
better forecast accuracy compared not only to the benchmark model but also to the pooled and fixed-effects
models, which do not take into account spatial effects.
The largest forecast accuracy gain is achieved when the pooled models are used. The best forecast perfor-
mance is delivered by PSEM
MLEclosely followed by PSDM
MLE. This ranking remains the same over all forecasting
horizons. More importantly, the relative forecast accuracy improvement with respect to the benchmark model
increases with longer forecasting horizon. For example, at h = 1 the ratio of total RMSFE of PSEM
MLEand PSDM
MLE
with respect to that of the benchmark model is 0.906 and 0.908, respectively, which represents forecast accuracy
improvement of 9.4% and 9.2%, correspondingly. At the forecasting horizon h = 5, this improvement constitutes
40.4% and 36.8%, respectively. The POLSmodel is ranked as the third best model where the similar pattern is
also observed to somewhat lesser degree — the corresponding relative RMSFE at h = 1 is 0.915 and at h = 5 is
0.680. Observe that the forecasting accuracy of the pooled models accounting for the spatial dependence also
gets larger relatively to the pooled model without spatial effects as forecasting horizon increases. Thus, the
ratios of the total RMSFE of the POLSmodel with that of the PSEM
MLEat the 1-year and the 5-year forecasting
horizons are 0.989 and 0.876, respectively.
The fixed-effects models with and without spatial effects can be ranked according to their forecast accuracy as
follows: FESDM
MLE, FESEM
MLE, FEOLS, and FEGMM. This ranking remains unchanged across all forecast horizons.
Again, the two best models are those accounting for spatial dependence. Their forecasting performance relative
to that of the fixed-effects models without spatial effects also tends to increase with longer forecast horizons.
6 Summary
In this paper, we have addressed the forecasting of h-year growth rates of real GDP for of each of the 16 German
L¨ ander using dynamic panel data models, h = 1,2,...,5. Based on the results of Bach and Vesper (2000), we
account for the exceptional behavior of the Eastern L¨ ander economies during the re-unification boom in the
15
View other sources
Hide other sources
-
Available from Boriss Siliverstovs · 18 Dec 2012
-
Available from ualg.pt