3. Cross-country growth regressions: from theory to empirics
The stylized facts of economic growth have led to two major themes in the development of formal econometric analyses of growth. The first theme revolves around the question of convergence: are contemporary differences in aggregate economies transient over sufficiently long time horizons? The second theme concerns the identification of growth determinants: which factors seem to explain observed differences in growth? These questions are closely related in that each requires the specification of a statistical model of cross-country growth differences from which the effects on growth of various factors, including initial conditions, may be identified.
In this section, we describe how statistical models of cross-country growth differences have been derived from theoretical growth models.Section 3.1 provides a general theoretical framework for understanding growth dynamics. The framework is explicitly neoclassical and represents the basis for most empirical growth work; even those studies that have attempted to produce evidence in favor of endogenous or other alternative growth theories have generally used the neoclassical model as a baseline from which to explore deviations. Section 3.2 examines the relationship between this theoretical model of growth dynamics and the specification of a growth regression. This transition from theory to econometrics produces the canonical cross-country growth regression.
3.1. Growth dynamics: basic ideas
For economy i at time t, let Yi,t denote output, Li,t the labor force (assumed to obey
The important empirical implication of Equation (8) is that, in a cross-section of countries, we should observe a negative relationship between average rates of growth and initial levels of output over any time period - countries that start out below their balanced growth path must grow relatively quickly if they are to catch up with other countries that have the same levels of steady-state output per effective worker and initial efficiency.
This is closely related to the hypothesis of conditional convergence, which is often understood to mean that countries converge to parallel growth paths, the levels of which are assumed to be a function of a small set of variables.12 Note, however, that a negative coefficient on initial income in a cross-country growth regression does not automatically imply conditional convergence in this sense, because countries might instead simply be moving toward their own different steady-state growth paths.3.2. Cross-country growth regressions
[1] We provide formal definitions of convergence in Section 4.1.
Mankiw, Romer and Weil assume that Ai,0 is unobservable and that g + δ is known. These assumptions mean that (14) is linear in the logs of various observable variables and therefore amenable to standard regression analysis.
Mankiw, Romer and Weil argue that Ai,0 should be interpreted as reflecting not just technology, which they assume to be constant across countries, but country-specific influences on growth such as resource endowments, climate and institutions. They assume these differences vary randomly in the sense that
Using data from a group of 98 countries over the period 1960 to 1985, Mankiw, Romer and Weil produce regression estimates of β = -0.299, a = 0.48 and φ = 0.23.13 14,15 Mankiw, Romer and Weil are unable to reject the overidentifying restrictions present in (16). While this result is echoed in studies such as Knight, Loayza and Villanueva (1993), other authors, Caselli, Esquivel and Lefort (1996), for example, are able to reject the restrictions.
Many cross-country regression studies have attempted to extend Mankiw, Romer and Weil by adding additional control variables Zi to the regression suggested by (16). Relative to Mankiw, Romer and Weil, such studies may be understood as allowing for predictable heterogeneneity in the steady-state growth term gi and initial technology term Ai,0 that are assumed constant across i in (16). Formally, the gi — β log Ai,0 terms

in (6) are replaced with g - β log A + πZi - βei rather than with g - β log A - βei which produced (16). (As far as we know, empirical work universally ignores the fact that log(ni + g + 5) should also be replaced with log(ni + gi + 5).) This produces the cross country growth regression
c√ -U /7)
The regression described by (17) does not identify whether the controls Zi are correlated with steady-state growth gi or the initial technology term Ai 0. For this reason, a believer in a common steady-state growth rate will not be dissuaded by the finding that particular choices of Zi help predict growth beyond the Solow regressors. Nevertheless, it seems plausible that the controls Zi may sometimes function as proxies for predicting differences in efficiency growth gi rather than in the initial technology Ai 0. As argued in Temple (1999), even if all countries have the same total factor productivity (TFP) growth in the long run, over a twenty- or thirty-year sample the assumption of equal TFP growth is highly implausible, so the variables in Zi can explain these differences. That being said, the attribution of the predictive content of Zi to initial technology versus steady state growth will entirely depend on a researcher’s prior beliefs.
It is possible that proper accounting of the log(ni + gi + 5) term would allow for some progress in identifying gi versus Ai 0 effects since gi effects would imply a nonlinear relationship between Zi and overall growth γi; however this nonlinearity may be too subtle to uncover given the relatively small data sets available to growth researchers.The canonical cross-country growth regression may be understood as a version of (17) when the cross-coefficient restrictions embedded in (17) are ignored (which is usually the case in empirical work). A generic representation of the regression is
where Xi contains a constant, log(ni∙ + g + 5), logsκ,i and logsH,i. The variables spanned by log yi, î and Xi thus represent those growth determinants that are suggested by the Solow growth model whereas Zi represents those growth determinants that lie outside Solow’s original theory.16 The distinction between the Solow variables and Zi is important in understanding the empirical literature. While the Solow variables usually appear in different empirical studies, reflecting the treatment of the Solow model as a baseline for growth analysis, choices concerning which Zi variables to include vary greatly.
Equation (18) represents the baseline for much of growth econometrics. These regressions are sometimes known as Barro regressions, given Barro’s extensive use of such
[1] We distinguish log yi 0 from the other Solow variables because of the role it plays in analysis of convergence; see Section 4 for detailed discussion.
regressions to study alternative growth determinants starting with Barro (1991). This regression model has been the workhorse of empirical growth research.17 In modern empirical analyses, the equation has been generalized in a number of dimensions.
Some of these extensions reflect the application of (18) to time series and panel data settings. Other generalizations have introduced nonlinearities and parameter heterogeneity. We will discuss these variants below.3.3. Interpreting errors in growth regressions
Our development of the relationship between cross-country growth regressions and neoclassical growth theories illustrates the standard practice of adding regression errors in an ad hoc fashion. Put differently, researchers usually derive a deterministic growth relationship and append an error in order to capture whatever aspects of the growth process are omitted from the model that has been developed. One problem with this practice is that some types of errors have important implications for the asymptotics of estimators. Binder and Pesaran (1999) conduct an exhaustive study of this question, one important conclusion of which is that if one generalizes the assumption of a constant rate of technical change so that technical change follows a random walk, this induces nonstationarity in many levels series, raising attendant unit root questions.
Beyond issues of asymptotics, the ad hoc treatment of regression errors leaves unanswered the question of what sorts of implicit substantive economic assumptions are made by a researcher who does this. Brock and Durlauf (2001a) address this issue using the concept of exchangeability. Basically, their argument is that in a regression such as (18), a researcher typically thinks of the errors εi as interchangeable across observations: different patterns of realized errors are equally likely to occur if the realizations are permuted across countries. In other words, the information available to a researcher about the countries is not informative about the error terms.
Exchangeability is a mathematical formalization of this idea and is defined as follows. For each observation i, there exists an associated information set Fi available to the researcher.
In the growth context, Fi may include knowledge of a country’s history or culture as well as any “economic” variables that are known. A definition of exchangeability (formally, F -conditional exchangeability) is
17 Such regressions appear to have been employed earlier by Grier and Tullock (1989) and Kormendi and Meguire (1985). The reason these latter two studies seem to have received less attention than warranted by their originality is, we suspect, due to their appearance before endogenous growth theory emerged as a primary area of macroeconomic research, in turn placing great interest on the empirical evaluation of growth theories. To be clear, Barro’s development is original to him and his linking of cross-country growth regressions to alternative growth theories was unique.
Many criticisms of growth regressions amount to arguments that exchangeability has been violated. For example, omitted regressors induce exchangeability violations as these regressors are elements of F. Parameter heterogeneity also leads to nonexchangeability. For these cases, the failure of nonexchangeability calls into question the interpretation of the regression. This is not always the case; heteroskedasticity in errors violates exchangeability but does not induce interpretation problems for coefficients.
Brock and Durlauf argue that exchangeability produces a link between substantive social science knowledge and error structure, i.e. this knowledge may be used to evaluate the plausibility of exchangeability. They suggest that a good empirical practice would be for researchers to question whether the errors in a model are exchangeable, and if not, determine whether the violation invalidates the purposes for which the regression is being used. This cannot be done in an algorithmic fashion, but as is the case with empirical work quite generally, requires judgments by the analyst. See Draper et al. (1993) for further discussion of the role of exchangeability in empirical work.
3.