26  Linear Models with Heterogeneous Coefficients

26.1 Linearity and Heterogeneity

26.1.1 Models with Homogeneous Slopes

We begin our journey where standard textbooks and first-year foundational courses in econometrics leave off. The “standard” linear models considered in such courses often assume homogeneity in individual responses to covariates (e.g., Hansen (2022)). A common cross-sectional specification is:

\[ y_i = \bbeta'\bx_i + u_{i}, \tag{26.1}\] where \(i=1, \dots, N\) indexes cross-sectional units.

In panel data, models often include unit-specific \((i)\) and time-specific \((t)\) intercepts while maintaining a common slope vector \(\bbeta\):

\[ y_{it} = \alpha_i + \delta_t + \bbeta'\bx_{it} + u_{it}. \tag{26.2}\]

26.1.2 Heterogeneity in Slopes.

However, modern economic theory rarely supports the assumption of homogeneous slopes \(\bbeta.\) Theoretical models recognize that observationally identical individuals, firms, and countries can respond differently to the same stimulus. In a linear model, this requires us to consider more flexible models with heterogeneous coefficients:

  1. Cross-sectional model (26.1) generalizes to

    \[ y_i = \bbeta_{i}'\bx + u_i. \tag{26.3}\]

  2. Panel data model (26.2) generalizes to

    \[ y_{it} = \bbeta_{it}'\bx_{it} + u_{it}. \tag{26.4}\]

Such models are worth studying, as they naturally arise in a variety of contexts:

  • Structural models with parametric restrictions: Certain parametric restrictions yield linear relationships in coefficients. An example is given by firm-level Cobb-Douglas production functions where firm-specific productivity differences induce heterogeneous coefficients (Combes et al. (2012); Sury (2011)).

  • Binary covariates and interaction terms: if all covariates are binary and all interactions are included, a linear model encodes all treatment effects without loss of generality (see, e.g., Wooldridge (2005)).

  • Log-linearized models: Nonlinear models may be approximated by linear models around a steady-state. For example, Heckman and Vytlacil (1998) demonstrate how the nonlinear Card (2001) education model simplifies to a heterogeneous linear specification after linearization.

26.2 Mean Group Estimator

Pesaran and Smith (1995) show that simple averages of the estimated coefficients (known as mean group, MG, estimates) result in consistent estimates of the underlying population means of the parameters when the time-series dimension of the data is sufficiently large. Whilst it is not possible to be sure about when \(T\) is sufficiently large, Monte Carlo evidence suggests that reliable estimates can be obtained with \(T\ge 30\) and \(N\ge 20,\) when output growth is not very persistent.

References

Card, David. 2001. Estimating the Return to Schooling: Progress on Some Persistent Econometric Problems.” Econometrica 69 (5): 1127–60. https://doi.org/10.1111/1468-0262.00237.
Combes, Pierre Philippe, Gilles Duranton, Laurent Gobillon, Diego Puga, and Sébastien Roux. 2012. The Productivity Advantages of Large Cities: Distinguishing Agglomeration From Firm Selection.” Econometrica 80 (6): 2543–94. https://doi.org/10.3982/ecta8442.
Hansen, Bruce. 2022. Econometrics. Princeton University Press.
Heckman, James, and Edward Vytlacil. 1998. Instrumental variables methods for the correlated random coefficient model.” Journal of Human Resources 33 (4): 974–87.
Pesaran, M. Hashem, and Ron P. Smith. 1995. Estimating long-run relationships from dynamic heterogeneous panels.” Journal of Econometrics 6061: 473–77.
Sury, Tavneet. 2011. Selection and Comparative Advantage in Technology Adoption.” Econometrica 79 (1): 159–209. https://doi.org/10.3982/ecta7749.
Wooldridge, Jeffrey M. 2005. Fixed-effects and related estimators for correlated random-coefficient and treatment-effect panel data models.” The Review of Economics and Statistics 87 (May): 385–90.