How to use alpaca

Estimating a binary-choice model with individual and time fixed effects

In the following we utilize an example from labor economics to demonstrate the capabilities of feglm(). More precisely, we use a balanced micro panel data set from the Panel Study of Income Dynamics to analyze the intertemporal labor force participation of 1,461 married women observed for nine years. A similar empirical illustration is used in Fernández-Val (2009).

Before we start, we briefly inspect the data set to get an idea about its structure and potential covariates.

data(psid, package = "bife")
head(psid)

##    ID LFP KID1 KID2 KID3     INCH AGE TIME
## 1:  1   1    1    1    1 58807.81  26    1
## 2:  1   1    1    0    2 41741.87  27    2
## 3:  1   1    0    1    2 51320.73  28    3
## 4:  1   1    0    1    2 48958.58  29    4
## 5:  1   1    0    1    2 53634.62  30    5
## 6:  1   1    0    0    3 50983.13  31    6

ID and TIME are individual and time-specific identifiers, LFP is an indicator equal to one if a woman is in the labor force, KID1 to KID3 are the numbers of children in three different age groups, INCH is the annual income of the husband, and AGE is the age of the woman.

First, we use a specification similar to Fernández-Val (2009) and estimate a static model of female labor supply in which we control for individual and time-specific unobserved heterogeneity (so-called individual and time fixed effects).

library(alpaca)
stat <- feglm(
  LFP ~ KID1 + KID2 + KID3 + log(INCH) | ID + TIME,
  data   = psid,
  family = binomial("probit")
  )
summary(stat)

## binomial - probit link
## 
## LFP ~ KID1 + KID2 + KID3 + log(INCH) | ID + TIME
## 
## Estimates:
##            Estimate Std. error z value Pr(> |z|)    
## KID1      -0.676898   0.056301 -12.023   < 2e-16 ***
## KID2      -0.344370   0.049897  -6.902  5.14e-12 ***
## KID3      -0.007045   0.035344  -0.199     0.842    
## log(INCH) -0.234128   0.054403  -4.304  1.68e-05 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
## 
## residual deviance= 6069.65,
## null deviance= 8152.05,
## n= 5976, l= [664, 9]
## 
## ( 7173 observation(s) deleted due to perfect classification )
## 
## Number of Fisher Scoring Iterations: 7

As with glm(), the model summary provides detailed information about the coefficients and some information about the model fit (residual deviance and null deviance). Furthermore, we report statistics that are specific to fixed effects models. More precisely, we learn that only 5,976 of the 13,149 observations contribute to the identification of the structural parameters. This is indicated by the message that 7,173 observations are deleted due to perfect classification. In binary-choice models, these are the observations of women who never change their labor force participation status during the nine years observed, that is, women who were either always or never in the labor force.¹ Overall, the estimation results are based on 664 women observed for nine years.
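The idea behind perfect classification can be illustrated with a minimal sketch in base R. The data frame toy and its values are hypothetical and not part of psid, and the sketch only looks at the individual dimension; it does not reproduce the internal checks of feglm().

```r
# Toy panel (hypothetical data): three women observed for three periods.
toy <- data.frame(
  ID  = rep(1:3, each = 3),
  LFP = c(1, 1, 1,   # always in the labor force
          0, 0, 0,   # never in the labor force
          1, 0, 1)   # status changes at least once
)

# Share of periods each woman spends in the labor force.
share <- tapply(toy$LFP, toy$ID, mean)

# Only women with a share strictly between 0 and 1 are informative.
informative <- as.integer(names(share)[share > 0 & share < 1])
informative

# Number of observations left after dropping perfectly classified women.
n_kept <- sum(toy$ID %in% informative)
n_kept
```

In this toy panel only the third woman has variation in LFP, so only her three observations would remain.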

Because the coefficients themselves are not very meaningful, econometricians are usually interested in so-called partial effects (also known as marginal or ceteris paribus effects). A commonly used statistic is the average partial effect. alpaca offers a post-estimation routine to estimate average partial effects and their corresponding standard errors.²

apes.stat <- getAPEs(stat)
summary(apes.stat)

## Estimates:
##             Estimate Std. error z value Pr(> |z|)    
## KID1      -0.0880155  0.0077937 -11.293   < 2e-16 ***
## KID2      -0.0447776  0.0068196  -6.566  5.17e-11 ***
## KID3      -0.0009161  0.0050062  -0.183     0.855    
## log(INCH) -0.0304432  0.0077090  -3.949  7.85e-05 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
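To make the notion of an average partial effect more concrete, the following sketch computes it by hand for a plain probit without fixed effects, fitted to simulated data with glm(). The variables x and y and all parameter values are assumptions made for illustration. The sketch covers only a continuous regressor; how getAPEs() treats fixed effects, discrete regressors, and standard errors is not reproduced here.

```r
set.seed(1)

# Simulated data (hypothetical): one continuous regressor, probit outcome.
n <- 2000
x <- rnorm(n)
y <- as.integer(0.5 - 0.8 * x + rnorm(n) > 0)

fit <- glm(y ~ x, family = binomial("probit"))

eta  <- predict(fit, type = "link")  # linear index for each observation
beta <- coef(fit)["x"]

# Partial effect of x for observation i is dnorm(eta_i) * beta;
# the average partial effect is the mean over all observations.
ape_x <- mean(dnorm(eta)) * beta
ape_x
```

Because the standard normal density never exceeds about 0.4, the average partial effect is always smaller in absolute value than the coefficient itself, which matches the pattern in the two summaries above.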

A common obstacle to the use of non-linear fixed effects models in practice is the so-called incidental parameters problem (IPP), first described by Neyman and Scott (1948). Fortunately, for classical panel data sets, such as the one in this example, several asymptotic bias corrections that tackle the IPP already exist (see Fernández-Val and Weidner (2018) for an overview).³ Our package provides a post-estimation routine that applies the analytical bias correction derived by Fernández-Val and Weidner (2016).⁴ Technical details on how the bias correction is accelerated using the method of alternating projections can be found in Czarnowske and Stammann (2020).

stat.bc <- biasCorr(stat)
summary(stat.bc)

## binomial - probit link
## 
## LFP ~ KID1 + KID2 + KID3 + log(INCH) | ID + TIME
## 
## Estimates:
##            Estimate Std. error z value Pr(> |z|)    
## KID1      -0.596285   0.055528 -10.738   < 2e-16 ***
## KID2      -0.303346   0.049517  -6.126     9e-10 ***
## KID3      -0.006117   0.035211  -0.174  0.862081    
## log(INCH) -0.207061   0.053928  -3.840  0.000123 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
## 
## residual deviance= 6069.65,
## null deviance= 8152.05,
## n= 5976, l= [664, 9]
## 
## ( 7173 observation(s) deleted due to perfect classification )
## 
## Number of Fisher Scoring Iterations: 7

apes.stat.bc <- getAPEs(stat.bc)
summary(apes.stat.bc)

## Estimates:
##            Estimate Std. error z value Pr(> |z|)    
## KID1      -0.096501   0.007620 -12.664   < 2e-16 ***
## KID2      -0.049093   0.006766  -7.255  4.01e-13 ***
## KID3      -0.000990   0.004987  -0.198     0.843    
## log(INCH) -0.033510   0.007588  -4.416  1.00e-05 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
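The size of the correction can be read off by comparing the printed coefficients before and after biasCorr(). The short sketch below does this arithmetic with the values copied from the two model summaries above; the object names uncorrected, corrected, and ratio are chosen here for illustration only.

```r
# Coefficients copied from the summary() outputs of stat and stat.bc.
uncorrected <- c(KID1 = -0.676898, KID2 = -0.344370,
                 KID3 = -0.007045, "log(INCH)" = -0.234128)
corrected   <- c(KID1 = -0.596285, KID2 = -0.303346,
                 KID3 = -0.006117, "log(INCH)" = -0.207061)

# A ratio below one means the correction shrinks the estimate toward zero.
ratio <- corrected / uncorrected
round(ratio, 3)
```

In this example the bias correction shrinks every coefficient by roughly 12 to 13 percent.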

Although analytical bias corrections for static models are receiving increasing attention in applied work, it is less well known that they can also be used for dynamic models with fixed effects.

Before we can extend our static specification to a dynamic one, we first have to generate a lagged dependent variable.

library(data.table)
setDT(psid)
psid[, LLFP := shift(LFP), by = ID]
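What the grouped lag does can be seen on a small example. The sketch below builds the same kind of variable in base R on a hypothetical toy panel, using ave() in place of shift(). It assumes the rows are sorted by time within each ID, which the lag relies on.

```r
# Toy panel (hypothetical data): two women observed for three periods.
toy <- data.frame(
  ID   = rep(1:2, each = 3),
  TIME = rep(1:3, times = 2),
  LFP  = c(1, 0, 1, 0, 0, 1)
)

# The lag is only meaningful if rows are ordered by time within each ID.
toy <- toy[order(toy$ID, toy$TIME), ]

# Within each ID, move LFP down by one period; the first period has no
# predecessor and therefore becomes NA.
toy$LLFP <- ave(toy$LFP, toy$ID, FUN = function(x) c(NA, head(x, -1)))
toy

# One missing value per woman.
sum(is.na(toy$LLFP))
```

Each woman loses her first period, so a panel with 1,461 women ends up with 1,461 missing values in the lagged variable.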

In contrast to the bias correction for static models, we additionally need to provide a bandwidth parameter (L) that is required for the estimation of spectral densities (see Hahn and Kuersteiner (2011)). Fernández-Val and Weidner (2016) suggest conducting a sensitivity analysis that tries different values for L, but none larger than four. Note that in this case the order of the factors to be concentrated out, specified in the second part of the formula, is important (cross-sectional identifier first and time identifier second).

dyn <- feglm(
  LFP ~ LLFP + KID1 + KID2 + KID3 + log(INCH) | ID + TIME,
  data   = psid,
  family = binomial("probit")
  )
dyn.bc <- biasCorr(dyn, L = 1L)
summary(dyn.bc)

## binomial - probit link
## 
## LFP ~ LLFP + KID1 + KID2 + KID3 + log(INCH) | ID + TIME
## 
## Estimates:
##           Estimate Std. error z value Pr(> |z|)    
## LLFP       1.01607    0.04759  21.350   < 2e-16 ***
## KID1      -0.45387    0.06811  -6.664  2.67e-11 ***
## KID2      -0.15736    0.06116  -2.573   0.01008 *  
## KID3       0.01561    0.04406   0.354   0.72315    
## log(INCH) -0.18833    0.06231  -3.023   0.00251 ** 
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
## 
## residual deviance= 4777.58,
## null deviance= 6549.14,
## n= 4792, l= [599, 8]
## 
## ( 1461 observation(s) deleted due to missingness )
## ( 6896 observation(s) deleted due to perfect classification )
## 
## Number of Fisher Scoring Iterations: 6

apes.dyn.bc <- getAPEs(dyn.bc)
summary(apes.dyn.bc)

## Estimates:
##            Estimate Std. error z value Pr(> |z|)    
## LLFP       0.186310   0.006686  27.864   < 2e-16 ***
## KID1      -0.072321   0.007832  -9.235   < 2e-16 ***
## KID2      -0.025074   0.007003  -3.580  0.000343 ***
## KID3       0.002487   0.005008   0.497  0.619447    
## log(INCH) -0.030009   0.007002  -4.286  1.82e-05 ***
## ---
## Signif. codes:  0 ‘***’ 0.001 ‘**’ 0.01 ‘*’ 0.05 ‘.’ 0.1 ‘ ’ 1
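The sensitivity analysis over the bandwidth suggested by Fernández-Val and Weidner (2016) could be sketched as follows. This is one possible way to organize it, not a routine provided by the package: the names bandwidths and sens are introduced here for illustration, and the sketch assumes that alpaca, bife, and data.table are installed and that coef() returns the bias-corrected coefficients of the object produced by biasCorr().

```r
library(alpaca)
library(data.table)

data(psid, package = "bife")
setDT(psid)
setorder(psid, ID, TIME)               # make sure the lag follows time order
psid[, LLFP := shift(LFP), by = ID]

dyn <- feglm(
  LFP ~ LLFP + KID1 + KID2 + KID3 + log(INCH) | ID + TIME,
  data   = psid,
  family = binomial("probit")
)

# Re-run the bias correction for each bandwidth and collect the coefficients.
bandwidths <- 1:4
sens <- sapply(bandwidths, function(l) coef(biasCorr(dyn, L = l)))
colnames(sens) <- paste0("L = ", bandwidths)
round(sens, 3)
```

Each column of sens holds the coefficients for one bandwidth, so estimates that change little across columns are not sensitive to the choice of L.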

¹ Note that in this specification (with individual and time fixed effects), observations belonging to a time period in which all women are either in the labor force or out of it can also be dropped. However, this is very unlikely in practice.
² The routine is currently restricted to binary-choice models but will be extended in the future.
³ Cruz-Gonzalez, Fernández-Val, and Weidner (2017) apply the same bias correction to a pseudo panel of bilateral trade flows.
⁴ See footnote 2.
