Processing math: 100%

MathType

Sunday, 26 June 2016

Within and Between Estimator with Stata (Panel)





WITHIN ESTIMATOR (The xtreg,fe command)

The individual-spesific-effects model for the scalar dependent variable yit specifies that;
                               
                             yit=α+xitβ+εt                             (1)

where xit are regressor, αi are random individual-spesific-effects, and εit  is and idiosyncratic error.

In the fixed-effect (FE) model, the αi in the model Eq(1) can be eliminated by subtraction of the corresponding model for individual means;

                ˉyi=αi+ˉxiβ+ˉεi                               (2)

where, for example ˉx=T1iTit=1xit

Subtracts Eq(1) to Eq (2)

                (yitˉyi)=(xitˉxit)β+(εitˉεi)                            (3)


Because αi has been eliminated, OLS leads to consistent estimates of β even if  αi is correlated with xit  as in case of the FE model.

This results give great advantage of panel data.

The disadvantage is inability to estimate the coefficients or a time-invariant regressor.

Also within estimator will be relatively imprecise for time-varying regressors that vary little over time.

Stata actually fit the model;

(yitˉyi+ˉˉy)=α+(xitˉxit+ˉˉx)β+(εitˉεi+ˉˉε)                     (4)

where, for example ˉˉy=(1/N)ˉyi  is the grand mean of yit .

This parameterization has the advantage of providing an intercept estimate, the average of the individual effects αi, while yielding the same slope estimate β  as that from the within model.

The within estimator is computed by using xtreg command with the fe option.
The default standard error assume that after controlling for αi , the error εit is i.i.d.

The vce (robust) option relaxes this assumption and provides cluster-robust standard error, provided that observations are independent over i  and N .

To estimate Eq(4) using same variables before when we discuss within and between variation ,

xtreg lwage exp exp2 wks ed,fe vce(cluster id)


 

















Coefficient of edu is not identified because the data on education is time-invariant.


WITHIN ESTIMATOR (LSDV Regression- areg command)

Another name for the within estimator is the least-square dummy-variable (LSDV) estimator.

This because it can be shown to equal the estimator obtained from OLS estimation of yit  on xit and N individual-specific indicator variables dj,it,j=1,...,N, where dj,it,=1  for the it-th observation if j=1 , and dj,it,j=0 otherwise.

Thus, we fit the model;

                yit=(Nj=1αidj,itxitβ)+εit                                                (5)


This equivalence of LSDV and within estimators does not carry over to nonlinear models.
To estimate Eq(5) using same variable before,

areg lwage exp exp2 wks ed, absorb(id) vce(cluster id)



 


 


The coefficient estimates are the same as those from xtreg,fe.

The cluster-robust standard error differ because of difference small-sample correction.

Thus, xtreg,fe should be used.

This difference arises because inference for areg is designed for case where N  is fixed and T , whereas we are considering short panel case, where T  is fixed and N.





BETWEEN ESTIMATOR (The xtreg,be command)

Uses only between or cross-section variation in the data and is the OLS estimator from the regression of ˉyi on xit.

Because only cross-section variation in the data is used, the coefficient of any individual-invariant  regressor (such as time dummies) cannot be identified.

The between estimator is inconsistent in the FE model but consistent in the RE model.

To explain this, average the individual-effects model Eq(1) to obtain between model;

                ˉyi=αi+ˉxiβ+(αiα+ˉεi)                  (6)

The between estimator is the OLS estimator in this model.

Consistency requires that the error term (αiα+ˉεi) be uncorrelated with xit

This is the case if αi is a random effect but not if αi is a fixed effect.

To estimate Eq(6) using same variable before, in Command window;

xtreg lwage exp exp2 ed,be

 















The estimates and standard error are closer to those obtained from pooled OLS than from within estimation.



No comments:

Post a Comment