Many economic relationship are
dynamic in nature and one of the advantages of panel data is that they allow
the researcher to better understand the dynamics of adjustment. Some economic
model suggest that current behavior depends upon past behavior, so in many
cases we would like to estimate a dynamic model on an individual level. The
ability to do so is unique for panel data.
The dynamic relationship of panel data is characterized by
the presence of a lagged dependent variable among the repressors;
yit=x′itβ+γyi,t−1+x′itβ+αi+εit (1)
where it is assumed that εitis IID(0,σ2ε) .
The basic problem with the lagged dependent variable
included in the model;
1) For
the FE estimator;
yit−ˉyi=γ(yi,t−1−ˉyi,−1)+(xit−ˉxi.)β+(εit−ˉεi) (2)
the within transformation wipe out the αi , but
(yi,t−1−ˉyi,−1) will be correlated with (εit−ˉεi) even if the εit
are not serially correlated. This is because yi,−1 is correlated with ˉεi,
the latter average contains εi,t−1, which is obviously correlated with yi,t−1.
2) For
the RE estimator, in order to apply GLS, quasi-demeaning is performed and (yi,t−1−θˉyi,−1) will be correlated with (εit−θˉεi).
That means, for a dynamic panel data model, the estimator is
biased and inconsistent, whether the effects are treated as fixed or random.
This bias is of order 1/T
and disappears only if T→∞ . The bias can be serious when T is small and N→∞ .
To see why the biased and inconsistent is exist when T is fixed and N→∞ , let we
first consider the case where there are no exogenous variables included ;
yit=x′itβ+γyi,t−1+αi+εit, |λ|<1 (3)
Assumed that we have observations on yit for period t=0,1,..T .
The FE estimator for γ ;
ˆγFE=∑Ni=1∑Tt=1(yit−ˉyi)(yi,t−1−ˉyi,−1)∑Ni=1∑Tt=1(yi,t−1−ˉyi,−1)2 (4)
where ˉyit=(1/T)∑Tt=1yit and ˉyi,−1=(1/T)∑Tt=1yi,t−1.
The properties of ˆγFE
can be shown by substitute Eq(3)
into Eq(4);
ˆγFE=γ+(1/(NT))∑Ni=1∑Tt=1(εit−ˉεi)(yi,t−1−ˉyi,−1)(1/(NT))∑Ni=1∑Tt=1(yi,t−1−ˉyi,−1)2 (5)
The estimator for FE in Eq(5) is biased and inconsistent for
N→∞
and fixed T because the last term in the right-hand side
of Eq(5) does not have expectation zero and does not converge to zero if N→∞. Nickell (1981) and Hsio (2003) state that;
plimn→∞1NTN∑i=1T∑t=1(εit−ˉεi)(yi,t−1−ˉyi,−1)=−σ2εT2⋅(T−1)−Tγ+γT(1−γ)2≠0
(6)
Thus, for fixed T we have inconsistent estimator. This inconsistency is
nothing to do with αi as these is eliminated in estimation.
The problem is that the within transformed lagged dependent
variable is correlated with the within transformed error as we see in Eq(2) and
Eq(5). If T→∞ , Eq(6) converge to 0 so that the FE estimator
is consistent for γ
if both T→∞ and N→∞.
To solve the
inconsistency problem, Anderson and Hsio (1981) proposed the instrumental
variable (IV) estimator for the γ. Lets we start first with a different transformation
to eliminate the individual effects αi with first differences;
yit−yi,t−1=γ(yi,t−1−yi,t−2)+(εit−εi,t−1) t=2,..T (7)
Estimation Eq(7) by OLS will lead inconsistent estimator of γ because yi,t−1 and εi,t−1
are correlated, even T→∞ .
The transformation specification Eq(7) suggests an IV approach
.For example, yi,t−2
is correlated with (yi,t−1−yi,t−2) but not with εi,t−1.
This suggests and IV estimator for γ by Anderson and Hsio(1981);
ˆγIV=∑Ni=1∑Tt=2yi,t−2(yit−yi,t−1)∑Ni=1∑Tt=2yi,t−2(yi,t−1−yi,t−2) (8)
and the condition for consistency of estimator in Eq(8);
plim = 1N(T−1)N∑i=1T∑t=2(εit−εi,t−1)yi,t−2=0 (9)
and T→∞, or N→∞, or both. Note that (εit−εi,t−1)
is MA(1).
Anderson and Hsio (1981) also proposed an alternative, where
(yi,t−2−yi,t−3) is used as an instrument;
ˆγ(2)IV=∑Ni=1∑Tt=3(yi,t−2−yi,t−3)(yit−yi,t−1)∑Ni=1∑Tt=3(yi,t−2−yi,t−3)(yi,t−1−yi,t−2) (10)
and the condition for consistency of estimator in Eq(10);
plim = 1N(T−2)N∑i=1T∑t=1(εit−εi,t−1)(yi,t−2−yi,t−3)=0 (11)
Consistency of both Eq(8) and Eq(11) is guaranteed as long as εit has no autocorrelation.
We see that in Eq(10) the IV estimator requires an
additional lag to construct the instrument and lead ‘lost’ in one sample
period. The question is, which estimator
we should use? Eq(8) or Eq(10)?
This is not an issues as a method of moment (MM) approach
can unify the estimators and eliminate the disadvantages of reduced sample
size.
The moment condition for Eq(9) become;
plim = 1N(T−1)N∑i=1T∑t=2(εit−εi,t−1)yi,t−2=E{(εit−εi,t−1)yi,t−2}0 (12)
Similary for Eq(11)
plim = 1N(T−2)N∑i=1T∑t=1(εit−εi,t−1)(yi,t−2−yi,t−3)=E{(εit−εi,t−1)(yi,t−2−yi,t−3)}=0
(13)
Both IV estimator impose one moment condition in estimation.
But, as we know, imposing more moment condition increase the
efficiency of the estimators.
Follow on this, Arellano and Bond(1991) then suggest that
the list of instrument can be extended by exploiting additional moment and
letting their number vary with t . To do this, they keep T fixed.
Lets T=4
, then the moment condition for period t=2 become;
E{(εi2−εi,1)yi0}=0
which means the variable yi0 is valid instrument, since it is
highly correlated with (yi1−yi0) and not correlated with (εi2−εi,1).
For t=3,
we have
E{(εi3−εi,2)yi1}=0
and, we also hold that
E{(εi3−εi,2)yi0}=0
where yi0 and yi1 are correlated with (yi2−yi1) and not correlated with
(εi3−εi,2).
And then, for period t=4, we have three moment conditions and there
valid instruments;
E{(εi4−εi,3)yi0}=0
E{(εi4−εi,3)yi1}=0
E{(εi4−εi,3)yi2}=0
One can continue this fashion, the set of valid instruments
becomes (yi0,yi2...yi,T−2).
All these moment conditions can be exploited is a General
Method of Moment (GMM) framework. For a general sample size T , the vector of
transformed error terms become;
and the matrix of instruments;
Each row in the matrix Zi
contains the instruments that are
valid for given period. Consequently, the set of all moment conditions can be
written as
E{Z′iΔεi}=0 (16)
To derive the GMM estimator, written Eq(16) as
E{Z′i(Δyi−γΔyi,−1)}=0
(17)
Typically, the number of moment condition will exceed the
number of unknown parameters, and we estimate γ by minimizing quadratic expression in
term of the corresponding sample moments.
minγ [1NN∑i=1Z′i(Δyi−γΔyi,−1)]′WN[1NN∑i=1Z′i(Δyi−γΔyi,−1)] (18)
where WN is a symmetric positive
definite weighting matrix. Differentiating Eq(18) with respect to γ and
solving for γ give the GMM estimator;
ˆγGMM=((N∑i=1Δy′i,−1Zi)WN(N∑i=1Zi′Δyi,−1))−1×(N∑i=1Δy′i,−1Zi)WN(N∑i=1Zi′Δyi)
(19)
The GMM approach does not impose that εit
is i.i.d. over individuals and time. Note that the absence of autocorrelation
was needed to guarantee that the moment condition is valid. So, it advisable
(for a small sample) to impose the absence of autocorrelation in εit, combined with a homoscedasticity
assumption.
Alvarez and Arellano (2003) show that, in general, the GMM
estimator is also consistent when both T→∞ and N→∞. But, for the large T , the GMM estimator will close to the
FE estimator, which provides a more attractive alternative.
Estimation with Stata
For dynamic
panel estimation , we use the abdata.dta.
The
aim is to estimate a model for employment in a panel of companies in UK and the
model estimated is based on Arellano-Bond (1991);
nit=α1ni,t−1+α2ni,t−2+β1wit+β2wi,t−1+β3kit+β4ki,t−1+β5ki,t−2+β6ysit+β7ysi,t−1+β8ysi,t−2+λt+ui+εit (20)
Where;
n
= log of employee
w = log of per-employee real wage
k = log of gross capital stock
ys = log of output of each industry
λt= time effect
ui=
time-invariant unobservable
εit = idiosyncratic error
and the T=7
and N=140
Lets now we estimate the model Eq(20) with the IV estimation
base on the Anderson & Hsio (1981). We will use the xtvireg command estimation;
xtivreg n (L(1/2).n L(0/1).w L(0/2).(k ys)
yr1980-yr1984 = L(2/3).n L(0/1).w L(0/2).(k ys) yr1980-yr1984),fd nocons
Now, lets
we estimate the Eq(20) again but with the Arellano and Bond(1991) method, or
GMM method.
xtabond n L(0/1).w L(0/2).(k ys)
yr1979-yr1984 year, lags(2) vce(robust)
It is
important to test H0 : error not correlated at the second order i.e.
dynamics correctly specified. Of course,
H0 : error not correlated at the first order is always rejected
because in the first difference equation errors – MA(1).
To perform the Arellano-Bond test for first-
and second-order autocorrelation in the first-difference errors;
estat
abond
The
results for Arellano-Bond test shows
that our estimation does not present evidence that the model is misspecified
(no autocorrelation at second order).
Beside the xtabond,we also can use the user-written xtabond2 command by Roodman (2009) to get the exactly same results.
xtabond2 n L(1/2).n L(0/1).w L(0/2).(k ys)
yr1979-yr1984,gmm(n,laglimits(2 .)) iv(L(0/1).w L(0/2).(k ys) yr1979-yr1984)
noleveleq
The results
from the xtbond2 shows that our estimation
does not present evidence that the model is misspecified (no autocorrelation at
second order).
For the
Sargan test for overidentifying, only
for homokedastic error term does the Sargan test have an asymptotic chi-squared
distribution. In fact, Arellano and bond (1991) show that one-step Sargan test
overrejects in the presence of heteroskedasticity.The results above presents
strong evidence against the null hypothesis that the overidentifying restriction
are valid, or the population moment condition are correct. Rejecting this null
hypothesis implies that we need to reconsider our model or our instruments.
Some
consideration need to take note.
1. First, all the explanatory variables
different from the lagged dependent variable are assumed to be strictly
exogenous, i.e. E(xit,εis)=0 , ∀t,s=1,...,T,∀i=1,..N.
2. The gmm option specifies a set of variables
to be used as bases for “GMM-style” instrument sets describe in Holtz-Eakin,
Newey and Rosen (1988) and Arellano and Bond (1991).
3. By default, xtbond2
uses for each time
period, all variable lags of the specified variables in levels dated t−1 or earlier as
instruments for the first difference equation, and the contemporaneous first
differences as instruments in the level equations.
4. The suboption laglimit(a
b)can override these default. For the
first-difference equations, lagged levels dated t−a to t−b are used as instruments. For the level
equation, the first-difference dated t+a+1 is normally used. Note that a and b can each be missing
intending to
infinity; they can even be negative, implying “forward” lags (Areallano and
Bover, 1995).
5. There are different ways of writing gmm for eq(diff). For example, the use of yi,t−2 and its lags as IVs for yi,t−1 in
equation in first-differences can be written as : gmm(y,laglimits(2 .)),or
gmm(L2.y,laglimits(0 .)),or gmm(L.y,laglimits(1 .))or the default gmm(L.y).