If individual effect
(cross-sectional or time specific effect) does
not exist (ui=0) , OLS produces efficient and
consistent parameter estimates;
yit=β0+β1xit+ui+vit (1)
and we assumed that (ui=0) .
OLS consists of five
core assumptions (Greene,2008; Kennedy,2008)
o Linearity – the model is
linear function.
o Exogeneity – expected
value of disturbance is zero or disturbance are not correlated with any
regressor.
o Homoscedasticity &
no autocorrelation.
o Not stochastic for the
independent variable but fixed in repeated samples.
o Full rank – there is no
exact linear relationship among independent variables
There are
several strategies for estimating a fixed effect model; the least squares dummy
variable (LSDV) model, within estimation and between estimation.
Random
Effects (RE) Model
In FE model we had
discuss in here, the estimation goal of FE is to eliminate ui
because it is thought to be correlated with
one or more of the xit.
But, suppose we
assume ui is uncorrelated with each explanatory variable in all time
periods. Then using a transformation to eliminate ui
will results inefficient estimators.
Eq(1) becomes a RE
model when we assume that the unobserved effect ui is uncorrelated with each explanatory variable;
Cov(xij,ui)=0 (2)
The ideal RE
assumptions include all the FE assumptions plus the additional requirement that
ui is independent
of all explainatory variables in all time periods.
If we assume
the unobserved the ui is correlated with any explanatory variables, we
should use first differencing or FE.
To estimate RE,
we define the composite error term as wit=ui+vit, then
the Eq (2.1) can be written as;
yit=β0+β1xit+wit (3)
Because
is in the
composite error in each time period, the wit are serially correlated across time.
Under the RE
assumptions;
Corr(wit,wis)=σ2u/(σ2u+σ2v),t≠s (4)
where σ2u=Var(ui) and σ2v=Var(vit)
The RE is estimated
by GLS when the covariance structure is known, and by FGLS or EGLS when the
covariance structure of composite error is unknown.
Compared to FE
model, a RE model is relatively difficult to estimate. In FGLS, we first have
to estimate θ using ˆσ2u and ˆσ2v .
The ˆσ2u comes from the
between effect estimation (group mean regression) and ˆσ2v
is derived from the RSS of the within effect estimation of the deviation of
residuals from group means of residual;
ˆθ=1−√ˆσ2vTˆσ2u+ˆσ2v=1−√ˆσ2vTˆσ2between
(5)
where ˆσ2u=ˆσ2between−ˆσ2vT, where ˆσ2between=RSSbetweenn−k−1 ,
ˆσ2v=RSSwithinnT−n−k=e′ewithinnT−n−k=∑ni=1∑Tt=1(vit−ˉvi)2nT−n−k, where
are the residual of the LSDV.
Then, the dependent
variable, independent variables, and the intercept term need to be transformed
as follows;
y∗it=yit−ˆθˉyi (6)
x∗it=xit−ˆθˉxi (7)
β∗0=1−ˆθ (8)
Finally, run OLS on
those transformed variables , Eq(6), (7) and (8) with the traditional
intercept suppressed;
y∗it=β∗0+β1x∗it+ε∗it (9)
Estimation
using Stata
For our
discussion on the RE using Stata, lets we use the data airline.dta again
as we discuss the FE model in here and we want to estimate the effects of output,
fuel and loading factor to the cost of airline companies;
costit=β0+β1outputit+β2fuelit+β3loadit+vit (10)
where;
costit =
cost of airline companies
outputit =
revenue passenger mile (output index)
fuelit = fuel prices
loadit = loading factor (average capacity utilization of
the fleet)
Now, lets
us regress the Eq(10) by the pooled OLS
reg cost output
fuel load
Now, lets we regress
the RE model. The estimation of the RE model require that we need to estimate
the Eq(3) first and then get the value of θ
manually as in
Eq(5). After that, we need to transform the data based on the value of
as in Eq(6), Eq(7)
and Eq(8) also in manually and then regress the RE model by OLS as in Eq(9).
In Stata, we can
skip the procedure of calculation and estimation manually from Eq(5) through
Eq(9). Thanks to Stata for the command xtreg,re which the Stata estimate the Eq(9)
automatically to get the output in RE estimation.
Before we run the xtreg
command, we need to specifies first the cross-sectional and time series
variables,
xtset airline year
To estimate the RE
model as in Eq(9);
xtreg cost output fuel load,re theta
The sigma_u
and sigma_e
are square roots of the variance
components for groups and errors, respectively (0.0156=0.12492,0.036=0.06012).
Note that the RSS is 0.0602 displayed under sigma_e.
The rho
represents the ratio of individual specific error variance to the
composite (entire) error variance, 0.8119=0.12492/(0.12492+0.06012).
A large ratio – individual specific error account a
large proportion of the composite error variance.
For this RE estimation, the individual specific
error can explain 81% of entire composite error variance.
This ratio may be interpreted as a goodness-of-fit
of RE model.