LSE LSE Research Laboratory LSE
LSE Research Laboratory (RLAB)

Abstract for:

Semiparametric Regression Analysis under Imputation for Missing Response Data

Wolfgang  Haerdle,  Oliver  Linton,  Qihua  Wang,  May 2003
Paper No' EM/2003/454: | Full paper (pdf)
Save Reference as: BibTeX BibTeX File | Endote EndNote Import File
Keywords: Asymptotic normality; empirical likelihood; semiparametric imputation.

JEL Classification:

Is hard copy/paper copy available? YES - Paper Copy Still In Print.
This Paper is published under the following series: Econometrics
Share this page: Google Bookmarks Google Bookmarks | Facebook Facebook | Twitter Twitter

Abstract:

We develop inference tools in a semiparametric regression model with missing response data. A semiparametric regression imputation estimator, a marginal average estimator and a (marginal) propensity score weighted estimator are defined. All the estimators are proved to be asymptotically normal, with the same asymptotic variance. They achieve the semiparametric efficiency bound in the homoskedastic Gaussian case. We show that the Jackknife method can be used to consistently estimate the asymptotic variance. Our model and estimators are defined with a view to avoid the curse of dimensionality, and that severely limits the applicability of existing methods. The empirical likelihood method is developed. It is shown that when missing responses are imputed using the semiparametric regression method the empirical log-likelihood is asymptotically a scaled chi-square variable. An adjusted empirical log-likelihood ratio, which is asymptotically standard chi-square, is obtained. Also, a bootstrap empirical log-likelihood ratio is derived and its distribution is used to approximate that of the imputed empirical log-likelihood ratio. A simulation study is conducted to compare the adjusted and bootstrap empirical likelihood with the normal approximation-based method in terms of coverage accuracies and average lengths of confidence intervals. Based on biases and standard errors, a comparison is also made by simulation between the proposed estimators and the related estimators. Furthermore, a real data analysis is given to illustrate our methods.