Consider the semiparametric regression model
y = x\sp{\tau}\beta + g(t) + e where x and t are covariates,
is a p-vector of unknown parameters, g is an unknown smooth function, and e is the error term with mean 0 and variance
\sigma\sp2 > 0. This model arises naturally in situations where some covariates are difficult to be formulated into the model in a parametric fashion. There have
... [Show full abstract] been several approaches to estimating the parameter of interest , all depending on some kinds of smoothing parameters inherent in the treatment of the nonparametric component g. For example, the kernel smoothing estimator \ \beta\sb{2h}, proposed in Speckman (1988), depends on the bandwidth h. So we face the important issue of how to choose those smoothing parameters. Although this issue has been extensively studied in the context of non-parametric regression, there has been little work on this semiparametric setting. In this thesis, we investigate asymptotic properties of the automatic bandwidth choice and the resulting data-driven kernel smoothing estimator \ \beta\sb{2\ h} of the parameter The bandwidth is chosen to minimize a general data-driven bandwidth selector which includes such traditional methods as Mallow's C\sb{L} criterion, CV and GCV. Asymptotic optimality of is proved and its asymptotic normality is established. The data-driven estimator \ \beta\sb{2\ h} is shown to be -consistent by establishing an asymptotic normality. We further study the accuracy of this normal approximation and show that, contrary to what might be expected, it can not attain the optimal Berry-Esseen rate n\sp{-1/2}. Consequently, the one-sided confidence interval of based on this normal approximation is not first order accurate, causing potential poor coverage of the true parameter. To overcome this drawback, an estimator is proposed to reduce the bias of \ \beta\sb{2h}. The data-driven version of the proposed estimator successfully attains the optimal normal approximation rate n\sp{-1/2}. The corresponding one-sided confidence interval of is thus first order accurate. Simulation studies show that not only does the proposed estimator provide smaller bias and more accurate confidence interval in terms of coverage, as expected by our theoretical results, it also has better control of variance than \ \beta\sb{2h} for both deterministic and automatic bandwidth choices. Thesis (Ph.D., Statistics)--Northwestern University, 1998.