
Whitney K. Newey- Massachusetts Institute of Technology
Whitney K. Newey
- Massachusetts Institute of Technology
About
206
Publications
27,785
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
49,731
Citations
Introduction
Current institution
Publications
Publications (206)
Multidimensional heterogeneity and endogeneity are important features of a wide class of econometric models. With control variables to correct for endogeneity, nonparametric identification of treatment effects requires strong support conditions. To alleviate this requirement, we consider varying coefficients specifications for the conditional expec...
This paper introduces an estimator for the average of heterogeneous elasticities of taxable income (ETI), addressing key econometric challenges posed by nonlinear budget sets. Building on an isoelastic utility framework, we derive a linear-in-logs taxable income specification that incorporates the entire budget set while allowing for individual-spe...
Bias correction can often improve the finite sample performance of estimators. We show that the choice of bias correction method has no effect on the higher‐order variance of semiparametrically efficient parametric estimators, so long as the estimate of the bias is asymptotically linear. It is also shown that bootstrap, jackknife, and analytical bi...
We consider estimation of parameters defined as linear functionals of solutions to linear inverse problems. Any such parameter admits a doubly robust representation that depends on the solution to a dual linear inverse problem, where the dual solution can be thought as a generalization of the inverse propensity function. We provide the first source...
In this paper we address the problem of bias in machine learning of parameters following covariate shifts. Covariate shift occurs when the distribution of input features change between the training and deployment stages. Regularization and model selection associated with machine learning biases many parameter estimates. In this paper, we propose an...
This article surveys the development of nonparametric models and methods for estimation of choice models with nonlinear budget sets. The discussion focuses on the budget set regression, that is, the conditional expectation of a choice variable given the budget set. Utility maximization in a nonparametric model with general heterogeneity reduces the...
In this paper, we study nonparametric estimation of instrumental variable (IV) regressions. Recently, many flexible machine learning methods have been developed for instrumental variable estimation. However, these methods have at least one of the following limitations: (1) restricting the IV regression to be uniquely identified; (2) only obtaining...
Many causal and structural effects depend on regressions. Examples include policy effects, average derivatives, regression decompositions, average treatment effects, causal mediation, and parameters of economic structural models. The regressions may be high‐dimensional, making machine learning useful. Plugging machine learners into identifying equa...
jats:title>Summary We provide adaptive inference methods, based on $\ell _1$ regularization, for regular (semiparametric) and nonregular (nonparametric) linear functionals of the conditional expectation function. Examples of regular functionals include average treatment effects, policy effects, and derivatives. Examples of nonregular functionals in...
Often semiparametric estimators are asymptotically equivalent to a sample
average. The object being averaged is referred to as the influence function.
The influence function is useful in formulating primitive regularity conditions
for asymptotic normality, in efficiency comparions, for bias reduction, and for
analyzing robustness. We show that the...
High-dimensional linear models with endogenous variables play an increasingly important role in the recent econometric literature. In this work, we allow for models with many endogenous variables and make use of many instrumental variables to achieve identification. Because of the high-dimensionality in the structural equation, constructing honest...
We derive general, yet simple, sharp bounds on the size of the omitted variable bias for a broad class of causal parameters that can be identified as linear functionals of the conditional expectation function of the outcome. Such functionals encompass many of the traditional targets of investigation in causal inference studies, such as, for example...
Many causal and policy effects of interest are defined by linear functionals of high-dimensional or non-parametric regression functions. $\sqrt{n}$-consistent and asymptotically normal estimation of the object of interest requires debiasing to reduce the effects of regularization and/or model selection on the object of interest. Debiasing is typica...
Debiased machine learning is a meta algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals (i.e. scalar summaries) of machine learning algorithms. For example, an analyst may desire the confidence interval for a treatment effect estimated with a neural network. We provide a nonasymptotic debiased m...
We give debiased machine learners of parameters of interest that depend on generalized linear regressions, which regressions make a residual orthogonal to regressors. The parameters of interest include many causal and policy effects. We give neural net learners of the bias correction that are automatic in only depending on the object of interest an...
We provide an adversarial approach to estimating Riesz representers of linear functionals within arbitrary function spaces. We prove oracle inequalities based on the localized Rademacher complexity of the function space used to approximate the Riesz representer and the approximation error. These inequalities imply fast finite sample mean-squared-er...
Significance
The drift-diffusion model (DDM) has been widely used in psychology and neuroeconomics to explain observed patterns of choices and response times. This paper provides an identification and characterization theorems for this model: We show that the parameters are uniquely pinned down and determine which datasets are consistent with some...
Multidimensional heterogeneity and endogeneity are important features of models with multiple treatments. We consider a heterogeneous coefficients model where the outcome is a linear combination of dummy treatment variables, with each variable representing a different kind of treatment. We use control variables to give necessary and sufficient cond...
Control variables provide an important means of controlling for endogeneity in econometric models with nonseparable and/or multidimensional heterogeneity. We allow for discrete instruments, giving identification results under a variety of restrictions on the way the endogenous variable and the control variables affect the outcome. We consider many...
Research Summary
Strategic management has seen numerous studies analyzing interaction terms in nonlinear models since Hoetker's (2007) best‐practice recommendations and Zelner's (2009) simulation‐based approach. We suggest an alternative recentering approach to assess the statistical and economic importance of interaction terms in nonlinear models....
Many objects of interest can be expressed as a linear, mean square continuous functional of a least squares projection (regression). Often the regression may be high dimensional, depending on many variables. This paper gives minimal conditions for root-n consistent and efficient estimation of such objects when the regression and the Riesz represent...
This paper gives a consistent, asymptotically normal estimator of the expected value function when the state space is high-dimensional and the first-stage nuisance functions are estimated by modern machine learning tools. First, we show that value function is orthogonal to the conditional choice probability, therefore, this nuisance function needs...
The drift diffusion model (DDM) is a model of sequential sampling with diffusion (Brownian) signals, where the decision maker accumulates evidence until the process hits a stopping boundary, and then stops and chooses the alternative that corresponds to that boundary. This model has been widely used in psychology, neuroeconomics, and neuroscience t...
Multidimensional heterogeneity and endogeneity are important features of a wide class of econometric models. We consider heterogenous coefficients models where the outcome is a linear combination of known functions of treatment and heterogenous coefficients. We use control variables to obtain identification results for average treatment effects. Wi...
Control variables provide an important means of controlling for endogeneity in econometric models with nonseparable and/or multidimensional heterogeneity. We allow for discrete instruments, giving identification results under a variety of restrictions on the way the endogenous variable and the control variables affect the outcome. We consider many...
Many objects of interest can be expressed as an L2 continuous functional of a regression, including average treatment effects, economic average consumer surplus, expected conditional covariances, and discrete choice parameters that depend on expectations. Debiased machine learning (DML) of these objects requires a learning a Riesz representer (RR)....
There are many interesting and widely used estimators of a functional with finite semiparametric variance bound that depend on nonparametric estimators of nuisance functions. We use cross-fitting (i.e. sample splitting) to construct novel estimators with fast remainder rates. We give cross-fit doubly robust estimators that use separate subsamples t...
Exact consumer’s surplus and deadweight loss are the most widely used welfare and economic efficiency measures. These measures can be computed from demand functions in straightforward ways. Nonparametric estimation can be used to estimate the welfare measures. In doing so, it seems important to account correctly for unobserved heterogeneity, given...
The linear regression model is widely used in empirical work in Economics, Statistics, and many other disciplines. Researchers often include many covariates in their linear model specification in an attempt to control for confounders. We give inference methods that allow for many covariates and heteroskedasticity. Our results are obtained using hig...
Robins et al. (2008, 2016) applied the theory of higher order influence functions (HOIFs) to derive an estimator of the mean of an outcome Y in a missing data model with Y missing at random conditional on a vector X of continuous covariates; their estimator, in contrast to previous estimators, is semiparametric efficient under minimal conditions. H...
Most modern supervised statistical/machine learning (ML) methods are explicitly designed to solve prediction problems very well. Achieving this goal does not imply that these methods automatically deliver good estimators of causal parameters. Examples of such parameters include individual regression coefficients, average treatment effects, average...
Many economic and causal parameters depend on nonparametric or high dimensional first steps. We give a general construction of locally robust/orthogonal moment functions for GMM, where first steps have no effect, locally, on average moment functions. Using these orthogonal moments reduces model selection and regularization bias, as is important in...
Many economic and causal parameters depend on nonparametric or high dimensional first steps. We give a general construction of locally robust/orthogonal moment functions for GMM, where moment conditions have zero derivative with respect to first steps. We show that orthogonal moment functions can be constructed by adding to identifying moments the...
Individual heterogeneity is an important source of variation in demand. Allowing for general heterogeneity is needed for correct welfare comparisons. We consider general heterogeneous demand where preferences and linear budget sets are statistically independent. Only the marginal distribution of demand for each price and income is identified from c...
This paper examines a general class of inferential problems in semiparametric
and nonparametric models defined by conditional moment restrictions. We
construct tests for the hypothesis that at least one element of the identified
set satisfies a conjectured (Banach space) "equality" and/or (a Banach lattice)
"inequality" constraint. Our procedure is...
Shape restrictions have played a central role in economics as both testable implications of theory and sufficient conditions for obtaining informative counterfactual predictions. In this paper we provide a general procedure for inference under shape restrictions in identified and partially identified models defined by conditional moment restriction...
The linear regression model is widely used in empirical work in Economics.
Researchers often include many covariates in their linear model specification
in an attempt to control for confounders. We give inference methods that allow
for many covariates and heteroskedasticity. Our results are obtained using
high-dimensional approximations, where the...
Non-standard distributional approximations have received considerable
attention in recent years. They often provide more accurate approximations in
small samples, and theoretical improvements in some cases. This paper shows
that the seemingly unrelated "many instruments asymptotics" and "small
bandwidth asymptotics" share a common structure, where...
The central concern of the paper is with the formulation of tests of neglected parameter heterogeneity appropriate for model environments specified by a number of unconditional or conditional moment conditions. We initially consider the unconditional moment restrictions framework. Optimal m-tests against moment condition parameter heterogeneity are...
Given the key role of the taxable income elasticity in designing an optimal tax system there are many studies attempting to estimate this elasticity. A problem with most of these studies is that strong functional form assumptions are used and that heterogeneity in preferences is not allowed for. Building on Blomquist and Newey (2002) we in this pap...
This paper considers identification and estimation of ceteris paribus effects
of continuous regressors in nonseparable panel models with time homogeneity.
The effects of interest are derivatives of the average and quantile structural
functions of the model. We find that these derivatives are identified with two
time periods for ``stayers", i.e. for...
In a recent paper, Hausman, Newey, Woutersen, Chao, and Swanson (2012) propose a new estimator, HFUL (Heteroscedasticity robust Fuller), for the linear model with endogeneity. This estimator is consistent and asymptotically normally distributed in the many instruments and many weak instruments asymptotics. Moreover, this estimator has moments, just...
This chapter shows how a weighted average of a forward and reverse Jackknife IV estimator (JIVE) yields estimators that are robust against heteroscedasticity and many instruments. These estimators, called HFUL (Heteroscedasticity robust Fuller) and HLIM (Heteroskedasticity robust limited information maximum likelihood (LIML)) were introduced by Hau...
In many economic models, objects of interest are functions which satisfy conditional moment restrictions. Economics does not restrict the functional form of these models, motivating nonparametric methods. In this paper we review identification results and describe a simple nonparametric instrumental variables (NPIV) estimator. We also consider a si...
In this paper, we analyze properties of the Continuous Updating Estimator (CUE) proposed by Hansen et al. (1996), which has been suggested as a solution to the finite sample bias problems of the two-step GMM estimator. We show that the estimator should be expected to perform poorly in finite samples under weak identification, in particular, the est...
In parametric, nonlinear structural models a classical sufficient condition
for local identification, like Fisher (1966) and Rothenberg (1971), is that the
vector of moment conditions is differentiable at the true parameter with full
rank derivative matrix. We derive an analogous result for the nonparametric,
nonlinear structural models, establishi...
This note considers an asymptotic property of the class of closest moments estimators. Each such estimator is obtained by setting a vector of sample moments close to corresponding population moments. It is shown that each such estimator is asymptotically equivalent to a GMM estimator, which has a quadratic distance function. An implication of this...
Asymptotic Properties of One-step Estimator Obtained from an Optimal Step-size - Volume 4 Issue 2 - Whitney K. Newey
This paper gives a test of overidentifying restrictions that is robust to many instruments and heteroskedasticity. It is based on a jackknife version of the Sargan test statistic, having a numerator that is the objective function minimized by the JIVE2 estimator of Angrist, Imbens, and Krueger (1999). Correct asymptotic critical values are derived...
Properties of GMM estimators are sensitive to the choice of instruments. Using many instruments leads to high asymptotic asymptotic efficiency but can cause high bias and/or variance in small samples. In this paper we develop and implement asymptotic mean square error (MSE) based criteria for instrumental variables to use for estimation of conditio...
This essay discusses the issues of identification and estimation of the average treatment effect and the average effect of treatment on the treated.
Instrumental variables are often associated with low estimator precision. This paper explores efficiency gains which might be achievable using moment conditions which are nonlinear in the disturbances and are based on flexible parametric families for error distributions. We show that these estimators can achieve the semiparametric efficiency bound...
In the last fifteen years there has been much work on nonparametric identification of causal effects in settings with endogeneity. Earlier, researchers focused on linear systems with additive residuals. However, such systems are often difficult to motivate by economic theory. In many cases it is precisely the nonlinearity of the system and the pres...
These volumes constitute the invited proceedings from the Ninth World Congress of the Econometric Society held on the campus of University College London on August 19–24, 2005. As co-chairs of the Program Committee for the Congress, one of our most pleasant tasks was to select topics and authors for fifteen invited symposia – each organized around...
These volumes constitute the invited proceedings from the Ninth World Congress of the Econometric Society held on the campus of University College London on August 19–24, 2005. As co-chairs of the Program Committee for the Congress, one of our most pleasant tasks was to select topics and authors for fifteen invited symposia – each organized around...
Properties of GMM estimators are sensitive to the choice of instrument. Using many instruments leads to high asymptotic asymptotic efficiency but can cause high bias and/or variance in small samples. In this paper we develop and implement asymptotic mean square error (MSE) based criteria for instrument selection in estimation of conditional moment...
This paper uses control variables to identify and estimate models with nonseparable, multidimensional disturbances. Triangular simultaneous equations models are considered, with instruments and disturbances that are independent and a reduced form that is strictly monotonic in a scalar disturbance. Here it is shown that the conditional cumulative di...
[enter Abstract Body]This paper derives the limiting distributions of alternative jackknife IV (JIV ) estimators and gives formulae for accompanying consistent standard errors in the presence of heteroskedasticity and many instruments. The asymptotic framework includes the many instrument sequence of Bekker (1994) and the many weak instrument seque...
Using many moment conditions can improve efficiency but makes the usual generalized method of moments (GMM) inferences inaccurate. Two-step GMM is biased. Generalized empirical likelihood (GEL) has smaller bias, but the usual standard errors are too small in instrumental variable settings. In this paper we give a new variance estimator for GEL that...
Nonseparable panel models are important in a variety of economic settings,
including discrete choice. This paper gives identification and estimation
results for nonseparable models under time homogeneity conditions that are like
"time is randomly assigned" or "time is an instrument." Partial identification
results for average and quantile effects a...
Abstract This paper gives identiflcation and estimation results for marginal efiects in nonlinear panel models. We flnd that linear flxed efiects estimators are not consistent, due in part to marginal efiects not being identifled. We derive bounds for marginal efiects and show that they can tighten rapidly as the number of time series observations...
These volumes constitute the invited proceedings from the Ninth World Congress of the Econometric Society held on the campus of University College London on August 19–24, 2005. As co-chairs of the Program Committee for the Congress, one of our most pleasant tasks was to select topics and authors for fifteen invited symposia – each organized around...
This paper gives a relatively simple, well behaved solution to the problem of many instruments in heteroskedastic data. Such settings are common in microeconometric applications where many instruments are used to improve efficiency and allowance for heteroskedasticity is generally important. The solution is a Fuller (1977) like estimator and standa...
This paper gives identification and estimation results for quantile and average effects in nonseparable panel models, when the distribution of period specific disturbances does not vary over time. Bounds are given for interesting effects with discrete regressors that are strictly exogenous or predetermined. We allow for location and scale time effe...
This essay discusses the issues of identification and estimation of the average treatment effect and the average effect of treatment on the treated.
Using many valid instrumental variables has the potential to improve efficiency but makes the usual inference procedures inaccurate. We give corrected standard errors, an extension of Bekker to nonnormal disturbances, that adjust for many instruments. We find that this adjustment is useful in empirical work, simulations, and in the asymptotic theor...
Modeling choices that are both discrete and continuous is important in several settings. The purpose of this article is to explore formulation and identification of such models when indirect utility functions are specified nonparametrically. Here we consider general nonseparable disturbances. We give identification results for nonseparable sample s...
It is common practice in econometrics to correct for heteroskedasticity.This paper corrects instrumental variables estimators with many instruments for heteroskedasticity.We give heteroskedasticity robust versions of the limited information maximum likelihood (LIML) and Fuller (1977, FULL) estimators; as well as heteroskedasticity consistent standa...
There are many environments where knowledge of a structural relationship is required to answer questions of interest. Also, nonseparability of a structural disturbance is a key feature of many models. Here, we consider nonparametric identification and estimation of a model that is monotonic in a nonseparable scalar disturbance, which disturbance is...
2SLS is by far the most-used estimator for the simultaneous equation problem. However, it is now well-recognized that 2SLS can exhibit substantial finite sample (second-order) bias when the model is over-identified and the first stage partial R2 is low. The initial recommendation to solve this problem was to do LIML, e.g.Bekker(1994) or Staiger and...
This paper develops a new nonparametric series estimator for the average treatment effect for the case with unconfounded treatment assignment, that is, where selection for treatment is on observables. The new estimator is efficient. In addition we develop an optimal procedure for choosing the smoothing parameter, the number of terms in the series b...
This paper develops a new efficient estimator for the average treatment effect, if selection for treatment is on observables. The new estimator is linear in the first-stage nonparametric estimator. This simplifies the derivation of the means squared error (MSE) of the estimator as a function of the number of basis functions that is used in the firs...
Using many moment conditions can improve efficiency but makes the usual GMM inferences inaccurate. Two step GMM is biased. Generalized empirical likelihood (GEL) has smaller bias but the usual standard errors are too small. In this paper we use alternative asymptotics, based on many weak moment conditions, that addresses this problem. This asymptot...
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Economics, 1983. MICROFICHE COPY AVAILABLE IN ARCHIVES AND DEWEY Bibliography: leaves 169-170. by Whitney Kent Newey. Ph.D.
This 2005 volume contains the papers presented in honor of the lifelong achievements of Thomas J. Rothenberg on the occasion of his retirement. The authors of the chapters include many of the leading econometricians of our day, and the chapters address topics of current research significance in econometric theory. The chapters cover four themes: id...
This papers studies and compares the asymptotic bias of GMM and generalized empirical likelihood (GEL) estimators in the presence of estimated nuisance parameters. We consider cases in which the nuisance parameter is estimated from independent and identical samples. A simulation experiment is conducted for covariance structure models. Empirical lik...
Fixed effects estimators of panel models can be severely biased because of the well-known incidental parameters problem. We show that this bias can be reduced by using a panel jackknife or an analytical bias correction motivated by large T. We give bias corrections for averages over the fixed effects, as well as model parameters. We find large bias...
Conditional moment restrictions can be combined through GMM estimation to construct more efficient semiparametric estimators. This paper is about attainable efficiency for such estimators. We define and use a moment tangent set, the directions of departure from the truth allowed by the moments, to characterize when the semiparametric efficiency bou...
In an effort to improve the small sample properties of generalized method of moments (GMM) estimators, a number of alternative estimators have been suggested. These include empirical likelihood (EL), continuous updating, and exponential tilting estimators. We show that these estimators share a common structure, being members of a class of generaliz...
The purpose of this note is to show how semiparametric estimators with a small bias property can be constructed. The small bias property (SBP) of a semiparametric estimator is that its bias converges to zero faster than the pointwise and integrated bias of the nonparametric estimator on which it is based. We show that semiparametric estimators base...
Using many instruments can improve efficiency but makes the usual GMM in- ferences inaccurate. Two step GMM is biased. Generalized empirical likelihood (GEL) has smaller bias but the usual standard errors are too small. We address this problem by deriving the limiting distribution of GEL under alternative asymp- totics where the number of moments g...
We consider games with incomplete information a la Harsanyi, where the payoff of a player depends on an unknown state of nature as well as on the profile of chosen actions. As opposed to the standard model, players' preferences over state--contingent utility vectors are represented by arbitrary functionals. The definitions of Nash and Bayes equilib...
This paper is about efficient estimation and consistent tests of conditional moment restrictions. We use unconditional moment restrictions based on splines or other approximating functions for this purpose. Empirical likelihood estimation is particularly appropriate for this setting, because of its relatively low bias with many moment conditions. W...