Whitney K. Newey

Whitney K. Newey
  • Massachusetts Institute of Technology

About

206
Publications
27,785
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
49,731
Citations
Current institution
Massachusetts Institute of Technology

Publications

Publications (206)
Preprint
Full-text available
Multidimensional heterogeneity and endogeneity are important features of a wide class of econometric models. With control variables to correct for endogeneity, nonparametric identification of treatment effects requires strong support conditions. To alleviate this requirement, we consider varying coefficients specifications for the conditional expec...
Preprint
This paper introduces an estimator for the average of heterogeneous elasticities of taxable income (ETI), addressing key econometric challenges posed by nonlinear budget sets. Building on an isoelastic utility framework, we derive a linear-in-logs taxable income specification that incorporates the entire budget set while allowing for individual-spe...
Article
Full-text available
Bias correction can often improve the finite sample performance of estimators. We show that the choice of bias correction method has no effect on the higher‐order variance of semiparametrically efficient parametric estimators, so long as the estimate of the bias is asymptotically linear. It is also shown that bootstrap, jackknife, and analytical bi...
Preprint
We consider estimation of parameters defined as linear functionals of solutions to linear inverse problems. Any such parameter admits a doubly robust representation that depends on the solution to a dual linear inverse problem, where the dual solution can be thought as a generalization of the inverse propensity function. We provide the first source...
Preprint
In this paper we address the problem of bias in machine learning of parameters following covariate shifts. Covariate shift occurs when the distribution of input features change between the training and deployment stages. Regularization and model selection associated with machine learning biases many parameter estimates. In this paper, we propose an...
Article
This article surveys the development of nonparametric models and methods for estimation of choice models with nonlinear budget sets. The discussion focuses on the budget set regression, that is, the conditional expectation of a choice variable given the budget set. Utility maximization in a nonparametric model with general heterogeneity reduces the...
Preprint
Full-text available
In this paper, we study nonparametric estimation of instrumental variable (IV) regressions. Recently, many flexible machine learning methods have been developed for instrumental variable estimation. However, these methods have at least one of the following limitations: (1) restricting the IV regression to be uniquely identified; (2) only obtaining...
Article
Many causal and structural effects depend on regressions. Examples include policy effects, average derivatives, regression decompositions, average treatment effects, causal mediation, and parameters of economic structural models. The regressions may be high‐dimensional, making machine learning useful. Plugging machine learners into identifying equa...
Article
jats:title>Summary We provide adaptive inference methods, based on $\ell _1$ regularization, for regular (semiparametric) and nonregular (nonparametric) linear functionals of the conditional expectation function. Examples of regular functionals include average treatment effects, policy effects, and derivatives. Examples of nonregular functionals in...
Article
Full-text available
Often semiparametric estimators are asymptotically equivalent to a sample average. The object being averaged is referred to as the influence function. The influence function is useful in formulating primitive regularity conditions for asymptotic normality, in efficiency comparions, for bias reduction, and for analyzing robustness. We show that the...
Article
High-dimensional linear models with endogenous variables play an increasingly important role in the recent econometric literature. In this work, we allow for models with many endogenous variables and make use of many instrumental variables to achieve identification. Because of the high-dimensionality in the structural equation, constructing honest...
Preprint
Full-text available
We derive general, yet simple, sharp bounds on the size of the omitted variable bias for a broad class of causal parameters that can be identified as linear functionals of the conditional expectation function of the outcome. Such functionals encompass many of the traditional targets of investigation in causal inference studies, such as, for example...
Preprint
Many causal and policy effects of interest are defined by linear functionals of high-dimensional or non-parametric regression functions. $\sqrt{n}$-consistent and asymptotically normal estimation of the object of interest requires debiasing to reduce the effects of regularization and/or model selection on the object of interest. Debiasing is typica...
Preprint
Full-text available
Debiased machine learning is a meta algorithm based on bias correction and sample splitting to calculate confidence intervals for functionals (i.e. scalar summaries) of machine learning algorithms. For example, an analyst may desire the confidence interval for a treatment effect estimated with a neural network. We provide a nonasymptotic debiased m...
Preprint
We give debiased machine learners of parameters of interest that depend on generalized linear regressions, which regressions make a residual orthogonal to regressors. The parameters of interest include many causal and policy effects. We give neural net learners of the bias correction that are automatic in only depending on the object of interest an...
Preprint
We provide an adversarial approach to estimating Riesz representers of linear functionals within arbitrary function spaces. We prove oracle inequalities based on the localized Rademacher complexity of the function space used to approximate the Riesz representer and the approximation error. These inequalities imply fast finite sample mean-squared-er...
Article
Significance The drift-diffusion model (DDM) has been widely used in psychology and neuroeconomics to explain observed patterns of choices and response times. This paper provides an identification and characterization theorems for this model: We show that the parameters are uniquely pinned down and determine which datasets are consistent with some...
Preprint
Full-text available
Multidimensional heterogeneity and endogeneity are important features of models with multiple treatments. We consider a heterogeneous coefficients model where the outcome is a linear combination of dummy treatment variables, with each variable representing a different kind of treatment. We use control variables to give necessary and sufficient cond...
Article
Control variables provide an important means of controlling for endogeneity in econometric models with nonseparable and/or multidimensional heterogeneity. We allow for discrete instruments, giving identification results under a variety of restrictions on the way the endogenous variable and the control variables affect the outcome. We consider many...
Article
Research Summary Strategic management has seen numerous studies analyzing interaction terms in nonlinear models since Hoetker's (2007) best‐practice recommendations and Zelner's (2009) simulation‐based approach. We suggest an alternative recentering approach to assess the statistical and economic importance of interaction terms in nonlinear models....
Preprint
Full-text available
Many objects of interest can be expressed as a linear, mean square continuous functional of a least squares projection (regression). Often the regression may be high dimensional, depending on many variables. This paper gives minimal conditions for root-n consistent and efficient estimation of such objects when the regression and the Riesz represent...
Preprint
Full-text available
This paper gives a consistent, asymptotically normal estimator of the expected value function when the state space is high-dimensional and the first-stage nuisance functions are estimated by modern machine learning tools. First, we show that value function is orthogonal to the conditional choice probability, therefore, this nuisance function needs...
Preprint
The drift diffusion model (DDM) is a model of sequential sampling with diffusion (Brownian) signals, where the decision maker accumulates evidence until the process hits a stopping boundary, and then stops and chooses the alternative that corresponds to that boundary. This model has been widely used in psychology, neuroeconomics, and neuroscience t...
Preprint
Full-text available
Multidimensional heterogeneity and endogeneity are important features of a wide class of econometric models. We consider heterogenous coefficients models where the outcome is a linear combination of known functions of treatment and heterogenous coefficients. We use control variables to obtain identification results for average treatment effects. Wi...
Preprint
Full-text available
Control variables provide an important means of controlling for endogeneity in econometric models with nonseparable and/or multidimensional heterogeneity. We allow for discrete instruments, giving identification results under a variety of restrictions on the way the endogenous variable and the control variables affect the outcome. We consider many...
Preprint
Full-text available
Many objects of interest can be expressed as an L2 continuous functional of a regression, including average treatment effects, economic average consumer surplus, expected conditional covariances, and discrete choice parameters that depend on expectations. Debiased machine learning (DML) of these objects requires a learning a Riesz representer (RR)....
Article
Full-text available
There are many interesting and widely used estimators of a functional with finite semiparametric variance bound that depend on nonparametric estimators of nuisance functions. We use cross-fitting (i.e. sample splitting) to construct novel estimators with fast remainder rates. We give cross-fit doubly robust estimators that use separate subsamples t...
Article
Exact consumer’s surplus and deadweight loss are the most widely used welfare and economic efficiency measures. These measures can be computed from demand functions in straightforward ways. Nonparametric estimation can be used to estimate the welfare measures. In doing so, it seems important to account correctly for unobserved heterogeneity, given...
Article
Full-text available
The linear regression model is widely used in empirical work in Economics, Statistics, and many other disciplines. Researchers often include many covariates in their linear model specification in an attempt to control for confounders. We give inference methods that allow for many covariates and heteroskedasticity. Our results are obtained using hig...
Article
Full-text available
Robins et al. (2008, 2016) applied the theory of higher order influence functions (HOIFs) to derive an estimator of the mean of an outcome Y in a missing data model with Y missing at random conditional on a vector X of continuous covariates; their estimator, in contrast to previous estimators, is semiparametric efficient under minimal conditions. H...
Article
Full-text available
Most modern supervised statistical/machine learning (ML) methods are explicitly designed to solve prediction problems very well. Achieving this goal does not imply that these methods automatically deliver good estimators of causal parameters. Examples of such parameters include individual regression coefficients, average treatment effects, average...
Article
Full-text available
Many economic and causal parameters depend on nonparametric or high dimensional first steps. We give a general construction of locally robust/orthogonal moment functions for GMM, where first steps have no effect, locally, on average moment functions. Using these orthogonal moments reduces model selection and regularization bias, as is important in...
Preprint
Many economic and causal parameters depend on nonparametric or high dimensional first steps. We give a general construction of locally robust/orthogonal moment functions for GMM, where moment conditions have zero derivative with respect to first steps. We show that orthogonal moment functions can be constructed by adding to identifying moments the...
Article
Full-text available
Individual heterogeneity is an important source of variation in demand. Allowing for general heterogeneity is needed for correct welfare comparisons. We consider general heterogeneous demand where preferences and linear budget sets are statistically independent. Only the marginal distribution of demand for each price and income is identified from c...
Article
Full-text available
This paper examines a general class of inferential problems in semiparametric and nonparametric models defined by conditional moment restrictions. We construct tests for the hypothesis that at least one element of the identified set satisfies a conjectured (Banach space) "equality" and/or (a Banach lattice) "inequality" constraint. Our procedure is...
Preprint
Shape restrictions have played a central role in economics as both testable implications of theory and sufficient conditions for obtaining informative counterfactual predictions. In this paper we provide a general procedure for inference under shape restrictions in identified and partially identified models defined by conditional moment restriction...
Article
Full-text available
The linear regression model is widely used in empirical work in Economics. Researchers often include many covariates in their linear model specification in an attempt to control for confounders. We give inference methods that allow for many covariates and heteroskedasticity. Our results are obtained using high-dimensional approximations, where the...
Article
Full-text available
Non-standard distributional approximations have received considerable attention in recent years. They often provide more accurate approximations in small samples, and theoretical improvements in some cases. This paper shows that the seemingly unrelated "many instruments asymptotics" and "small bandwidth asymptotics" share a common structure, where...
Article
The central concern of the paper is with the formulation of tests of neglected parameter heterogeneity appropriate for model environments specified by a number of unconditional or conditional moment conditions. We initially consider the unconditional moment restrictions framework. Optimal m-tests against moment condition parameter heterogeneity are...
Article
Given the key role of the taxable income elasticity in designing an optimal tax system there are many studies attempting to estimate this elasticity. A problem with most of these studies is that strong functional form assumptions are used and that heterogeneity in preferences is not allowed for. Building on Blomquist and Newey (2002) we in this pap...
Article
This paper considers identification and estimation of ceteris paribus effects of continuous regressors in nonseparable panel models with time homogeneity. The effects of interest are derivatives of the average and quantile structural functions of the model. We find that these derivatives are identified with two time periods for ``stayers", i.e. for...
Article
Full-text available
In a recent paper, Hausman, Newey, Woutersen, Chao, and Swanson (2012) propose a new estimator, HFUL (Heteroscedasticity robust Fuller), for the linear model with endogeneity. This estimator is consistent and asymptotically normally distributed in the many instruments and many weak instruments asymptotics. Moreover, this estimator has moments, just...
Article
Full-text available
This chapter shows how a weighted average of a forward and reverse Jackknife IV estimator (JIVE) yields estimators that are robust against heteroscedasticity and many instruments. These estimators, called HFUL (Heteroscedasticity robust Fuller) and HLIM (Heteroskedasticity robust limited information maximum likelihood (LIML)) were introduced by Hau...
Article
In many economic models, objects of interest are functions which satisfy conditional moment restrictions. Economics does not restrict the functional form of these models, motivating nonparametric methods. In this paper we review identification results and describe a simple nonparametric instrumental variables (NPIV) estimator. We also consider a si...
Article
In this paper, we analyze properties of the Continuous Updating Estimator (CUE) proposed by Hansen et al. (1996), which has been suggested as a solution to the finite sample bias problems of the two-step GMM estimator. We show that the estimator should be expected to perform poorly in finite samples under weak identification, in particular, the est...
Article
In parametric, nonlinear structural models a classical sufficient condition for local identification, like Fisher (1966) and Rothenberg (1971), is that the vector of moment conditions is differentiable at the true parameter with full rank derivative matrix. We derive an analogous result for the nonparametric, nonlinear structural models, establishi...
Article
This note considers an asymptotic property of the class of closest moments estimators. Each such estimator is obtained by setting a vector of sample moments close to corresponding population moments. It is shown that each such estimator is asymptotically equivalent to a GMM estimator, which has a quadratic distance function. An implication of this...
Article
Asymptotic Properties of One-step Estimator Obtained from an Optimal Step-size - Volume 4 Issue 2 - Whitney K. Newey
Article
This paper gives a test of overidentifying restrictions that is robust to many instruments and heteroskedasticity. It is based on a jackknife version of the Sargan test statistic, having a numerator that is the objective function minimized by the JIVE2 estimator of Angrist, Imbens, and Krueger (1999). Correct asymptotic critical values are derived...
Article
Full-text available
Properties of GMM estimators are sensitive to the choice of instruments. Using many instruments leads to high asymptotic asymptotic efficiency but can cause high bias and/or variance in small samples. In this paper we develop and implement asymptotic mean square error (MSE) based criteria for instrumental variables to use for estimation of conditio...
Article
Full-text available
This essay discusses the issues of identification and estimation of the average treatment effect and the average effect of treatment on the treated.
Article
Full-text available
Instrumental variables are often associated with low estimator precision. This paper explores efficiency gains which might be achievable using moment conditions which are nonlinear in the disturbances and are based on flexible parametric families for error distributions. We show that these estimators can achieve the semiparametric efficiency bound...
Article
Full-text available
In the last fifteen years there has been much work on nonparametric identification of causal effects in settings with endogeneity. Earlier, researchers focused on linear systems with additive residuals. However, such systems are often difficult to motivate by economic theory. In many cases it is precisely the nonlinearity of the system and the pres...
Article
These volumes constitute the invited proceedings from the Ninth World Congress of the Econometric Society held on the campus of University College London on August 19–24, 2005. As co-chairs of the Program Committee for the Congress, one of our most pleasant tasks was to select topics and authors for fifteen invited symposia – each organized around...
Article
These volumes constitute the invited proceedings from the Ninth World Congress of the Econometric Society held on the campus of University College London on August 19–24, 2005. As co-chairs of the Program Committee for the Congress, one of our most pleasant tasks was to select topics and authors for fifteen invited symposia – each organized around...
Article
Properties of GMM estimators are sensitive to the choice of instrument. Using many instruments leads to high asymptotic asymptotic efficiency but can cause high bias and/or variance in small samples. In this paper we develop and implement asymptotic mean square error (MSE) based criteria for instrument selection in estimation of conditional moment...
Article
Full-text available
This paper uses control variables to identify and estimate models with nonseparable, multidimensional disturbances. Triangular simultaneous equations models are considered, with instruments and disturbances that are independent and a reduced form that is strictly monotonic in a scalar disturbance. Here it is shown that the conditional cumulative di...
Article
[enter Abstract Body]This paper derives the limiting distributions of alternative jackknife IV (JIV ) estimators and gives formulae for accompanying consistent standard errors in the presence of heteroskedasticity and many instruments. The asymptotic framework includes the many instrument sequence of Bekker (1994) and the many weak instrument seque...
Article
Full-text available
Using many moment conditions can improve efficiency but makes the usual generalized method of moments (GMM) inferences inaccurate. Two-step GMM is biased. Generalized empirical likelihood (GEL) has smaller bias, but the usual standard errors are too small in instrumental variable settings. In this paper we give a new variance estimator for GEL that...
Article
Full-text available
Nonseparable panel models are important in a variety of economic settings, including discrete choice. This paper gives identification and estimation results for nonseparable models under time homogeneity conditions that are like "time is randomly assigned" or "time is an instrument." Partial identification results for average and quantile effects a...
Article
Full-text available
Abstract This paper gives identiflcation and estimation results for marginal efiects in nonlinear panel models. We flnd that linear flxed efiects estimators are not consistent, due in part to marginal efiects not being identifled. We derive bounds for marginal efiects and show that they can tighten rapidly as the number of time series observations...
Article
These volumes constitute the invited proceedings from the Ninth World Congress of the Econometric Society held on the campus of University College London on August 19–24, 2005. As co-chairs of the Program Committee for the Congress, one of our most pleasant tasks was to select topics and authors for fifteen invited symposia – each organized around...
Article
This paper gives a relatively simple, well behaved solution to the problem of many instruments in heteroskedastic data. Such settings are common in microeconometric applications where many instruments are used to improve efficiency and allowance for heteroskedasticity is generally important. The solution is a Fuller (1977) like estimator and standa...
Article
Full-text available
This paper gives identification and estimation results for quantile and average effects in nonseparable panel models, when the distribution of period specific disturbances does not vary over time. Bounds are given for interesting effects with discrete regressors that are strictly exogenous or predetermined. We allow for location and scale time effe...
Article
Full-text available
This essay discusses the issues of identification and estimation of the average treatment effect and the average effect of treatment on the treated.
Article
Full-text available
Using many valid instrumental variables has the potential to improve efficiency but makes the usual inference procedures inaccurate. We give corrected standard errors, an extension of Bekker to nonnormal disturbances, that adjust for many instruments. We find that this adjustment is useful in empirical work, simulations, and in the asymptotic theor...
Article
Modeling choices that are both discrete and continuous is important in several settings. The purpose of this article is to explore formulation and identification of such models when indirect utility functions are specified nonparametrically. Here we consider general nonseparable disturbances. We give identification results for nonseparable sample s...
Article
Full-text available
It is common practice in econometrics to correct for heteroskedasticity.This paper corrects instrumental variables estimators with many instruments for heteroskedasticity.We give heteroskedasticity robust versions of the limited information maximum likelihood (LIML) and Fuller (1977, FULL) estimators; as well as heteroskedasticity consistent standa...
Article
There are many environments where knowledge of a structural relationship is required to answer questions of interest. Also, nonseparability of a structural disturbance is a key feature of many models. Here, we consider nonparametric identification and estimation of a model that is monotonic in a nonseparable scalar disturbance, which disturbance is...
Article
Full-text available
2SLS is by far the most-used estimator for the simultaneous equation problem. However, it is now well-recognized that 2SLS can exhibit substantial finite sample (second-order) bias when the model is over-identified and the first stage partial R2 is low. The initial recommendation to solve this problem was to do LIML, e.g.Bekker(1994) or Staiger and...
Article
This paper develops a new nonparametric series estimator for the average treatment effect for the case with unconfounded treatment assignment, that is, where selection for treatment is on observables. The new estimator is efficient. In addition we develop an optimal procedure for choosing the smoothing parameter, the number of terms in the series b...
Article
This paper develops a new efficient estimator for the average treatment effect, if selection for treatment is on observables. The new estimator is linear in the first-stage nonparametric estimator. This simplifies the derivation of the means squared error (MSE) of the estimator as a function of the number of basis functions that is used in the firs...
Article
Using many moment conditions can improve efficiency but makes the usual GMM inferences inaccurate. Two step GMM is biased. Generalized empirical likelihood (GEL) has smaller bias but the usual standard errors are too small. In this paper we use alternative asymptotics, based on many weak moment conditions, that addresses this problem. This asymptot...
Article
Thesis (Ph.D.)--Massachusetts Institute of Technology, Dept. of Economics, 1983. MICROFICHE COPY AVAILABLE IN ARCHIVES AND DEWEY Bibliography: leaves 169-170. by Whitney Kent Newey. Ph.D.
Article
This 2005 volume contains the papers presented in honor of the lifelong achievements of Thomas J. Rothenberg on the occasion of his retirement. The authors of the chapters include many of the leading econometricians of our day, and the chapters address topics of current research significance in econometric theory. The chapters cover four themes: id...
Chapter
Full-text available
This papers studies and compares the asymptotic bias of GMM and generalized empirical likelihood (GEL) estimators in the presence of estimated nuisance parameters. We consider cases in which the nuisance parameter is estimated from independent and identical samples. A simulation experiment is conducted for covariance structure models. Empirical lik...
Article
Fixed effects estimators of panel models can be severely biased because of the well-known incidental parameters problem. We show that this bias can be reduced by using a panel jackknife or an analytical bias correction motivated by large T. We give bias corrections for averages over the fixed effects, as well as model parameters. We find large bias...
Article
Conditional moment restrictions can be combined through GMM estimation to construct more efficient semiparametric estimators. This paper is about attainable efficiency for such estimators. We define and use a moment tangent set, the directions of departure from the truth allowed by the moments, to characterize when the semiparametric efficiency bou...
Article
In an effort to improve the small sample properties of generalized method of moments (GMM) estimators, a number of alternative estimators have been suggested. These include empirical likelihood (EL), continuous updating, and exponential tilting estimators. We show that these estimators share a common structure, being members of a class of generaliz...
Article
The purpose of this note is to show how semiparametric estimators with a small bias property can be constructed. The small bias property (SBP) of a semiparametric estimator is that its bias converges to zero faster than the pointwise and integrated bias of the nonparametric estimator on which it is based. We show that semiparametric estimators base...
Article
Full-text available
Using many instruments can improve efficiency but makes the usual GMM in- ferences inaccurate. Two step GMM is biased. Generalized empirical likelihood (GEL) has smaller bias but the usual standard errors are too small. We address this problem by deriving the limiting distribution of GEL under alternative asymp- totics where the number of moments g...
Article
Full-text available
We consider games with incomplete information a la Harsanyi, where the payoff of a player depends on an unknown state of nature as well as on the profile of chosen actions. As opposed to the standard model, players' preferences over state--contingent utility vectors are represented by arbitrary functionals. The definitions of Nash and Bayes equilib...
Article
This paper is about efficient estimation and consistent tests of conditional moment restrictions. We use unconditional moment restrictions based on splines or other approximating functions for this purpose. Empirical likelihood estimation is particularly appropriate for this setting, because of its relatively low bias with many moment conditions. W...

Network

Cited By