Comparing Population Distributions from Bin-aggregated Sample Data: An Application to Historical Height Data from France

Institut d'Anàlisi Econòmica (CSIC), Barcelona, Spain.
Economics and human biology (Impact Factor: 1.9). 05/2011; 9(4):419-37. DOI: 10.1016/j.ehb.2011.05.002
Source: PubMed


We develop a methodology to estimate underlying (continuous) population distributions from bin-aggregated sample data through the estimation of the parameters of mixtures of distributions that allow for maximal parametric flexibility. The statistical approach we develop enables comparisons of the full distributions of height data from potential army conscripts across France's 88 departments for most of the nineteenth century. These comparisons are made by testing for differences-of-means stochastic dominance. Corrections for possible measurement errors are also devised by taking advantage of the richness of the data sets. Our methodology is of interest to researchers working on bin-aggregated or histogram-type data, something that is still widely done since much of the information that is publicly available is in that form, often due to restrictions based on confidentiality concerns.

Download full-text


Available from: David E. Sahn,
  • [Show abstract] [Hide abstract]
    ABSTRACT: Since the pioneering study of Le Roy Ladurie and his team, the idea that mean height can be considered as a reliable indicator of the standard of living has emerged from a long debate among historians and economists. Considering height in this respect, nineteenth-century France, unlike most Western countries, did not pay an urban penalty. Thanks to a substantial set of individual data (105,324 observations), based on the draft lottery of Frenchmen born in the year 1848, we are able to prove that this “French exception” did not, in fact, exist. The larger the town, the shorter were the conscripts. Among the towns, Paris had the shortest conscripts. By combining individual data with the agricultural survey of 1852, we are able to identify those factors that compensated for this urban penalty—that were positively correlated with height: nutritional availability, the literacy rate, and life expectancy.
    Cliometrica 01/2014; 8(1). DOI:10.1007/s11698-013-0095-1 · 0.48 Impact Factor
  • Source
    [Show abstract] [Hide abstract]
    ABSTRACT: This paper discusses the many dimensions of health inequality, a multi-faceted concept, which examines the dispersion in the distribution of health spending, the provision of health services, health capabilities and health outcomes. The underlying concern is motivated by issues of both social justice and economic efficiency, recognizing health's central role as a condition of human existence. While the paper includes a wide-ranging discussion of conceptual issues, most of it is devoted to introducing approaches to empirical analysis of health inequality, ranging from fiscal incidence to decomposition analysis. It also focuses on the distinction between the univariate and gradient approaches. The former involves making comparisons of cardinal or scalar indicators of health inequality and distributions of health, regardless of whether health is correlated with welfare measured along other dimensions. This univariate approach measures pure inequalities in health in a fashion that is similar to what is done for income distribution. In contrast, the gradient approach generally focuses on making comparisons of health across populations with different social and economic characteristics.
    African Development Review 12/2012; 24(4). DOI:10.1111/1467-8268.12001 · 0.70 Impact Factor