Carl P. Schmertmann’s research while affiliated with Florida State University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (40)


Figure 1: Bias in e 0 from 1% under-registration of deaths or under-enumeration of population at different ages
Figure 2: Life expectancy bias if 1% of those with true age y have a reported age of x
Figure 3: Mortality rate bias caused by age misreporting on 1% of death or census records, using derivative formula
Figure 4: Sources of bias in mortality rate estimates
Figure 5: Ratios of estimated to true mortality, by age, if all census and death records are subject to age misstatement (P = Q = Π i , i ∈ {CR, AA, IN })

+4

Data errors in mortality estimation: Formal demographic analysis of under-registration, under-enumeration, and age misreporting
  • Article
  • Full-text available

August 2024

·

53 Reads

Demographic Research

Carl Schmertmann

·

·

Download

Figure 2: Unpenalized spline s = Bθ with 36 degrees of freedom: maximum likelihood fit for a simulated small-area sample with known mortality rates. True mortality rates, in grey, are HMD rates for Portuguese females 1970-1979
Figure 4: Alternative fits to simulated small-population mortality data from Figure 3. P-spline fits are minimum-BIC curves
Figure 5: Median errors and 10-90% intervals by age, population size, and fitting method. Each plotted point is the average error lnˆµlnˆ lnˆµ x − ln µ true
D-splines: Estimating rate schedules using high-dimensional splines with empirical demographic penalties

June 2021

·

25 Reads

·

4 Citations

Demographic Research

Background: High-dimensional parametric models with penalized likelihood functions strike a good balance between bias and variance for estimating continuous age schedules from large samples. The penalized spline (P-spline) approach is particularly useful for these purposes, but it in small samples it can often produce implausible age schedule estimates. Objective: I propose and evaluate a new type of P-spline model for estimating demographic rate schedules. These estimators, which I call D-splines, regularize and smooth high-dimensional splines by using demographic patterns rather than generic mathematical rules. Methods: I compare P-spline estimates of age-specific mortality rates to three alternative D-spline estimators, over a large number of simulated small populations with known rates. The penalties for the D-spline estimators are derived from patterns in the Human Mortality Database. Results: For mortality estimates in small populations, D-spline estimators generally have lower errors than standard P-splines. Conclusions: Using penalties based on demographic information about patterns and variability in rate schedules improves P-spline estimators for small populations. Contribution: This paper expands demographers' toolkit by developing a new category of P-spline estimators that are more reliable for estimating mortality in small populations.


District-Level Life Expectancy in Germany

July 2020

·

47 Reads

·

48 Citations

Deutsches Ärzteblatt international

Background: Identifying regions with low life expectancy is important to policy makers, in particular for allocating resources in the health system. Life expectancy estimates for small regions are, however, often unreliable and lead to statistical uncertainties when the underlying populations are relatively small. Methods: We combine the most recent German data available (2015-2017) with a Bayesian model that includes several methodological advances. This allows us to estimate male and female life expectancy with good precision for all 402 German districts and to quantify the uncertainty of those estimates. Results: Across districts, life expectancy varies between 75.8 and 81.2 years for men and from 81.8 to 85.7 years for women. The spatial pattern is similar for women and men. Rural districts in eastern Germany and some districts of the Ruhr region have relatively low life expectancy. Districts with relatively high life expectancies cluster in Baden-Wuerttemberg and southern Bavaria. Exploratory analysis shows that average income, population density, and number of physicians per 100 000 inhabitants are not strongly correlated with life expectancy at district level. In contrast, indicators that point to particularly disadvantaged segments of the population (unemployment rate, welfare benefits) are better predictors of life expectancy. Conclusions: We do not find a consistent urban-rural gap in life expectancy. Our results suggest that policies that improve living standards for poorer segment of the population are the most likely to reduce the existing differences in life expectancy.



Population Pyramids Yield Accurate Estimates of Total Fertility Rates

January 2020

·

107 Reads

·

12 Citations

Demography

The primary fertility index for a population, the total fertility rate (TFR), cannot be calculated for many areas and periods because it requires disaggregation of births by mother’s age. Here we discuss a flexible framework for estimating TFR using inputs as minimal as a population pyramid. We develop five variants, each with increasing complexity and data requirements. We test accuracy across a diverse set of data sources that comprise more than 2,400 fertility schedules with known TFR values, including the Human Fertility Database, Demographic and Health Surveys, U.S. counties, and nonhuman species. We show that even the simplest and least accurate variant has a median error of only 0.09 births per woman over 2,400 fertility schedules, suggesting accurate TFR estimation over a wide range of demographic conditions. We anticipate that this framework will extend fertility analysis to new subpopulations, periods, geographies, and even species. To demonstrate the framework’s utility in new applications, we produce subnational estimates of African fertility levels, reconstruct historical European TFRs for periods up to 150 years before the collection of detailed birth records, and estimate TFR for the United States conditional on race and household income.



Bayesian estimation of total fertility from a population's age–sex structure

June 2019

·

46 Reads

·

7 Citations

Statistical Modelling

We investigate a modern statistical approach to a classic deterministic demographic estimation technique. When vital event registration is missing or inadequate, it is possible to approximate a population's total fertility rate (TFR) from information about its distribution by age and sex. For example, if under-five child mortality is low then TFR is often close to seven times the child/woman ratio (CWR), the number of 0–4 year olds per 15–49-year-old woman. We analyse the formal relationship between CWR and TFR to identify sources of uncertainty in indirect estimates. We construct a Bayesian model for the statistical distribution of TFR conditional on the population's age–sex structure, in which unknown demographic quantities in the standard approximation are parameters with prior distributions. We apply the model in two case studies: to a small indigenous population in the Amazon region of Brazil that has extremely high fertility rates, and to the set of 159 counties in the US state of Georgia. A statistical approach yields important insights into the sources of error in indirect estimation, and their relative magnitudes.


Bayesian Estimation of Age-Specific Mortality and Life Expectancy for Small Areas With Defective Vital Records

July 2018

·

147 Reads

·

88 Citations

Demography

High sampling variability complicates estimation of demographic rates in small areas. In addition, many countries have imperfect vital registration systems, with coverage quality that varies significantly between regions. We develop a Bayesian regression model for small-area mortality schedules that simultaneously addresses the problems of small local samples and underreporting of deaths. We combine a relational model for mortality schedules with probabilistic prior information on death registration coverage derived from demographic estimation techniques, such as Death Distribution Methods, and from field audits by public health experts. We test the model on small-area data from Brazil. Incorporating external estimates of vital registration coverage though priors improves small-area mortality estimates by accounting for underregistration and automatically producing measures of uncertainty. Bayesian estimates show that when mortality levels in small areas are compared, noise often dominates signal. Differences in local point estimates of life expectancy are often small relative to uncertainty, even for relatively large areas in a populous country like Brazil.


Bayesian estimation of age-specific mortality and life expectancy for small areas with defective vital records

February 2018

·

16 Reads

·

1 Citation

We develop a Bayesian regression model for small-area mortality schedules that simultaneously addresses the problems of small local samples and underreporting of deaths. We combine a relational model for mortality schedules with probabilistic prior information on death registration coverage – derived from demographic estimation techniques such as Death Distribution Methods, and from field audits done by public health experts. We test the model on small-area data from Brazil. Incorporating external estimates of vital registration coverage though priors improves small-area mortality estimates by accounting for under-registration, and by automatically producing measures of uncertainty.


Figure 1 of 1
Estimating age- and sex-specific mortality rates for small areas with TOPALS regression: an application to Brazil in 2010

December 2016

·

279 Reads

·

48 Citations

Revista Brasileira de Estudos de População

A alta variabilidade dos dados nos registros vitais, em razão do baixo número de pessoas expostas, impõe sérios problemas para estimação da mortalidade por idade e sexo em pequenas áreas. Muitas abordagens atuais, incluindo as mais utilizadas no Brasil, estimam as taxas específicas de mortalidade assumindo pressupostos matemáticos rígidos sobre o verdadeiro padrão etário da mortalidade. Padronização indireta, por exemplo, assume que todas as áreas dentro de uma área maior (microrregiões em uma mesorregião, por exemplo) possuem um padrão de mortalidade idêntico, com diferença constante no nível das taxas logarítmicas por idade. Propomos um método estatístico mais flexível que combina regressão Poisson com um modelo relacional denominado TOPALS (DE BEER, 2012). Usamos o novo método para estimar as taxas específicas de mortalidade em pequenas áreas no Brasil (estados, mesorregiões, microrregiões e municípios) em 2010. Resultados para o estado de Minas Gerais mostram diferenças notáveis no padrão de mortalidade por idade entre pequenas áreas adjacentes, demonstrando as vantagens do uso de um método de estimação mais flexível.


Citations (34)


... 6 Table B.3 in Supplementary materials B. LE 35 for manual workers is 0.7 year lower when reintegrating the currently inactive: this value is smaller than the change over time which was estimated for this OCs) interpretable parameters, it imposes a rigid structure, and the choice of the most suitable model can vary significantly among different OCs. Some studies have therefore used and developed alternative models based on smoothing methods to reduce the randomness of the data without imposing a predetermined structure, thus preserving the richness of the dataset while enhancing interpretability [17,[25][26][27][28][29][30]. ...

Reference:

Unpacking occupational and sex divides to understand the moderate progress in life expectancy in recent years (France, 2010’s)
D-splines: Estimating rate schedules using high-dimensional splines with empirical demographic penalties

Demographic Research

... These methods have since been employed to estimate fertility, mortality, and migration at the global, regional, national, and subnational levels (e.g. Alexander et al., 2017;Alkema & New, 2014;Alkema et al., 2011;Assunção et al., 2005;Azose & Raftery, 2015;Leknes & Løkken, 2020;Schmertmann & Hauer, 2019;Schmertmann et al., 2013). Bryant and Zhang (2018) describe a Bayesian statistical framework which uses multiple, often unreliable, data sources to estimate and forecast entire demographic systems, including births, deaths, internal migration, and international migration) with age, sex, area, and time details. ...

Bayesian estimation of total fertility from a population's age–sex structure
  • Citing Article
  • June 2019

Statistical Modelling

... In order to minimise statistical uncertainties and random fluctuations in small districts with low death counts when calculating the district-specific life expectancies, the life tables were calculated with collapsed age groups and for a cumulative period of three calendar years. In the literature, there are also approaches that use estimation techniques for this purpose [48,49], and their results may differ from the approach used here, which should be considered when interpreting and comparing the results. ...

District-Level Life Expectancy in Germany
  • Citing Article
  • July 2020

Deutsches Ärzteblatt international

... Like above, the recast procedure consisted of calculating the number of speakers in each birth cohort that would have been alive at each point in the past given available population and mortality estimates (see Baseline Populations and Recast procedure). The average number of children, on the other hand, was obtained by applying the xTFR formula proposed by Hauer and Schmertmann [45] to counts that were randomly drawn from the distributions that define the baseline populations described above. Its form is Applying it to data referring to the entire population of a given language, we obtain the average number of child speakers per adult speaker instead of the average number of children per woman. ...

Population Pyramids Yield Accurate Estimates of Total Fertility Rates
  • Citing Article
  • January 2020

Demography

... A união de indivíduos brancos e amarelos 24,25 , se deu pelo fato de o último grupo corresponder a apenas 0,59% dos óbitos analisados e por apresentarem perfil de mortalidade similares. A redistribuição Bayesiana é empregada com o intuito de minimizar a subnotificação do registro de óbitos por uma causa específica, ocasionada pela classificação destes como óbitos por causas mal definidas ou muito generalistas (códigos garbage), bem como para suavizar possíveis flutuações decorrentes do número reduzido de casos, especialmente quando se trata de desfechos raros ou quando a área analisada não dispõe de um número amostral considerado suficientemente grande 26 . Neste estudo, de acordo com o Capítulo II da CID-10, os códigos garbage redistribuídos foram C76 (Neoplasia maligna de outras localizações e de localizações mal definidas), C79 (Neoplasia maligna secundária de outras localizações) e C80 (Neoplasia Maligna, sem Especificação de Localização) para todas as neoplasias e, adicionalmente, C55 (Neoplasia maligna do útero, porção não especificada) para a neoplasia do colo do útero. ...

Bayesian Estimation of Age-Specific Mortality and Life Expectancy for Small Areas With Defective Vital Records
  • Citing Article
  • July 2018

Demography

... However, estimating mortality rates in small areas is still challenging for demographers and epidemiologists in low-and middle-income countries, including Brazil [6][7][8][9][10]. These challenges are due to (1) a limited number of events and exposure in small areas which add stochastic error on the estimates [11][12][13][14][15][16][17], and (2) low levels of completeness of death registrations, which may be related to under-reporting of death counts in vital records, a kind of systematic error particularly common in developing countries [18][19][20][21][22]. ...

Estimating age- and sex-specific mortality rates for small areas with TOPALS regression: an application to Brazil in 2010

Revista Brasileira de Estudos de População

... They suggested the consideration of the exposure unit related to the event of interest (usually human population) and its incorporation into the Knox test to produce more reliable results. However, years later, Schmertmann (2015) found the persistence of some bias in the results produced by the KH test, which were then corrected with the provision of a new empirical test based on the Metropolis-Hastings algorithm (Hastings, 1970;Metropolis et al., 1953) that does not modify the spatial and temporal margins of the observed map, avoiding the bias that sometimes affects the KH test. In summary, overlooking spatiotemporal risk variations can bias the results yielded by the standard Knox test and lead to the over-detection of significant spatio-temporal intervals under which the nearrepeat phenomenon is not actually taking place, or to the under-detection of intervals that really are affected by the phenomenon (the over-detection or under-detection will depend on the direction of the bias). ...

Adjusting for population shifts and covariates in space-time interaction tests: Space-Time Interaction Tests
  • Citing Article
  • May 2015

Biometrics

... This comparative study seeks to provide insight into the appropriateness and dependability of these methods in various demographic contexts. The current body of research provides several techniques for predicting fertility, including principal component analysis, functional data models, time series models, and Bayesian approaches [4][5][6][7][8][9]. Alkema et al. (2011) conducted a thorough examination of different forecasting methods, emphasizing the significant challenges posed by uncertainty in demographic projections [10]. ...

Bayesian Forecasting of Cohort Fertility
  • Citing Article
  • June 2014

... The indirect estimates for demographic characteristics were among the pioneering tasks practiced by the United States of America (USA) established back in 1977. These attempts ultimately lead; publication of the United Nations (UN) Manual X (United Nations, 1983;Schmertmann et al., 2013). ...

Bayes plus Brass: Estimating Total Fertility for Many Small Areas from Sparse Census Data

Population Studies