Paul R. Rosenbaum’s research while affiliated with University of Pennsylvania and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (358)


A new design for observational studies applied to the study of the effects of high school football on cognition late in life
  • Article

December 2024

·

3 Reads

The Annals of Applied Statistics

Katherine Brumberg

·

Dylan S. Small

·

Paul R. Rosenbaum

Figure 1: Balance of the covariate "age" by contrast value, +1 or −1, in each of six type of blocks.
Figure 2: Balance of the covariate "age × LE" or x 3 × w ′ by contrast value, +1 or −1, in each of six type of blocks.
Figure 3: Duration of unemployment, with and without an increase in benefits duration (B/b) in block type 2 at high R and block type 5 at low R, after (a) and before (b). Contrast weights, h g = 1 or h g = −1, appear at the top. The right boxplot shows 1400 = 700+700 difference-in-difference estimates, pooling blocks of types 2 and 5, with symmetric transformation of the tails to better visualize the center of the distribution.
Effect Aliasing in Observational Studies
  • Preprint
  • File available

August 2024

·

29 Reads

In experimental design, aliasing of effects occurs in fractional factorial experiments, where certain low order factorial effects are indistinguishable from certain high order interactions: low order contrasts may be orthogonal to one another, while their higher order interactions are aliased and not identified. In observational studies, aliasing occurs when certain combinations of covariates -- e.g., time period and various eligibility criteria for treatment -- perfectly predict the treatment that an individual will receive, so a covariate combination is aliased with a particular treatment. In this situation, when a contrast among several groups is used to estimate a treatment effect, collections of individuals defined by contrast weights may be balanced with respect to summaries of low-order interactions between covariates and treatments, but necessarily not balanced with respect to summaries of high-order interactions between covariates and treatments. We develop a theory of aliasing in observational studies, illustrate that theory in an observational study whose aliasing is more robust than conventional difference-in-differences, and develop a new form of matching to construct balanced confounded factorial designs from observational data.

Download


Optimal refinement of strata to balance covariates

July 2024

·

3 Reads

Biometrics

What is the best way to split one stratum into two to maximally reduce the within-stratum imbalance in many covariates? We formulate this as an integer program and approximate the solution by randomized rounding of a linear program. A linear program may assign a fraction of a person to each refined stratum. Randomized rounding views fractional people as probabilities, assigning intact people to strata using biased coins. Randomized rounding is a well-studied theoretical technique for approximating the optimal solution of certain insoluble integer programs. When the number of people in a stratum is large relative to the number of covariates, we prove the following new results: (i) randomized rounding to split a stratum does very little randomizing, so it closely resembles the linear programming relaxation without splitting intact people; (ii) the linear relaxation and the randomly rounded solution place lower and upper bounds on the unattainable integer programming solution; and because of (i), these bounds are often close, thereby ratifying the usable randomly rounded solution. We illustrate using an observational study that balanced many covariates by forming matched pairs composed of 2016 patients selected from 5735 using a propensity score. Instead, we form 5 propensity score strata and refine them into 10 strata, obtaining excellent covariate balance while retaining all patients. An R package optrefine at CRAN implements the method. Supplementary materials are available online.


Exposure to Operative Anesthesia in Childhood and Subsequent Neurobehavioral Diagnoses: A Natural Experiment Using Appendectomy

May 2024

·

34 Reads

·

3 Citations

Anesthesiology

Jeffrey H Silber

·

Paul R Rosenbaum

·

·

[...]

·

Background Observational studies of anesthetic neurotoxicity may be biased because children requiring anesthesia commonly have medical conditions associated with neurobehavioral problems. This study takes advantage of a natural experiment associated with appendicitis, in order to determine if anesthesia and surgery in childhood were specifically associated with subsequent neurobehavioral outcomes. Methods We identified 134,388 healthy children with appendectomy and examined the incidence of subsequent externalizing or behavioral disorders (conduct, impulse control, oppositional defiant, or attention-deficit/hyperactivity disorder); or internalizing or mood/anxiety disorders (depression, anxiety, or bipolar disorder) when compared to 671,940 matched healthy controls as identified in Medicaid data between 2001-2018. For comparison, we also examined 154,887 otherwise healthy children admitted to the hospital for pneumonia, cellulitis, and gastroenteritis, of which only 8% received anesthesia, and compared them to 774,435 matched healthy controls. We also examined the difference-in-differences between matched appendectomy patients and their controls and matched medical admission patients and their controls. Results Compared to controls, children with appendectomy were more likely to have subsequent behavioral disorders (the hazard ratio (HR) was 1.04 (95% CI 1.01, 1.06), P = 0.0010), and mood/anxiety disorders (HR: 1.15 (95% CI 1.13, 1.17), P < 0.0001). Relative to controls, children with medical admissions were also more likely to have subsequent behavioral (HR: 1.20 (95% CI 1.18, 1.22), P < 0.0001), and mood/anxiety (HR: 1.25 (95% CI 1.23, 1.27), P < 0.0001) disorders. Comparing the difference between matched appendectomy patients and their matched controls to the difference between matched medical patients and their matched controls, medical patients had more subsequent neurobehavioral problems than appendectomy patients. Conclusions Although there is an association between neurobehavioral diagnoses and appendectomy, this association is not specific to anesthesia exposure, and is stronger in medical admissions. Medical admissions, generally without anesthesia exposure, displayed significantly higher rates of these disorders than appendectomy-exposed patients.


Assessing the Ambulatory Surgery Center Volume-Outcome Association

January 2024

·

26 Reads

·

1 Citation

JAMA SURGERY

Importance In surgical patients, it is well known that higher hospital procedure volume is associated with better outcomes. To our knowledge, this volume-outcome association has not been studied in ambulatory surgery centers (ASCs) in the US. Objective To determine if low-volume ASCs have a higher rate of revisits after surgery, particularly among patients with multimorbidity. Design, Setting, and Participants This matched case-control study used Medicare claims data and analyzed surgeries performed during 2018 and 2019 at ASCs. The study examined 2328 ASCs performing common ambulatory procedures and analyzed 4751 patients with a revisit within 7 days of surgery (defined to be either 1 of 4735 revisits or 1 of 16 deaths without a revisit). These cases were each closely matched to 5 control patients without revisits (23 755 controls). Data were analyzed from January 1, 2018, through December 31, 2019. Main Outcomes and Measures Seven-day revisit in patients (cases) compared with the matched patients without the outcome (controls) in ASCs with low volume (less than 50 procedures over 2 years) vs higher volume (50 or more procedures). Results Patients at a low-volume ASC had a higher odds of a 7-day revisit vs patients who had their surgery at a higher-volume ASC (odds ratio [OR], 1.21; 95% CI, 1.09-1.36; P = .001). The odds of revisit for patients with multimorbidity were higher at low-volume ASCs when compared with higher-volume ASCs (OR, 1.57; 95% CI, 1.27-1.94; P < .001). Among patients with multimorbidity in low-volume ASCs, for those who underwent orthopedic procedures, the odds of revisit were 84% higher (OR, 1.84; 95% CI, 1.36-2.50; P < .001) vs higher-volume centers, and for those who underwent general surgery or other procedures, the odds of revisit were 36% higher (OR, 1.36; 95% CI, 1.01-1.83; P = .05) vs a higher-volume center. The findings were not statistically significant for patients without multimorbidity. Conclusions and Relevance In this observational study, the surgical volume of an ASC was an important indicator of patient outcomes. Older patients with multimorbidity should discuss with their surgeon the optimal location of their care.


Mortality Among Older Medical Patients at Flagship Hospitals and Their Affiliates

December 2023

·

14 Reads

·

2 Citations

Journal of General Internal Medicine

We define a “flagship hospital” as the largest academic hospital within a hospital referral region and a “flagship system” as a system that contains a flagship hospital and its affiliates. It is not known if patients admitted to an affiliate hospital, and not to its main flagship hospital, have better outcomes than those admitted to a hospital outside the flagship system but within the same hospital referral region. To compare mortality at flagship hospitals and their affiliates to matched control patients not in the flagship system but within the same hospital referral region. A matched cohort study The study used hospitalizations for common medical conditions between 2018-2019 among older patients age ≥ 66 years. We analyzed 118,321 matched pairs of Medicare patients admitted with pneumonia (N=57,775), heart failure (N=42,531), or acute myocardial infarction (N=18,015) in 35 flagship hospitals, 124 affiliates, and 793 control hospitals. 30-day (primary) and 90-day (secondary) all-cause mortality. 30-day mortality was lower among patients in flagship systems versus control hospitals that are not part of the flagship system but within the same hospital referral region (difference= -0.62%, 95% CI [-0.88%, -0.37%], P<0.001). This difference was smaller in affiliates versus controls (-0.43%, [-0.75%, -0.11%], P=0.008) than in flagship hospitals versus controls (-1.02%, [-1.46%, -0.58%], P<0.001; difference-in-difference -0.59%, [-1.13%, -0.05%], P=0.033). Similar results were found for 90-day mortality. The study used claims-based data. In aggregate, within a hospital referral region, patients treated at the flagship hospital, at affiliates of the flagship hospital, and in the flagship system as a whole, all had lower mortality rates than matched controls outside the flagship system. However, the mortality advantage was larger for flagship hospitals than for their affiliates.


Impact of Hospital Affiliation With a Flagship Hospital System on Surgical Outcomes

October 2023

·

11 Reads

·

2 Citations

Annals of Surgery

Objective To compare general surgery outcomes at flagship systems, flagship hospitals, and flagship hospital affiliates versus matched controls. Summary Background Data It is unknown whether flagship hospitals perform better than flagship hospital affiliates for surgical patients. Methods Using Medicare claims for 2018 to 2019, we matched patients undergoing inpatient general surgery in flagship system hospitals to controls who underwent the same procedure at hospitals outside the system but within the same region. We defined a “flagship hospital” within each region as the major teaching hospital with the highest patient volume that is also part of a hospital system; its system was labeled a “flagship system.” We performed 4 main comparisons: patients treated at any flagship system hospital versus hospitals outside the flagship system; flagship hospitals versus hospitals outside the flagship system; flagship hospital affiliates versus hospitals outside the flagship system; and flagship hospitals versus affiliate hospitals. Our primary outcome was 30-day mortality. Results We formed 32,228 closely matched pairs across 35 regions. Patients at flagship system hospitals (32,228 pairs) had lower 30-day mortality than matched control patients [3.79% vs. 4.36%, difference=−0.57% (−0.86%, −0.28%), P <0.001]. Similarly, patients at flagship hospitals (15,571/32,228 pairs) had lower mortality than control patients. However, patients at flagship hospital affiliates (16,657/32,228 pairs) had similar mortality to matched controls. Flagship hospitals had lower mortality than affiliate hospitals [difference-in-differences=−1.05% (−1.62%, −0.47%), P <0.001]. Conclusions Patients treated at flagship hospitals had significantly lower mortality rates than those treated at flagship hospital affiliates. Hence, flagship system affiliation does not alone imply better surgical outcomes.


A Second Evidence Factor for a Second Control Group

August 2023

·

7 Reads

Biometrics

In an observational study of the effects caused by a treatment, a second control group is used in an effort to detect bias from unmeasured covariates, and the investigator is content if no evidence of bias is found. This strategy is not entirely satisfactory: two control groups may differ significantly, yet the difference may be too small to invalidate inferences about the treatment, or the control groups may not differ yet nonetheless fail to provide a tangible strengthening of the evidence of a treatment effect. Is a firmer conclusion possible? Is there a way to analyze a second control group such that the data might report measurably strengthened evidence of cause and effect, that is, insensitivity to larger unmeasured biases? Evidence factor analyses are not commonly used with a second control group: most analyses compare the treated group to each control group, but analyses of that kind are partially redundant; so, they do not constitute evidence factors. An alternative analysis is proposed here, one that does yield two evidence factors, and with a carefully designed test statistic, is capable of extracting strong evidence from the second factor. The new technical work here concerns the development of a test statistic with high design sensitivity and high Bahadur efficiency in a sensitivity analysis for the second factor. A study of binge drinking as a cause of high blood pressure is used as an illustration.



Citations (53)


... Anxiety disorders are highly prevalent mental health conditions globally, impacting an estimated 264 million individuals, with varying rates among different demographics and a higher incidence in women (6,7). In the United States, approximately 31% of adults will experience an anxiety disorder at some stage in their lives (8). The burden of anxiety disorders on public health systems and society is substantial, manifested through increased healthcare utilization, reduced quality of life, and the presence of comorbidities (9). ...

Reference:

Exploring the genetic and socioeconomic interplay between ADHD and anxiety disorders using Mendelian randomization
Exposure to Operative Anesthesia in Childhood and Subsequent Neurobehavioral Diagnoses: A Natural Experiment Using Appendectomy
  • Citing Article
  • May 2024

Anesthesiology

... The comparison is insensitive to a bias that increases the odds of assignment to h g = 1 rather than h g = −1 by a factor of Γ = 1.6, as the upper bound on the P -value is then 0.035. These calculations use the methods in Rosenbaum (2018Rosenbaum ( , 2024 as implemented in the gwgtRank function of the weightedRank package in R with the default settings. In a matched pair, a bias of Γ = 1.6 is equivalent to an unobserved covariate that doubles the odds of treatment and increases the odds of a positive pair difference in outcome by 5-fold (Rosenbaum & Silber (2009)). ...

Bahadur Efficiency of Observational Block Designs
  • Citing Article
  • June 2023

... First, to validate the PSM model configuration, we conducted a placebo test and performed multiple linear regressions on the matched samples. Following the methodologies of Imbens (2014), Smith and Todd (2001), and Zubizarreta et al. (2023), we randomly shuffled all samples from the treatment group and then randomly selected half of them to serve as a pseudo-control group. Using the original PSM configuration, we recalculated the green premium results to determine if the observed treatment effects were due to statistical noise, coincidence, or model specification errors, which might lead to significant treatment effects even with a randomly constructed pseudo-control group. ...

Handbook of Matching and Weighting Adjustments for Causal Inference
  • Citing Book
  • March 2023

... However, they are in line with numbers being reported for participants taking part in a study using the ClinSearch Acceptability Score Test ® conducted by Vallet et al. [39]. The rather vulnerable profile of included old participants might be due to the recruitment site being a hospital, thereby including sicker people compared to the general population [40]. As older adults are commonly dependent on the intake of medicines, these rather vulnerable participants are a good point of reference. ...

Defining Multimorbidity in Older Patients Hospitalized with Medical Conditions
  • Citing Article
  • November 2022

Journal of General Internal Medicine

... Therefore, this study uses the DID model to analyze the impact of CBMP on GF based on a quasi-natural experiment in which the PBOC includes green bonds in the scope of eligible collateral for MLF. Furthermore, to alleviate the interference of the sample size disparity between the treatment and control groups on the research outcomes, this study employs the propensity score matching (PSM) method proposed by Rosenbaum and Rubin (2023) to identify control group enterprises with the closest corporate characteristics to those in the treatment group. To this end, we use the PSM-DID model to analyze the impact of CBMP on GF. ...

Propensity Scores in the Design of Observational Studies for Causal Effects
  • Citing Article
  • September 2022

Biometrika

... Because of this, recent methods have aimed to balance covariates directly, rather than using propensity score estimates as a conduit for balance. This includes estimating propensity scores to optimize balance rather than model likelihood (Imai & Ratkovic, 2014;Vegetabile et al., 2020), algorithms that match subjects exactly on coarsened versions of covariates rather than propensity scores (Iacus et al., 2009(Iacus et al., , 2012, and algorithms that stratify, match, or reweight subjects such that covariate balance is optimized directly (Zubizarreta, 2012;Diamond & Sekhon, 2013;Zubizarreta et al., 2014;Chattopadhyay et al., 2020;Ben-Michael et al., 2021;Brumberg et al., 2022). ...

Using Randomized Rounding of Linear Programs to Obtain Unweighted Natural Strata that Balance Many Covariates
  • Citing Article
  • May 2022

Journal of the Royal Statistical Society Series A (Statistics in Society)

... These 1400 differences-in-differences are plotted in the final boxplot. To see the typical difference clearly, the vertical axis has its tails symmetrically transformed beyond ±β using a p = −1 transformation, where β is the 0.8 quantile of the absolute difference-in-differences (Rosenbaum (2022)). The median of the 1400 untransformed differences-in-differences is 2.3 weeks, the quartiles are −7.6 and 12.3, and the 10% and 90% points are −19.4 and 23.8. ...

A New Transformation of Treated-Control Matched-Pair Differences for Graphical Display
  • Citing Article
  • April 2022

The American Statistician

... Popular matching methods include Rosenbaum (1989), Iacus et al. (2011), and Diamond and Sekhon (2013). Recent optimal matching methods that capitalize on modern optimization using network flows include Hansen and Klopfer (2006), Pimentel et al. (2015), Yu et al. (2020), and Yu and Rosenbaum (2022). For recent optimization matching methods that leverage modern optimization using generic integer programming, see Zubizarreta (2012), Zubizarreta et al. (2014), and Cohn and Zubizarreta (2022). ...

Graded Matching for Large Observational Studies
  • Citing Article
  • March 2022

Journal of Computational and Graphical Statistics

... COVID-19, have also been described to immediately precede PD [25]. Similarly, the authors encountered people who developed PD shortly after undergoing surgery or anesthesia, although literature on this phenomenon is currently limited to studies examining the association between anesthesia and the subsequent risk of developing PD later in life, rather than exploring acute onsets of PD [26,27]. This hints at mechanisms that occur both during infections and psychological stress, such as increased inflammatory tone [28]. ...

Risk of Parkinson's disease after anaesthesia and surgery
  • Citing Article
  • January 2022

BJA British Journal of Anaesthesia

... An example is the effect of hormone replacement therapy (HRT) on cardiovascular disease (CVD), where multiple observational studies showed that HRT reduced the risk of CVD, 9,10 but subsequent randomized trials demonstrated that in fact HRT increases the risk of CVD. 11 If the original observational studies had conducted a sensitivity analysis, they would have found that an unmeasured confounder with a weak association with the exposure (odds ratio 1.13) would have been sufficient to explain away the observed protective association, 12 although it is worth noting that some controversy remains about the effect of HRT. 13 Before we proceed, it is important to clarify that we refer to sensitivity analyses as methodologies that aid in testing the extent to which varying violations of causal modeling assumptions would lead to different conclusions. This kind of sensitivity analysis must be distinguished from analyses that seek to test the extent to which statistical modeling assumptions would lead to different conclusions. ...

The information in covariate imbalance in studies of hormone replacement therapy
  • Citing Article
  • December 2021

The Annals of Applied Statistics