Jeremiah Birrell

Jeremiah Birrell
University of Massachusetts Amherst | UMass Amherst · Department of Mathematics and Statistics

PhD

About

46
Publications
3,556
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
270
Citations
Citations since 2017
28 Research Items
230 Citations
201720182019202020212022202301020304050
201720182019202020212022202301020304050
201720182019202020212022202301020304050
201720182019202020212022202301020304050
Introduction
I am an applied mathematician, currently working in statistical learning theory. Current and past research includes: 1) Design and analysis of methods for enhancing the adversarial robustness of deep learning models. 2) Design and analysis of novel algorithms for generative adversarial networks. 3) Robustness bounds for quantities of interest in stochastic systems. 4) Singular limits and homogenization of stochastic differential equations. 5) Numerical methods in kinetic theory.

Publications

Publications (46)
Preprint
We introduce the $ARMOR_D$ methods as novel approaches to enhancing the adversarial robustness of deep learning models. These methods are based on a new class of optimal-transport-regularized divergences, constructed via an infimal convolution between an information divergence and an optimal-transport (OT) cost. We use these as tools to enhance adv...
Article
Full-text available
We offer a survey of the matter-antimatter evolution within the primordial Universe. While the origin of the tiny matter-antimatter asymmetry has remained one of the big questions in modern cosmology, antimatter itself has played a large role for much of the Universe’s early history. In our study of the evolution of the Universe we adopt the positi...
Preprint
Full-text available
We offer a survey of the matter-antimatter evolution within the primordial Universe. While the origin of the tiny matter-antimatter asymmetry has remained one of the big questions in modern cosmology, antimatter itself has played a large role for much of the Universe's early history. In our study of the evolution of the Universe we adopt the positi...
Preprint
Full-text available
We propose a new family of regularized R\'enyi divergences parametrized not only by the order $\alpha$ but also by a variational function space. These new objects are defined by taking the infimal convolution of the standard R\'enyi divergence with the integral probability metric (IPM) associated with the chosen function space. We derive a novel du...
Preprint
Full-text available
Generative adversarial networks (GANs), a class of distribution-learning methods based on a two-player game between a generator and a discriminator, can generally be formulated as a minmax problem based on the variational representation of a divergence between the unknown and the generated distributions. We introduce structure-preserving GANs as a...
Article
Full-text available
Variational representations of divergences and distances between high-dimensional probability distributions offer significant theoretical insights and practical advantages in numerous research areas. Recently, they have gained popularity in machine learning as a tractable and scalable approach for training probabilistic models and for statistically...
Article
Full-text available
We develop a rigorous and general framework for constructing information-theoretic divergences that subsume both f-divergences and integral probability metrics (IPMs), such as the 1-Wasserstein distance. We prove under which assumptions these divergences, hereafter referred to as (f, Γ)-divergences, provide a notion of 'distance' between probabilit...
Preprint
Full-text available
We develop a general framework for constructing new information-theoretic divergences that rigorously interpolate between $f$-divergences and integral probability metrics (IPMs), such as the Wasserstein distance. These new divergences inherit features from IPMs, such as the ability to compare distributions which are not absolute continuous, as well...
Article
Quantifying the impact of parametric and model-form uncertainty on the predictions of stochastic models is a key challenge in many applications. Previous work has shown that the relative entropy rate is an effective tool for deriving path-space uncertainty quantification (UQ) bounds on ergodic averages. In this work we identify appropriate informat...
Preprint
Full-text available
Distributionally robust optimization (DRO) is a widely used framework for optimizing objective functionals in the presence of both randomness and model-form uncertainty. A key step in the practical solution of many DRO problems is a tractable reformulation of the optimization over the chosen model ambiguity set, which is generally infinite dimensio...
Preprint
Full-text available
We derive a new variational formula for the R\'enyi family of divergences, $R_\alpha(Q\|P)$, generalizing the classical Donsker-Varadhan variational formula for the Kullback-Leibler divergence. The objective functional in this new variational representation is expressed in terms of expectations under $Q$ and $P$, and hence can be estimated using sa...
Preprint
Full-text available
Variational representations of distances and divergences between high-dimensional probability distributions offer significant theoretical insights and practical advantages in numerous research areas. Recently, they have gained popularity in machine learning as a tractable and scalable approach for training probabilistic models and statistically dif...
Article
Full-text available
We study the small-mass (overdamped) limit of Langevin equations for a particle in a potential and/or magnetic field with matrix-valued and state-dependent drift and diffusion. We utilize a bootstrapping argument to derive a hierarchy of approximate equations for the position degrees of freedom that are able to achieve accuracy of order \(m^{\ell /...
Preprint
Full-text available
Rare events, and more general risk-sensitive quantities-of-interest (QoIs), are significantly impacted by uncertainty in the tail behavior of a distribution. Uncertainty in the tail can take many different forms, each of which leads to a particular ambiguity set of alternative models. Distributional robustness bounds over such an ambiguity set cons...
Chapter
This paper studies homogenization of stochastic differential systems. The standard example of this phenomenon is the small mass limit of Hamiltonian systems. We consider this case first from the heuristic point of view, stressing the role of detailed balance and presenting the heuristics based on a multiscale expansion. This is used to propose a ph...
Preprint
Full-text available
In this paper we provide performance guarantees for hypocoercive non-reversible MCMC samplers our results apply in particular to the Langevin equation, Hamiltonian Monte-Carlo, and the bouncy particle and zigzag samplers. Specifically, we establish a concentration inequality of Bernstein type for ergodic averages of bounded observables. As a conseq...
Preprint
Full-text available
Variational-principle-based methods that relate expectations of a quantity of interest to information-theoretic divergences have proven to be effective tools for obtaining robustness bounds under both parametric and non-parametric model-form uncertainty. Here, we extend these ideas to utilize information divergences that are specifically targeted a...
Poster
Full-text available
A non-perturbative framework for robust uncertainty bounds via goal-oriented information theory.
Preprint
Full-text available
Information-theory based variational principles have proven effective at providing scalable uncertainty quantification (i.e. robustness) bounds for quantities of interest in the presence of non-parametric model-form uncertainty. In this work, we combine such variational formulas with functional inequalities (Poincar\'e, $\log$-Sobolev, Liapunov fun...
Preprint
Full-text available
We show that the non-integer effective number of neutrinos $N^{\mathrm{eff}}_\nu$ can be understood as an effect of lepton $L$ asymmetry in the early Universe carried by the Dirac neutrino cosmic background. We show that $N_\nu^{\mathrm{eff}}=3.36\pm0.34$ (CMB only) and $N_\nu^{\mathrm{eff}}= 3.62\pm0.25$ (CMB and $H_0$) require a ratio between bar...
Article
Full-text available
We investigate entropy production in the small-mass (or overdamped) limit of Langevin–Kramers dynamics. The results generalize previous works to provide a rigorous derivation that covers systems with magnetic field as well as anisotropic (i.e. matrix-valued) drag and diffusion coefficients that satisfy a fluctuation–dissipation relation with state-...
Preprint
Full-text available
We study the small-mass (overdamped) limit of Langevin equations for a particle in a potential and/or magnetic field with matrix-valued and state-dependent drift and diffusion. The present work generalizes prior derivations of the homogenized equation for the position degrees of freedom in the $m\to 0$ limit. Specifically, we develop a hierarchy of...
Preprint
Full-text available
Current precision big bang nucleosynthesis (BBN) studies motivate us to revisit the neutron lifespan in the plasma medium of the early universe. The mechanism we explore is the Fermi-blocking of decay electrons and neutrinos by plasma. As result, neutrons live longer and we find a significant 6.4\% modification of neutron abundance in the BBN era a...
Article
Full-text available
We investigate entropy production in the small mass (or overdamped) limit of Langevin-Kramers dynamics. Our results apply to systems with magnetic field as well as matrix valued drag and diffusion coefficients that satisfy a version of the fluctuation dissipation relation with state dependent temperature. In particular, we generalize the anomalous...
Article
Full-text available
This paper studies homogenization of stochastic differential systems. The standard example of this phenomenon is the small mass limit of Hamiltonian systems. We consider this case first from the heuristic point of view, stressing the role of detailed balance and presenting the heuristics based on a multiscale expansion. This is used to propose a ph...
Article
Full-text available
We study the dynamics of an inertial particle coupled to forcing, dissipation, and noise in the small mass limit. We derive an expression for the limiting (homogenized) joint distribution of the position and (scaled) velocity degrees of freedom. In particular, weak convergence of the joint distributions is established, along with a bound on the con...
Article
We study the dynamics of a class of Hamiltonian systems with dissipation, coupled to noise, in a singular (small mass) limit. We derive the homogenized equation for the position degrees of freedom in the limit, including the presence of a noise-induced drift term. We prove convergence to the solution of the homogenized equation in probability and,...
Article
Full-text available
In the context of the half-centenary of Hagedorn temperature and the statistical bootstrap model (SBM) we present a short account of how these insights coincided with the establishment of the hot big-bang model (BBM) and helped resolve some of the early philosophical difficulties. We then turn attention to the present day context and show the domin...
Article
Full-text available
We study damped geodesic motion of a particle of mass $m$ on a Riemannian manifold, in the presence of an external force and noise. Lifting the resulting stochastic differential equation to the orthogonal frame bundle, we prove that, as $m \to 0$, its solutions converge to solutions of a limiting equation which includes a noise-induced drift term....
Article
Full-text available
We study how the hot Universe evolves and acquires the prevailing vacuum state, demonstrating that in specific conditions which are believed to apply, the Universe becomes frozen into the state with the smallest value of Higgs vacuum field $v=\langle h\rangle$, even if this is not the state of lowest energy. This supports the false vacuum dark ener...
Article
We investigate a family of integrals involving modified Bessel functions that arise in the context of neutrino scattering. Recursive formulas are derived for evaluating these integrals and their asymptotic expansions are computed. We prove in certain cases that the asymptotic expansion yields the exact result after a finite number of terms. In each...
Article
Full-text available
We present a novel method for detecting the relic neutrino background that takes advantage of structured quantum degeneracy to amplify the drag force from neutrinos scattering off a detector. Developing this idea, we present a characterization of the present day relic neutrino distribution in an arbitrary frame, including the influence of neutrino...
Article
Full-text available
The effective number of neutrinos, , obtained from CMB fluctuations accounts for all effectively massless degrees of freedom present in the Universe, including but not limited to the three known neutrinos. Using a lattice-QCD derived QGP equation of state, we constrain the observed range of in terms of the freeze-out of unknown degrees of freedom n...
Article
We present a computer assisted method for generating existence proofs and a posteriori error bounds for solutions to two point boundary value problems (BVPs). All truncation errors are accounted for and, if combined with interval arithmetic to bound the rounding errors, the computer generated results are mathematically rigorous. The method is formu...
Article
In this dissertation, we study the evolution and properties of the relic (or cosmic) neutrino distribution from neutrino freeze-out at $T=O(1)$ MeV through the free-streaming era up to today, focusing on the deviation of the neutrino spectrum from equilibrium. In particular, we demonstrate the presence of chemical non-equilibrium that continues to...
Article
Full-text available
Analysis of cosmic microwave background radiation fluctuations favors an effective number of neutrinos, $N_\nu>3$. This motivates a reinvestigation of the neutrino freeze-out process. Here we characterize the dependence of $N_\nu$ on the Standard Model (SM) parameters that govern neutrino freeze-out. We show that $N_\nu$ depends on a combination $\...
Article
Full-text available
It is known that the freeze-out of light particles in the early Universe can lead to a non-integer contribution to the effective number of neutrinos, $N_{\text{eff}}$. Using a lattice-QCD derived quark-gluon plasma equations of state we show that freeze-out at the time when the quark-gluon deconfined phase froze into hadrons near $T=150$ MeV leads...
Article
We present a novel method to solve the spatially homogeneous and isotropic relativistic Boltzmann equation. We employ a basis set of orthogonal polynomials dynamically adapted to allow emergence of chemical non-equilibrium. Two time dependent parameters characterize the set of orthogonal polynomials, the effective temperature $T(t)$ and phase space...
Article
Full-text available
We study the present day relic neutrino distribution in the Earth frame and characterize the dependence on neutrino mass and freeze-out conditions in early Universe. We present explicitly the neutrino velocity and de Broglie wavelength distributions. We characterize the expected neutrino drag force ${\cal O}(G_F^2)$ of a spherical detector in the h...
Article
Full-text available
We perform a model independent study of the neutrino momentum distribution at freeze-out, treating the freeze-out temperature as a free parameter. Our results imply that measurement of neutrino reheating, as characterized by the measurement of the effective number of neutrinos Nν, amounts to the determination of the neutrino kinetic freeze-out temp...
Article
Full-text available
We clarify in a quantitative way the impact that distinct chemical $T_c$ and kinetic $T_k$ freeze-out temperatures have on the reduction of the neutrino fugacity $\Upsilon_\nu$ below equilibrium, i.e. $\Upsilon_\nu<1$, and the increase of the neutrino temperature $T_\nu$ via partial reheating. We establish the connection between $\Upsilon_\nu$ and...
Article
Full-text available
We present the Einstein-Vlasov solution for the momentum distribution of the relic free-streaming neutrinos. We show that it is possible to explain the rise in the effective number of neutrinos $N_\nu$ from those present at the end of the big bang nucleosynthesis (BBN) $N_\nu(T_{BBN})=3.71^{+0.47}_{-0.45}$ towards $N_\nu(T_{r})=4.34^{+.086}_{-0.88}...
Article
Full-text available
We study models of strong first order `low' temperature electroweak phase transition. To achieve this we propose a class of Higgs effective potential models which preserve the known features of the present day massive phase. However, the properties of the symmetry restored massless phase are modified in a way that for a large parameter domain we fi...
Article
Full-text available
We study a family of quadratic stochastic differential equations in the plane, motivated by applications to turbulent transport of heavy particles. Using Lyapunov functions, we find a critical parameter value $\alpha_{1}=\alpha_{2}$ such that when $\alpha_{2}>\alpha_{1}$ the system is ergodic and when $\alpha_{2}<\alpha_{1}$ solutions are not defin...

Network

Cited By