Tom Abel’s research while affiliated with Kavli Institute for Particle Astrophysics and Cosmology, Stanford University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (230)


Figure 1. Shaded are all the query points that have a 1NN distance of ≤ í µí±Ÿ from the data points (in black). Thus, the fraction of the total box that is shaded gives us the CDF 1NN at this particular radius í µí±Ÿ (equation 8). This is a representative figure with circles in 2D, but the same argument holds with spheres in 3D.
Figure A1. í µí±‰ (í µí±Ÿ + Δí µí±Ÿ ) expressed as a sum; as written in equation A1. í µí±‰ (í µí±Ÿ ) is represented by the green region, í µí°´(í µí±Ÿ ) 4 í µí¼‹í µí±Ÿ 2 4 í µí¼‹ 3
Figure A2. Half of the cross sectional area of the toroid that contributes to í µí±‰ arc . The full volume is made by reflecting the blue figure about the vertical line AP, and then rotating it 360 • around the center C and axis OC.
Geometric Interpretations of the k-Nearest Neighbour Distributions
  • Preprint
  • File available

February 2025

·

10 Reads

Kwanit Gangopadhyay

·

Arka Banerjee

·

Tom Abel

The k-Nearest Neighbour Cumulative Distribution Functions are measures of clustering for discrete datasets that are fast and efficient to compute. They are significantly more informative than the 2-point correlation function. Their connection to N-point correlation functions, void probability functions and Counts-in-Cells is known. However, the connections between the CDFs and other geometric and topological spatial summary statistics are yet to be fully explored in the literature. This understanding will be crucial to find optimally informative summary statistics to analyse data from stage 4 cosmological surveys. We explore quantitatively the geometric interpretations of the kNN CDF summary statistics. We establish an equivalence between the 1NN CDF at radius r and the volume of spheres with the same radius around the data points. We show that higher kNN CDFs are equivalent to the volumes of intersections of k\ge k spheres around the data points. We present similar geometric interpretations for the kNN cross-correlation joint CDFs. We further show that the volume, or the CDFs, have information about the angles and arc lengths created at the intersections of spheres around the data points, which can be accessed through the derivatives of the CDF. We show this information is very similar to that captured by Germ Grain Minkowski Functionals. Using a Fisher analysis we compare the information content and constraining power of various data vectors constructed from the kNN CDFs and Minkowski Functionals. We find that the CDFs and their derivatives and the Minkowski Functionals have nearly identical information content. However, kNN CDFs are computationally orders of magnitude faster to evaluate. Finally, we find that there is information in the full shape of the CDFs, and therefore caution against using the values of the CDF only at sparsely sampled radii.

Download

Figure B1. The f Q (v Mpeak ) functions of the best-fit UM-SAGA (solid) and UM DR1 (dashed) models. The different colors represent the model relations at different redshifts. The new low-mass quenching mostly affects halos with v Mpeak  100 km s −1 .
The SAGA Survey. V. Modeling Satellite Systems around Milky Way–Mass Galaxies with Updated UniverseMachine

November 2024

·

9 Reads

The Astrophysical Journal

Yunchong Wang

·

Ethan O. Nadler

·

·

[...]

·

John F. Wu

Environment plays a critical role in shaping the assembly of low-mass galaxies. Here, we use the U niverse M achine (UM) galaxy–halo connection framework and Data Release 3 of the Satellites Around Galactic Analogs (SAGA) Survey to place dwarf galaxy star formation and quenching into a cosmological context. UM is a data-driven forward model that flexibly parameterizes galaxy star formation rates (SFRs) using only halo mass and assembly history. We add a new quenching model to UM, tailored for galaxies with m ⋆ ≲ 10 ⁹ M ⊙ , and constrain the model down to m ⋆ ≳ 10 ⁷ M ⊙ using new SAGA observations of 101 satellite systems around Milky Way (MW)–mass hosts and a sample of isolated field galaxies in a similar mass range from the Sloan Digital Sky Survey. The new best-fit model, “UM-SAGA,” reproduces the satellite stellar mass functions, average SFRs, and quenched fractions in SAGA satellites while keeping isolated dwarfs mostly star-forming. The enhanced quenching in satellites relative to isolated field galaxies leads the model to maximally rely on halo assembly to explain the observed environmental quenching. Extrapolating the model down to m ⋆ ∼ 10 6.5 M ⊙ yields a quenched fraction of ≳30% for isolated field galaxies and ≳80% for satellites of MW-mass hosts at this stellar mass. Spectroscopic surveys can soon test this specific prediction to reveal the relative importance of internal feedback, cessation of mass and gas accretion, satellite-specific gas processes, and reionization for the evolution of faint low-mass galaxies.


Small-scale signatures of primordial non-Gaussianity in k-nearest neighbour cumulative distribution functions

September 2024

·

2 Reads

·

8 Citations

Monthly Notices of the Royal Astronomical Society

Searches for primordial non-Gaussianity in cosmological perturbations are a key means of revealing novel primordial physics. However, robustly extracting signatures of primordial non-Gaussianity from non-linear scales of the late-time Universe is an open problem. In this paper, we apply k-Nearest Neighbour cumulative distribution functions, kNN-CDFs, to the quijote-png simulations to explore the sensitivity of kNN-CDFs to primordial non-Gaussianity. An interesting result is that for halo samples with Mh1014M_\mathrm{ h}\langle 10^{14} M_\odot h1h^{-1}, the kNN-CDFs respond to equilateral PNG in a manner distinct from the other parameters. This persists in the galaxy catalogues in redshift space and can be differentiated from the impact of galaxy modelling, at least within the halo occupation distribution (HOD) framework considered here. kNN-CDFs are related to counts-in-cells and, through mapping a subset of the kNN-CDF measurements into the count-in-cells picture, we show that our results can be modelled analytically. A caveat of the analysis is that we only consider the HOD framework, including assembly bias. It will be interesting to validate these results with other techniques for modelling the galaxy–halo connection, e.g. (hybrid) effective field theory or semi-analytical methods.


The AGORA High-resolution Galaxy Simulations Comparison Project. IV. Halo and Galaxy Mass Assembly in a Cosmological Zoom-in Simulation at z ≤ 2

June 2024

·

31 Reads

·

9 Citations

The Astrophysical Journal

In this fourth paper from the AGORA Collaboration, we study the evolution down to redshift z = 2 and below of a set of cosmological zoom-in simulations of a Milky Way mass galaxy by eight of the leading hydrodynamic simulation codes. We also compare this CosmoRun suite of simulations with dark matter-only simulations by the same eight codes. We analyze general properties of the halo and galaxy at z = 4 and 3, and before the last major merger, focusing on the formation of well-defined rotationally supported disks, the mass–metallicity relation, the specific star formation rate, the gas metallicity gradients, and the nonaxisymmetric structures in the stellar disks. Codes generally converge well to the stellar-to-halo mass ratios predicted by semianalytic models at z ∼ 2. We see that almost all the hydro codes develop rotationally supported structures at low redshifts. Most agree within 0.5 dex with the observed mass–metallicity relation at high and intermediate redshifts, and reproduce the gas metallicity gradients obtained from analytical models and low-redshift observations. We confirm that the intercode differences in the halo assembly history reported in the first paper of the collaboration also exist in CosmoRun , making the code-to-code comparison more difficult. We show that such differences are mainly due to variations in code-dependent parameters that control the time stepping strategy of the gravity solver. We find that variations in the early stellar feedback can also result in differences in the timing of the low-redshift mergers. All the simulation data down to z = 2 and the auxiliary data will be made publicly available.


Classical field approximation of ultralight dark matter: Quantum break times, corrections, and decoherence

April 2024

·

7 Reads

·

15 Citations

Physical Review D

The classical field approximation is widely used to better understand the predictions of ultralight dark matter. Here, we use the truncated Wigner approximation method to test the classical field approximation of ultralight dark matter. This method approximates a quantum state as an ensemble of independently evolving realizations drawn from its Wigner function. The method is highly parallelizable and allows the direct simulation of quantum corrections and decoherence times in systems many times larger than have been previously studied in reference to ultralight dark matter. Our study involves simulation of systems in 1, 2, and 3 spatial dimensions. We simulate three systems, the condensation of a Gaussian random field in three spatial dimensions, a stable collapsed object in three spatial dimensions, and the merging of two stable objects in two spatial dimensions. We study the quantum corrections to the classical field theory in each case. We find that quantum corrections grow exponentially during nonlinear growth with the timescale being approximately equal to the system dynamical time. In stable systems the corrections grow quadratically. We also find that the primary effect of quantum corrections is to reduce the amplitude of fluctuations on the de Broglie scale in the spatial density. Finally, we find that the timescale associated with decoherence due to gravitational coupling to baryonic matter is at least as fast as the quantum corrections due to gravitational interactions. These results are consistent with the predictions of the classical field theory being accurate.


Figure 2. The dark matter surface densities at z ∼ 2 with the halos identified by the ROCKSTAR halo finder drawn in white circles whose radii indicate 0.5R vir . Only the halos located within 300 comoving kpc from the host halo's center and those more massive than 10 7 h −1 M e in dark matter are drawn. Readers can readily see that the DMO runs have more satellite halos than the CosmoRuns. See Section 3.1 for more information.
The AGORA High-resolution Galaxy Simulations Comparison Project. V. Satellite Galaxy Populations in a Cosmological Zoom-in Simulation of a Milky Way–Mass Halo

March 2024

·

40 Reads

·

7 Citations

The Astrophysical Journal

We analyze and compare the satellite halo populations at z ∼ 2 in the high-resolution cosmological zoom-in simulations of a 10 ¹² M ⊙ target halo ( z = 0 mass) carried out on eight widely used astrophysical simulation codes ( Art-I , Enzo , Ramses , Changa , Gadget-3 , Gear , Arepo-t , and Gizmo ) for the AGORA High-resolution Galaxy Simulations Comparison Project. We use slightly different redshift epochs near z = 2 for each code (hereafter “ z ∼ 2”) at which the eight simulations are in the same stage in the target halo’s merger history. After identifying the matched pairs of halos between the CosmoRun simulations and the DMO simulations, we discover that each CosmoRun halo tends to be less massive than its DMO counterpart. When we consider only the halos containing stellar particles at z ∼ 2, the number of satellite galaxies is significantly fewer than that of dark matter halos in all participating AGORA simulations and is comparable to the number of present-day satellites near the Milky Way or M31. The so-called “missing satellite problem” is fully resolved across all participating codes simply by implementing the common baryonic physics adopted in AGORA and the stellar feedback prescription commonly used in each code, with sufficient numerical resolution (≲100 proper pc at z = 2). We also compare other properties such as the stellar mass–halo mass relation and the mass–metallicity relation. Our work highlights the value of comparison studies such as AGORA, where outstanding problems in galaxy formation theory are studied simultaneously on multiple numerical platforms.


The AGORA High-resolution Galaxy Simulations Comparison Project. VI. Similarities and Differences in the Circumgalactic Medium

February 2024

·

48 Reads

·

7 Citations

The Astrophysical Journal

We analyze the circumgalactic medium (CGM) for eight commonly-used cosmological codes in the AGORA collaboration. The codes are calibrated to use identical initial conditions, cosmology, heating and cooling, and star formation thresholds, but each evolves with its own unique code architecture and stellar feedback implementation. Here, we analyze the results of these simulations in terms of the structure, composition, and phase dynamics of the CGM. We show properties such as metal distribution, ionization levels, and kinematics are effective tracers of the effects of the different code feedback and implementation methods, and as such they can be highly divergent between simulations. This is merely a fiducial set of models, against which we will in the future compare multiple feedback recipes for each code. Nevertheless, we find that the large parameter space these simulations establish can help disentangle the different variables that affect observable quantities in the CGM, e.g., showing that abundances for ions with higher ionization energy are more strongly determined by the simulation’s metallicity, while abundances for ions with lower ionization energy are more strongly determined by the gas density and temperature.


Robust cosmological inference from non-linear scales with k -th nearest neighbor statistics

November 2023

·

7 Reads

·

12 Citations

Monthly Notices of the Royal Astronomical Society

We present the methodology for deriving accurate and reliable cosmological constraints from non-linear scales (<50 h−1 Mpc) with k-th nearest neighbor (kNN) statistics. We detail our methods for choosing robust minimum scale cuts and validating galaxy–halo connection models. Using cross-validation, we identify the galaxy–halo model that ensures both good fits and unbiased predictions across diverse summary statistics. We demonstrate that we can model kNNs effectively down to transverse scales of rp ∼ 3 h−1 Mpc and achieve precise and unbiased constraints on the matter density and clustering amplitude, leading to a 2% constraint on σ8. Our simulation-based model pipeline is resilient to varied model systematics, spanning simulation codes, halo finding, and cosmology priors. We demonstrate the effectiveness of this approach through an application to the Beyond-2p mock challenge. We propose further explorations to test more complex galaxy–halo connection models and tackle potential observational systematics.


Figure 4. The Fisher information of different statistics for σ 8 , m and w 0 when using DES Y3-like data. The power spectra and second moment probe only the Gaussian information and their contours o v erlap completely (the peach contour is hidden underneath the purple). Adding the third moment significantly impro v es the constraints, and the CDF, which approximately contains all moments, impro v es upon that a non-ne gligible but diminishing amount. The de generac y direction of second + third moments and the CDFs is also visibly different, and combining them leads to a further 20-30 per cent impro v ement in constraints. The black dashed lines in the diagonal panels show the 1D constraints from CDFs measured on a purely Gaussian field, and these are consistent with those from the other Gaussian statistics. The constraints are tabulated in Table 1 .
Beyond the 3rd moment: A practical study of using lensing convergence CDFs for cosmology with DES Y3

October 2023

·

34 Reads

·

20 Citations

Monthly Notices of the Royal Astronomical Society

Widefield surveys probe clustered scalar fields — such as galaxy counts, lensing potential, etc. — that are sensitive to different cosmological and astrophysical processes. Constraining such processes depends on the statistics that summarize the field. We explore the cumulative distribution function (CDF) as a summary of the galaxy lensing convergence field. Using a suite of N-body lightcone simulations, we show the CDFs’ constraining power is modestly better than the 2nd and 3rd moments, as CDFs approximately capture information from all moments. We study the practical aspects of applying CDFs to data, using the Dark Energy Survey (DES Y3) data as an example, and compute the impact of different systematics on the CDFs. The contributions from the point spread function and reduced shear approximation are 1%\lesssim 1\% of the total signal. Source clustering effects and baryon imprints contribute 1-10%. Enforcing scale cuts to limit systematics-driven biases in parameter constraints degrades these constraints a noticeable amount, and this degradation is similar for the CDFs and the moments. We detect correlations between the observed convergence field and the shape noise field at 13σ. The non-Gaussian correlations in the noise field must be modeled accurately to use the CDFs, or other statistics sensitive to all moments, as a rigorous cosmology tool.


Beyond the 3rd moment: A practical study of using lensing convergence CDFs for cosmology with DES Y3

August 2023

·

20 Reads

·

1 Citation

Widefield surveys of the sky probe many clustered scalar fields -- such as galaxy counts, lensing potential, gas pressure, etc. -- that are sensitive to different cosmological and astrophysical processes. Our ability to constrain such processes from these fields depends crucially on the statistics chosen to summarize the field. In this work, we explore the cumulative distribution function (CDF) at multiple scales as a summary of the galaxy lensing convergence field. Using a suite of N-body lightcone simulations, we show the CDFs' constraining power is modestly better than that of the 2nd and 3rd moments of the field, as they approximately capture the information from all moments of the field in a concise data vector. We then study the practical aspects of applying the CDFs to observational data, using the first three years of the Dark Energy Survey (DES Y3) data as an example, and compute the impact of different systematics on the CDFs. The contributions from the point spread function are 2-3 orders of magnitude below the cosmological signal, while those from reduced shear approximation contribute 1%\lesssim 1\% to the signal. Source clustering effects and baryon imprints contribute 110%1-10\%. Enforcing scale cuts to limit systematics-driven biases in parameter constraints degrades these constraints a noticeable amount, and this degradation is similar for the CDFs and the moments. We also detect correlations between the observed convergence field and the shape noise field at 13σ13\sigma. We find that the non-Gaussian correlations in the noise field must be modeled accurately to use the CDFs, or other statistics sensitive to all moments, as a rigorous cosmology tool.


Citations (59)


... Traditional analysis methods of N-point statistics beyond the power spectrum and bispectrum require large mock datasets for accurate covariance estimation. The same is true for alternative summary statistics such as k-nearest neighbours [18][19][20] or scattering wavelets [21][22][23][24][25]. Simulations-based inference methods and field-level analyses require generating many accurate realizations of the late-time density field for constraining model parameters and initial conditions reconstruction. As next-generation galaxy survey data become available from DESI [26,27], Euclid [28], LSST at the Vera C. Rubin Observatory [29], SPHEREx [30], and Subaru Prime Focus Spectrograph [31] ...

Reference:

Field-level emulation of cosmic structure formation with cosmology and redshift dependence
Small-scale signatures of primordial non-Gaussianity in k-nearest neighbour cumulative distribution functions
  • Citing Article
  • September 2024

Monthly Notices of the Royal Astronomical Society

... For this reason, systematically exploring the parameter space of simulated models, not to mention cosmological parameters and resolution, is unfeasible. This hinders a methodical comparison between the different simulation frameworks, which have mostly been conducted at the level of individual galaxies [40]. It is still an open question at what level different numerical schemes agree, with or without calibration to external observables and at different resolutions, and whether simulations are complete in the physics they are modeling. ...

The AGORA High-resolution Galaxy Simulations Comparison Project. IV. Halo and Galaxy Mass Assembly in a Cosmological Zoom-in Simulation at z ≤ 2

The Astrophysical Journal

... Owing to their high number densities, ultralight DM candidates are often described as behaving "classically" in laboratory settings [10,11,23] or when virialized inside the galaxy [24][25][26][27]. By this, one typically means that its statistics and dynamics can be accurately characterized by a classical field. ...

Classical field approximation of ultralight dark matter: Quantum break times, corrections, and decoherence
  • Citing Article
  • April 2024

Physical Review D

... Others restrict the FoF linking to be done on dark matter particles first, and subsequently rely on a post-processing step to assign stellar masses to each subhalo (e.g. Necib et al. 2019;Jung et al. 2024). In the main analysis of study, we use all particle types except black holes 7 during linking and unbinding, as per the recommended way of using rockstar-galaxies (Behroozi; private communication). ...

The AGORA High-resolution Galaxy Simulations Comparison Project. V. Satellite Galaxy Populations in a Cosmological Zoom-in Simulation of a Milky Way–Mass Halo

The Astrophysical Journal

... These subgrid models include prescriptions for gas cooling, star formation and feedback processes, and are a vital and active subject of research as they impact global galaxy features such as e.g. stellar masses (Crain et al. 2015;Bottrell et al. 2017;Wellons et al. 2023), morphologies (Snyder et al. 2015;Dubois et al. 2016;Correa et al. 2017;Rodriguez-Gomez et al. 2019;Bluck et al. 2019;Wang et al. 2019;Gensior et al. 2024), the ISM and CGM structure (Bournaud et al. 2010;Roca-Fabrega et al. 2021;Bieri et al. 2023;Strawn et al. 2024). ...

The AGORA High-resolution Galaxy Simulations Comparison Project. VI. Similarities and Differences in the Circumgalactic Medium

The Astrophysical Journal

... Traditional analysis methods of N-point statistics beyond the power spectrum and bispectrum require large mock datasets for accurate covariance estimation. The same is true for alternative summary statistics such as k-nearest neighbours [18][19][20] or scattering wavelets [21][22][23][24][25]. Simulations-based inference methods and field-level analyses require generating many accurate realizations of the late-time density field for constraining model parameters and initial conditions reconstruction. As next-generation galaxy survey data become available from DESI [26,27], Euclid [28], LSST at the Vera C. Rubin Observatory [29], SPHEREx [30], and Subaru Prime Focus Spectrograph [31] ...

Robust cosmological inference from non-linear scales with k -th nearest neighbor statistics

Monthly Notices of the Royal Astronomical Society

... Additional state-of-the-art higher-order approaches are discussed and reviewed by Ajani [12], and include Minkowski functionals [13], weak lensing peak counts and minimum counts [14], scattering transforms [15], wavelet phase harmonics [16], homology statistics [17], and PDF/CDFs [18] [19]. Finally, deep learning approaches at the field level have also been developed and applied to lensing mass maps [20][21] [22]. ...

Beyond the 3rd moment: A practical study of using lensing convergence CDFs for cosmology with DES Y3

Monthly Notices of the Royal Astronomical Society

... The local number density is given by the ensemble average over streams: The number density is shown Fig. 2(d). Since the phases are completely uncorrelated for a given energy, the stream ensemble average leads to a number density with no coherent time variation, consistent with the results of Ref. [39]. Averaging the number density oscillations requires observing many particles from different streams. ...

Testing the mean field theory of scalar field dark matter

SciPost Physics Proceedings

... This motivates the search for new summary statistics that can effectively extract information from the non-linear small scales. Some examples of these beyond-two-point summary statistics include: the bispectrum and other higher -point functions (Sefusatti et al. 2006;Hahn et al. 2020), one-point PDFs (Uhlemann et al. 2020), -nearest neighbor ( NN) distance distributions (Banerjee & Abel 2021a;Yuan et al. 2023), various topological summary statistics (Biagetti et al. 2021;Cisewski-Kehe et al. 2022;Ouellette et al. 2023;Yip et al. 2024), wavelet scattering transforms (Valogiannis & Dvorkin 2022), as well as many others. ...

2D k -th nearest neighbour statistics: a highly informative probe of galaxy clustering
  • Citing Article
  • April 2023

Monthly Notices of the Royal Astronomical Society

... With the summary statistics measured from the Abacus mocks, we can train emulators, which are surrogate models used to efficiently and accurately give predictions for the summary statistics, using flexible machine learning models such as the widely used neural networks [59,60,75,128] and Gaussian processes [112,115,[129][130][131][132][133][134][135]. ...

Full forward model of galaxy clustering statistics with AbacusSummit light cones
  • Citing Article
  • February 2023

Monthly Notices of the Royal Astronomical Society