Amit Singer’s research while affiliated with Princeton University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (27)


Fast Expansion Into Harmonics on the Ball
  • Article

April 2025

·

4 Reads

·

2 Citations

SIAM Journal on Scientific Computing

·

Nicholas F. Marshall

·

·

Amit Singer

Fast alignment of heterogeneous images in sliced Wasserstein distance

March 2025

·

4 Reads

Many applications of computer vision rely on the alignment of similar but non-identical images. We present a fast algorithm for aligning heterogeneous images based on optimal transport. Our approach combines the speed of fast Fourier methods with the robustness of sliced probability metrics and allows us to efficiently compute the alignment between two L×LL \times L images using the sliced 2-Wasserstein distance in O(L2logL)O(L^2 \log L) operations. We show that our method is robust to translations, rotations and deformations in the images.


Cryo-EM heterogeneity analysis using regularized covariance estimation and kernel regression

February 2025

·

10 Reads

·

9 Citations

Proceedings of the National Academy of Sciences

Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures specimens in noncrystalline conditions and enables high-resolution reconstructions. However, analyzing the heterogeneous distributions of conformations from cryo-EM data is challenging. We present RECOVAR, a method for analyzing these distributions based on principal component analysis (PCA) computed using a REgularized COVARiance estimator. RECOVAR is fast, robust, interpretable, expressive, and competitive with state-of-the-art neural network methods on heterogeneous cryo-EM datasets. The regularized covariance method efficiently computes a large number of high-resolution principal components that can encode rich heterogeneous distributions of conformations and does so robustly thanks to an automatic regularization scheme. The reconstruction method based on adaptive kernel regression resolves conformational states to a higher resolution than all other tested methods on extensive independent benchmarks while remaining highly interpretable. Additionally, we exploit favorable properties of the PCA embedding to estimate the conformational density accurately. This density allows for better interpretability of the latent space by identifying stable states and low free-energy motions. Finally, we present a scheme to navigate the high-dimensional latent space by automatically identifying these low free-energy trajectories. We make the code freely available at https://github.com/ma-gilles/recovar .


Subspace method of moments for ab initio 3-D single-particle Cryo-EM reconstruction

October 2024

·

10 Reads

Jeremy Hoskins

·

Yuehaw Khoo

·

·

[...]

·

Yuguan Wang

Cryo-electron microscopy (Cryo-EM) is a widely-used technique for recovering the 3-D structure of biological molecules from a large number of experimentally generated noisy 2-D tomographic projection images of the 3-D structure, taken from unknown viewing angles. Through computationally intensive algorithms, these observed images are processed to reconstruct the 3-D structures. Many popular computational methods rely on estimating the unknown angles as part of the reconstruction process, which becomes particularly challenging at low signal-to-noise ratio. The method of moments (MoM) offers an alternative approach that circumvents the estimation of viewing angles of individual projection images by instead estimating the underlying distribution of the viewing angles, and is robust to noise given sufficiently many images. However, the method of moments typically entails computing high-order moments of the projection images, incurring significant storage and computational costs. To mitigate this, we propose a new approach called the subspace method of moments (subspace MoM), which compresses the first three moments using data-driven low-rank tensor techniques as well as expansion into a suitable function basis. The compressed moments can be efficiently computed from the set of projection images using numerical quadrature and can be employed to jointly recover the 3-D structure and the distribution of viewing angles. We illustrate the practical applicability of the subspace MoM in numerical experiments using up to the third-order moment, which significantly improves the resolution of MoM reconstructions compared to previous approaches.


Fast expansion into harmonics on the ball

June 2024

·

7 Reads

We devise fast and provably accurate algorithms to transform between an N×N×NN\times N \times N Cartesian voxel representation of a three-dimensional function and its expansion into the ball harmonics, that is, the eigenbasis of the Dirichlet Laplacian on the unit ball in R3\mathbb{R}^3. Given ε>0\varepsilon > 0, our algorithms achieve relative 1\ell^1 - \ell^\infty accuracy ε\varepsilon in time O(N3(logN)2+N3logε2)O(N^3 (\log N)^2 + N^3 |\log \varepsilon|^2), while their dense counterparts have time complexity O(N6)O(N^6). We illustrate our methods on numerical examples.


Moment-based metrics for molecules computable from cryo-EM images
  • Article
  • Full-text available

February 2024

·

32 Reads

·

2 Citations

Biological Imaging

Single-particle cryogenic electron microscopy (cryo-EM) is an imaging technique capable of recovering the high-resolution three-dimensional (3D) structure of biological macromolecules from many noisy and randomly oriented projection images. One notable approach to 3D reconstruction, known as Kam’s method, relies on the moments of the two-dimensional (2D) images. Inspired by Kam’s method, we introduce a rotationally invariant metric between two molecular structures, which does not require 3D alignment. Further, we introduce a metric between a stack of projection images and a molecular structure, which is invariant to rotations and reflections and does not require performing 3D reconstruction. Additionally, the latter metric does not assume a uniform distribution of viewing angles. We demonstrate the uses of the new metrics on synthetic and experimental datasets, highlighting their ability to measure structural similarity.

Download

Illustration of the Fourier shell correlation computed from a downsampled signal (i.e., the SFSC)
The measured signal is split into even and odd voxels for each dimension and the SFSC is computed between the respective pairs. The reported FSC is taken to be the average of the three pairs.
Conditions on the statistics of the signal and noise under which the SFSC accurately estimates the FSC
Each panel shows the image, power spectrum and associated SFSC for a signal that satisfies or fails to satisfy both Assumption 1 and Assumption 2. The SNR was set to 15 for each image with additive Gaussian noise that decays with spatial frequency when specified. The FSC was computed for each case using two synthetic images generated with the same parameters but independent noise. aBsignal = 100 Ų, Bnoise = 0 Ų. Both assumptions are met and the SFSC accurately estimates the FSC. bBsignal = 100 Ų, Bnoise = 50 Ų. The noise is not white Gaussian and the SFSC overestimates the FSC. cBsignal = 0 Ų, Bnoise = 0 Ų. The noise is white Gaussian but the signal does not have rapid decay. The SFSC underestimates the FSC. dBsignal = 20 Ų, Bnoise = 10 Ų. Neither assumption is met and the SFSC fails to estimate the FSC. This figure demonstrates that the naive SFSC provides an accurate estimate of the FSC only if Assumption 1 and Assumption 2 are met.
Corrections required for the SFSC to accurately estimate the FSC
The image and corresponding power spectrum in each column were generated with a specified SNR and a B-factor on both the signal and noise to exemplify each case. The FSC was computed for each case using two synthetic images generated with the same parameters but independent noise. a Phase shift correction, SNR = 10⁵, Bsignal = 150 Ų, Bnoise = 0 Ų. If the phase shift induced by downsampling is not corrected, the 2-D SFSC reduces to J0, a scaled zeroth order Bessel function of the first kind (see Supplementary Note 4). b Correction for the scaled variance, SNR = 15, Bsignal = 100 Ų, Bnoise = 0 Ų. Both assumptions on the signal and noise are met. The SFSC estimates the FSC according to Eq. (12) after adjusting for the scaled variance. c Whitening transform, SNR = 15, Bsignal = 100 Ų, Bnoise = 50 Ų. After applying a whitening transform, the SFSC estimates the FSC. d Upsampling, SNR = 10, Bsignal = 10 Ų, Bnoise = 0 Ų. If the signal does not have rapid decay but has been whitened, the SFSC estimates the FSC only after upsampling. These correcting factors extend the applicability of the SFSC.
Global resolution estimates from single maps
The SFSC is computed for each half map after applying the noise whitening and upsampling procedure. The noise is estimated by computing the spherically averaged power spectrum from the region outside a sphere encompassing the structure. The SFSC is approximately equal to the standard FSC for a EMD-24822 (grid points = 360³, voxel size = 1.05 Å), b EMD-13234 (grid points = 336³, voxel size = 1.7 Å) and c EMD-27648 (grid points = 416³, voxel size = 0.83 Å), but fails for d EMD-20278 (grid points = 288³, voxel size = 0.83 Å) due to the non-uniform noise which can be seen in the central slice images.
Denoising a reconstructed tomogram using the SFSC
a Slice of a reconstructed tomogram of C. elegans tissue from EMD-4869 (N × N = 928 × 928, pixel size = 13.7 Å). b Region of interest from a subsection of the tomogram (N × N = 464 × 464). c Slice of the tomogram selected vertically above the region of interest containing background noise. d Slice from the region of interest after applying a Wiener filter. e Conventional low-pass filter of the subsection at 66 Å; determined using the 1/7 threshold of the SFSC. Both the Wiener filtered and low-pass filtered images are displayed at a threshold of ±2 standard deviations of the pixel values. f SFSC computed from the tomogram subsection. g Spherically averaged power spectrum of the region of interest slice and the background noise slice. The Wiener filter computed from the SFSC provides significantly increased contrast compared to a low-pass filtering approach.
Self Fourier shell correlation: properties and application to cryo-ET

January 2024

·

87 Reads

·

5 Citations

Communications Biology

The Fourier shell correlation (FSC) is a measure of the similarity between two signals computed over corresponding shells in the frequency domain and has broad applications in microscopy. In structural biology, the FSC is ubiquitous in methods for validation, resolution determination, and signal enhancement. Computing the FSC usually requires two independent measurements of the same underlying signal, which can be limiting for some applications. Here, we analyze and extend on an approach to estimate the FSC from a single measurement. In particular, we derive the necessary conditions required to estimate the FSC from downsampled versions of a single noisy measurement. These conditions reveal additional corrections which we implement to increase the applicability of the method. We then illustrate two applications of our approach, first as an estimate of the global resolution from a single 3-D structure and second as a data-driven method for denoising tomographic reconstructions in electron cryo-tomography. These results provide general guidelines for computing the FSC from a single measurement and suggest new applications of the FSC in microscopy.


Self Fourier shell correlation: properties and application to cryo-ET

November 2023

·

57 Reads

The Fourier shell correlation (FSC) is a measure of the similarity between two signals computed over corresponding shells in the frequency domain and has broad applications in microscopy. In structural biology, the FSC is ubiquitous in methods for validation, resolution determination, and signal enhancement. Computing the FSC usually requires two independent measurements of the same underlying signal, which can be limiting for some applications. Here, we analyze and extend on an approach proposed by Koho et al. [1] to estimate the FSC from a single measurement. In particular, we derive the necessary conditions required to estimate the FSC from downsampled versions of a single noisy measurement. These conditions reveal additional corrections which we implement to increase the applicability of the method. We then illustrate two applications of our approach, first as an estimate of the global resolution from a single 3-D structure and second as a data-driven method for denoising tomographic reconstructions in electron cryo-tomography. These results provide general guidelines for computing the FSC from a single measurement and suggest new applications of the FSC in microscopy.


Figure A.6: Estimation of principal components and eigenvalues. (a) Sine of principal angles between the subspaces spanned by the estimated and exact top k eigenvectors for different estimators. The approximate SVD improves on the accuracy across all principal components compared to the inital one. (b) Eigenvalue estimates for the initial and approximate SVD estimator. The first estimator is on the correct order of magnitude but underestimates eigenvalues. A lower eigenvalue estimate is expected due to regularization. The second estimator is more accurate, especially for eigenvalues corresponding to well-estimated eigenvectors. (c) Visualization of the first six exact and estimated eigenvectors, showing increasing frequency and noise. Note that eigenvectors are unique up to sign, which accounts for the sign flip between exact and estimated.
A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation

November 2023

·

25 Reads

·

4 Citations

Proteins and the complexes they form are central to nearly all cellular processes. Their flexibility, expressed through a continuum of states, provides a window into their biological functions. Cryogenic-electron microscopy (cryo-EM) is an ideal tool to study these dynamic states as it captures specimens in non-crystalline conditions and enables high-resolution reconstructions. However, analyzing the heterogeneous distribution of conformations from cryo-EM data is challenging. Current methods face issues such as a lack of explainability, overfitting caused by lack of regularization, and a large number of parameters to tune; problems exacerbated by the lack of proper metrics to evaluate or compare heterogeneous reconstructions. To address these challenges, we present a white-box method based on principal component analysis (PCA) that can resolve intricate heterogeneity with similar expressive power to neural networks with significantly lower computational demands. We extend the ubiquitous Bayesian framework used in homogeneous reconstruction to automatically regularize principal components, overcoming overfitting concerns and removing the need for most parameters. We further exploit the conservation of density and distances endowed by the embedding in PCA space, opening the door to reliable free energy computation. We leverage the predictable uncertainty of image labels to generate high-resolution reconstructions and identify high-density trajectories in latent space. We make the code freely available at https://github.com/ma-gilles/recovar.



Citations (13)


... Since both estimators require computing ∥y − x ℓ ∥ over all candidate rotations, each with a complexity of O(d), the overall complexity of obtaining both estimators is at the same scale of O(Ld). In many cases, it is preferable to work in a transformed basis (where rotating the volume yields greater accuracy and reduces the effective d), though this may introduce an additional log d factor [25]. ...

Reference:

Bayesian Perspective for Orientation Estimation in Cryo-EM and Cryo-ET
Fast Expansion Into Harmonics on the Ball
  • Citing Article
  • April 2025

SIAM Journal on Scientific Computing

... We leverage a part-based Gaussian mixture model (GMM) of 3D density that enables CryoSPIRE to represent both conformational and compositional heterogeneity, unlike some existing deformation-based methods [13,33]. Further, it provides a naturally interpretable and physically plausible, part-based structure in contrast to existing latent variable methods based on linear density subspaces [10,32] or neural field models [19,20,47]. A key challenge with part-based GMMs concerns initialization and the discovery of parts. ...

Cryo-EM heterogeneity analysis using regularized covariance estimation and kernel regression
  • Citing Article
  • February 2025

Proceedings of the National Academy of Sciences

... The representation of viewing direction density can be further simplified by assuming invariance to inplane reflection. This can be realized by enforcing the spherical harmonic coefficients of odd degree in (62) to zero, i.e., b p,u = 0 if p is odd [75]. In this paper, we only assume the invariance to in-plane rotations but our method can be easily modified to accommodate the additional assumption. ...

Moment-based metrics for molecules computable from cryo-EM images

Biological Imaging

... In the absence of ground truth, the problem is more challenging, as any metric must distinguish between signal and noise, and cannot assume that noise is independently sampled between object pixels. For this reason, the single-image 'self-FRC' is often invalid in the setting of ptychography [52]. ...

Self Fourier shell correlation: properties and application to cryo-ET

Communications Biology

... Although the SFSC is not always applicable, there are many situations that can benefit from having an estimate of the SSNR from a single measurement. For example, in single particle cryo-EM, there are a growing number of methods which generate 3-D structures from manifold embeddings and do not produce independent half maps with which to compute the standard FSC [33][34][35][36] . Thus there is a need for alternative methods to estimate signal and noise statistics. ...

A Bayesian Framework for Cryo-EM Heterogeneity Analysis using Regularized Covariance Estimation

... In our motivating example -computing the continuous Fourier transform -the DHT arises from the discretization of the radially symmetric Fourier integral. The DHT also appears in a wide range of applications including imaging [18,42,28], statistics [26,14], and separation of variables methods in partial differential equations [6,2,43]. In many such applications, a fully nonuniform DHT is desired, as the relevant frequencies ω j may not be equispaced, and the most efficient quadrature rule for discretizing (1.3) may have nodes r k which are also not equispaced. ...

Fast Expansion into Harmonics on the Disk: A Steerable Basis with Fast Radial Convolutions
  • Citing Article
  • September 2023

SIAM Journal on Scientific Computing

... , X L ), where each X ℓ is an N ℓ × R ℓ matrix containing the spherical harmonic coefficients {X ℓ,m (r)}. This model is widely adopted in the cryo-EM literature, e.g., [14,25]. Remarkably, it was shown that the second moment of (5.1) is invariant to the tomographic projection [41,23] and thus it can be understood as a special case of the MRA model (1.2) with G = SO(3). ...

Autocorrelation analysis for cryo-EM with sparsity constraints: Improved sample complexity and projection-based algorithms

Proceedings of the National Academy of Sciences

... In these and other scenarios, the fundamental problem is determining the configuration of points based on partial information about inter-point distances. This problem is known as the Euclidean distance geometry (EDG) problem, which has numerous applications throughout the applied sciences [6][7][8][9][10][11][12][13][14][15]. ...

Quantitatively Visualizing Bipartite Datasets

Physical Review X

... However, several gaps remain before the method can be fully applied to cryo-EM data. For example, experimental images exhibit distinct amplitude contrast [41] and are affected by contrast transfer functions (CTFs) that require correction, as well as by extremely high noise levels [42] that our current approach cannot robustly handle. Potential extensions include explicitly sampling translations, investigating distance measures that are robust to both deformations and noise, and developing a three-dimensional version of the algorithm. ...

Fast Principal Component Analysis for Cryo-EM Images

Biological Imaging

... However, several gaps remain before the method can be fully applied to cryo-EM data. For example, experimental images exhibit distinct amplitude contrast [41] and are affected by contrast transfer functions (CTFs) that require correction, as well as by extremely high noise levels [42] that our current approach cannot robustly handle. Potential extensions include explicitly sampling translations, investigating distance measures that are robust to both deformations and noise, and developing a three-dimensional version of the algorithm. ...

Ab-initio Contrast Estimation and Denoising of Cryo-EM Images
  • Citing Article
  • July 2022

Computer Methods and Programs in Biomedicine