Rahul Mazumder’s research while affiliated with Massachusetts Institute of Technology and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (14)


Projected likelihood contrasts for testing homogeneity in finite mixture models with nuisance parameters
  • Article
  • Full-text available

June 2008

·

61 Reads

·

Rahul Mazumder

This paper develops a test for homogeneity in finite mixture models where the mixing proportions are known a priori (taken to be 0.5) and a common nuisance parameter is present. Statistical tests based on the notion of Projected Likelihood Contrasts (PLC) are considered. The PLC is a slight modification of the usual likelihood ratio statistic or the Wilk's Λ\Lambda and is similar in spirit to the Rao's score test. Theoretical investigations have been carried out to understand the large sample statistical properties of these tests. Simulation studies have been carried out to understand the behavior of the null distribution of the PLC statistic in the case of Gaussian mixtures with unknown means (common variance as nuisance parameter) and unknown variances (common mean as nuisance parameter). The results are in conformity with the theoretical results obtained. Power functions of these tests have been evaluated based on simulations from Gaussian mixtures.

Download

Fluid flow pattern analysis in a trough region: A nonparametric approach

June 2008

·

23 Reads

·

2 Citations

This paper aims at identifying statistically different circulation patterns characterising fluid flow in the trough region between two adjacent asymmetric waveforms, using the velocity data collected by 3D acoustic Doppler velocimeter. Statistical clustering has been performed using ideas originating from information theory and scale space theory in computer vision for splitting the trough region into different spatially connected segments (identifying the circulation bubble in the process) on the basis of circulation patterns. The paper attempts to visualise the fluid fluctuations in the trough region, with emphasis on the circulation region, by simulating the directional fluctuations of fluid particles from the kernel density estimates learned from the experimental data. The image representation of the estimate of the spatial turbulent kinetic energy (TKE) function reveals interesting features corresponding to the regions of high TKE, suggesting the possibilities for further research in this area along the lines of feature extraction and image analysis.


Clustering based on geometry and interactions of turbulence bursting rate processes in a trough region

June 2007

·

8 Reads

·

5 Citations

Environmetrics

This paper aims at understanding turbulence occurring due to fluid flow in the trough region between two artificial adjacent asymmetric waveforms in an open-channel using velocity data collected by 3-D Acoustic Doppler Velocimeter (ADV), concentrating on interactions among the turbulence bursting phenomena across different spatial locations. The Statistical Learning stage of the analysis begins with the identification and extraction of statistically informative and physically interpretable features in the geometry of the Bursting Rate Processes (BRP). Statistical measures characterising the differences among the concerned processes have been developed and used for splitting the trough region into different regions based on the geometry, structure and randomness in the BRP, using the principles of statistical clustering involving parametric, non-parametric techniques and ideas of information theoretic entropy. Experimental observations support the existence of certain BRP which may be considered to be dominant over the others in an almost global sense. The issue of identifying a single bursting phenomenon that changes its orientation strongly relative to others or its closest neighbour across all spatial locations or stretches of vertical heights for every horizontal location and its importance in the entire physical scenario in the light of the spatial clustering problem has been addressed and settled too. Copyright © 2007 John Wiley & Sons, Ltd.


Statistical characterization of circulation patterns and direction of turbulent flow over a waveform structure

August 2006

·

18 Reads

·

4 Citations

Environmetrics

The objective of this paper is to analyze statistically the turbulence characteristics of flow over an artificial waveform geometry in an open channel, using velocity data collected by 3-D acoustic doppler velocimeter (ADV). The joint distribution of velocity fluctuations have been studied and modeled using multivariate normal distribution, and its effectiveness is analyzed in the light of explaining turbulent bursting events, like sweeping and ejection. The relationship between the randomness in the directional variations of the 3-D velocity fluctuations and the intensity of circulation has been studied. We have exploited this relationship and used it in conjunction with the principles of statistical clustering for splitting the region of flow, into different spatially connected segments based on circulation patterns. Directional data analysis has been adopted to visualize the flow pattern over the entire test section, in particular, the circulation bubbles in the region of separated flow. Copyright © 2006 John Wiley & Sons, Ltd.


Citations (13)


... Although easy to apply, these methods may result in information loss or introduce significant bias [29]. In contrast, algorithms such as k-Nearest Neighbor, Expectation-Maximization, Matrix Factorization, and Multiple Imputation using Chained Equations [20,[30][31][32] consider multiple influencing parameters of the real system and their interrelationships as comprehensively as possible, thereby reducing imputation bias. Recently, interpolation models based on Generative Adversarial Networks (GANs) [33] have achieved higher accuracy; however, their training process may encounter issues such as mode collapse and difficulty in convergence. ...

Reference:

Reconstruction and Prediction of Chaotic Time Series with Missing Data: Leveraging Dynamical Correlations Between Variables
Matrix Completion and Low-Rank SVD via Fast Alternating Least Squares
  • Citing Article
  • October 2014

Journal of Machine Learning Research

... According to the data from Demographia (2023), there are a total of 44 megalopolises worldwide, with approximately half of them located in China and India. According to the data from the 19.87 million in its urban area, has become the first megacity in China [7,8]. However, the allocation of medical resources in the suburbs does not match the population size. ...

Assessing the Significance of Global and Local Correlations under Spatial Autocorrelation; a Nonparametric Approach
  • Citing Article
  • January 2014

Biometrics

Júlia Viladomat

·

Rahul Mazumder

·

·

[...]

·

... Hajek et al. (2010) attempted to determine the statistical characterization of grain-size distributions in natural rivers. Ghoshal et al. (2013) and Ghoshal and Pal (2014) did experiments on a sand-gravel mixture to understand the bed material character in the river form, sorting process and its role in controlling the sediment flux through the landscape. Pal and Ghoshal (2014a, 2014b, 2015 in a series of papers carried out experiments on sand-gravel mixture beds under different flow velocities to study the influence of bed roughness, flow velocity, and suspension height on the grain-size distribution in suspension. ...

Turbulence, suspension and downstream fining over a sand-gravel mixture bed

International Journal of Sediment Research

... In turbulent boundary layers, coherent structures with large flux events have been proposed to explain the "bursting" phenomena responsible for two types of eddy motions name as "ejections" and "sweeps" (Cantwell, 1981;Robinson, 1991). These events are traditionally detected by conditional sampling through quadrant analysis in the (x, z)-plane (Willmarth & Lu, 1972) and their statistics have been investigated for a variety of flows and wall-roughness conditions, for example, experiments in open-channel (Hurther & Lemmin, 2000;Hurther, Lemmin, & Terray, 2007;Mazumder, 2007;Mazumder, Pal, Ghoshal, & Ojha, 2009;Nakagawa & Nezu, 1977;Nelson, Shreve, McLean, & Drake, 1995;Ojha & Mazumder, 2008;Venditti & Bauer, 2005); in wind-tunnel (Raupach, 1981); under an-ice boundary layer (Fer, McPhee, & Sirevaag, 2004); in atmospheric boundary layers (Hurther & Lemmin, 2003;Katul, Kuhn, Schieldge, & Hsieh, 1997;Katul, Poggi, Cava, & Finnigan, 2006;Sterk, Jacobs, & van Boxel, 1998), and in scour around vertical circular cylinders (Debnath, Manik, & Mazumder, 2012;Kirkil, Constantinscu, & Ettema, 2008;Sarkar, Chakraborty, & Mazumder, 2015;Sarkar, Chakraborty, & Mazumder, 2016). Besides the dominant role of the sweeps close to a rough wall, an "equilibrium region" is often observed in fully developed turbulent flows (Krogstad, Antonia, & Browne, 1992). ...

Clustering based on geometry and interactions of turbulence bursting rate processes in a trough region
  • Citing Article
  • June 2007

Environmetrics

... (Lyn [4], Bennett and Best [5], Parsons et al. [6], Best [7], Poggi et al. [8], Ojha and Mazumder [9], Peet et al. [10], Stoesser et al. [11], Mazumder et al. [12], Keshavarzi et al. [13]). The turbulent flow and related bursting phenomena over an isolated asymmetric waveform structure were studied statistically by Mazumder and Mazumder [14] using the analysis of multivariate normal distribution. Mazumder [15,16] studied the turbulence of fluid flow in the trough region between a pair of adjacent asymmetric waveform structures using a statistical clustering technique based on geometry and interactions of turbulence bursting rate. ...

Statistical characterization of circulation patterns and direction of turbulent flow over a waveform structure
  • Citing Article
  • August 2006

Environmetrics

... In terms of DC structure in sparse regularization, several works have analyzed the use of the DC function x → ∥x∥ 1 − ∥x∥ 2 as a sparsity inducing regularizer [3,51] as its zeros correspond to 1-sparse vectors. Many popular nonconvex regularizers have also been shown to have a DC decomposition [12], such as SCAD [18], MCP [53], or the Logarithmic penalty [35]. ...

SparseNet: Coordinate Descent With Nonconvex Penalties

... Thus, the spectrum of a balanced signed graph G b is equivalent to the spectrum of the corresponding positive graph G + , and developed spectral filters for positive graphs can be reused for balanced signed graphs. However, existing GL methods computing balanced signed graphs are limited to sub-optimal two-step approaches 1 : first compute a signed graph from data using, for example, graphical lasso (GLASSO) [23], then balance the computed graph via ad-hoc and often computation-expensive balancing algorithms [24]- [27]. ...

The Graphical Lasso: New Insights and Alternatives

Electronic Journal of Statistics

... Hence, using this information as user covariates helps in improving predictions for explicit ratings. Further, one can derive an item graph where edge weights represent movie similarities that are based on global "who-rated-what" matrix (Kouki et al., 2015;Wang et al., 2015;Agarwal et al., 2011;Mazumder and Agarwal, 2011). Imposing sparsity on such a graph and finding its fair communities is attractive since it is intuitive that an item is generally related to only a few other items. ...

Modeling item--item similarities for personalized recommendations on Yahoo! front page

The Annals of Applied Statistics

... We follow the experimental setting in [15,12,11] to generate data and perform synthetic experiments on multivariate Gaussians. Each off-diagonal entry of the precision matrix is drawn from a uniform distribution, i.e., Θ * ij ∼ U(−1, 1), and then set to zero with probability p = 1 − s, where s means the sparsity level (refer to Appendix C.1). ...

A Flexible, Scalable and Efficient Algorithmic Framework for Primal Graphical Lasso
  • Citing Article
  • October 2011

... This is of interest because L R = O q implies that there are no across-block edges so that the two groups are independent. For the case where λ 2 = 0, i.e. in the graphical lasso, this problem was considered by Witten et al. (2011) and Mazumder and Hastie (2012) where it is proved that such block diagonal structure is obtained whenever λ 1 ≥ max i, j∈L |s i j |. Here, we show that the latter result is still true in the pdglasso problem for any λ 2 ≥ 0. ...

Exact Covariance Thresholding into Connected Components for Large-Scale Graphical Lasso

Journal of Machine Learning Research