Sudha Ram’s research while affiliated with University of Arizona and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (210)


A Reduced Modeling Approach for Making Predictions with Incomplete Data Having Blockwise Missing Patterns
  • Article

November 2024

INFORMS Journal on Data Science

·

Faiz Currim

·

Sudha Ram

Incomplete data with blockwise missing patterns are commonly encountered in analytics, and solutions typically entail listwise deletion or imputation. However, as the proportion of missing values in input features increases, listwise or columnwise deletion leads to information loss, whereas imputation diminishes the integrity of the training data set. We present the blockwise reduced modeling (BRM) method for analyzing blockwise missing patterns, which adapts and improves on the notion of reduced modeling proposed by Friedman, Kohavi, and Yun in 1996 as lazy decision trees. In contrast to the original idea of reduced modeling of delaying model induction until a prediction is required, our method is significantly faster because it exploits the blockwise missing patterns to pretrain ensemble models that require minimum imputation of data. Models are pretrained over the overlapping subsets of an incomplete data set that contain only populated values. During prediction, each test instance is mapped to one of these models based on its feature-missing pattern. BRM can be applied to any supervised learning model for tabular data. We benchmark the predictive performance of BRM using simulations of blockwise missing patterns on three complete data sets from public repositories. Thereafter, we evaluate its utility on three data sets with actual blockwise missing patterns. We demonstrate that BRM is superior to most existing benchmarks in terms of predictive performance for linear and nonlinear models. It also scales well and is more reliable than existing benchmarks for making predictions with blockwise missing pattern data. Supplemental Material: The online appendix is available at https://doi.org/10.1287/ijds.2022.9016 .


Fig. 1 Component smooth function of sound level in GAMM for physiological wellbeing as a bivariate function of SDNN and normalized-HF. The solid line indicates how physiological wellbeing varies as a function of sound level, while the dashed lines are confidence intervals.
Fig. 3 Interaction plots of the top two person-level variables moderating the sound-wellbeing association. a Green solid line: Normal blood pressure. Red dashed line: High blood pressure. b Green solid line: Computer use intensive work. Red dashed line: Not computer use intensive work.
Comparing predictive performance of different simultaneous modeling methods.
Comparing predictive performance of machine learning models with and without sound as an input.
Coefficients of person-level input variables contributing to heterogeneity.

+1

Discovery of associative patterns between workplace sound level and physiological wellbeing using wearable devices and empirical Bayes modeling
  • Article
  • Full-text available

January 2023

·

117 Reads

·

7 Citations

npj Digital Medicine

·

Faiz Currim

·

Casey M. Lindberg

·

[...]

·

Sudha Ram

We conducted a field study using multiple wearable devices on 231 federal office workers to assess the impact of the indoor environment on individual wellbeing. Past research has established that the workplace environment is closely tied to an individual’s wellbeing. Since sound is the most-reported environmental factor causing stress and discomfort, we focus on quantifying its association with physiological wellbeing. Physiological wellbeing is represented as a latent variable in an empirical Bayes model with heart rate variability measures—SDNN and normalized-HF as the observed outcomes and with exogenous factors including sound level as inputs. We find that an individual’s physiological wellbeing is optimal when sound level in the workplace is at 50 dBA. At lower (<50dBA) and higher (>50dBA) amplitude ranges, a 10 dBA increase in sound level is related to a 5.4% increase and 1.9% decrease in physiological wellbeing respectively. Age, body-mass-index, high blood pressure, anxiety, and computer use intensive work are person-level factors contributing to heterogeneity in the sound-wellbeing association.

Download

Toward NEPA performance: A framework for assessing EIAs

November 2022

·

79 Reads

·

13 Citations

Environmental Impact Assessment Review

The National Environmental Policy Act (NEPA) provides a regulatory decision-making process that requires U.S. federal agencies to assess the purpose and socio-environmental impacts of a proposed action before deciding to move forward with that action. The multiplicity of NEPA objectives, the complex tradeoffs embedded in the Environmental Impact Assessment (EIA), and the difficulties in accessing data have presented challenges for evaluating NEPA performance. Researchers have responded with a growing array of performance dimensions and specialized measurement approaches. In this paper, we advance a performance framework for EIAs that integrates several of these dimensions and provides a conceptually coherent approach to procedural and substantive performance. The framework articulates three procedural elements (use of science and analysis, nature of public participation, and management of EIA processes); and three substantive elements (quality of both the NEPA review and action decision, and both the accountability and efficiency of the NEPA review decision). Each element is further elaborated by specific functions with specific variables as a basis for future performance measurements. We use two hypothetical use cases, drawn from public land management and federal highway planning, to illustrate how the performance concepts from the framework can be operationalized and measured.


A Human-in-the-Loop Segmented Mixed-Effects Modeling Method for Analyzing Wearables Data

September 2022

·

23 Reads

·

3 Citations

ACM Transactions on Management Information Systems

Wearables are an important source of big data as they provide real-time high-resolution data logs of health indicators of individuals. Higher-order associations between pairs of variables is common in wearables data. Representing higher-order association curves as piece-wise linear segments in a regression model makes them more interpretable. However, existing methods for identifying the change points for segmented modeling either overfit or have low external validity for wearables data containing repeated measures. Therefore, we propose a human-in-the-loop method for segmented modeling of higher-order pairwise associations between variables in wearables data. Our method uses the smooth function estimated by a generalized additive mixed model to allow the analyst to annotate change point estimates for a segmented mixed-effects model, and thereafter employs the Brent's constrained optimization procedure to fine-tuning the manually provided estimates. We validate our method using three real-world wearables datasets. Our method not only outperforms state-of-the-art modeling methods in terms of prediction performance but also provides more interpretable results. Our study contributes to health data science in terms of developing a new method for interpretable modeling of wearables data. Our analysis uncovers interesting insights on higher order associations for health researchers.


Data Completeness and Complex Semantics in Conceptual Modeling: The Need for a Disaggregation Construct

August 2022

·

19 Reads

·

2 Citations

Journal of Data and Information Quality

Conceptual modeling is important for developing databases that maintain the integrity and quality of stored information. However, classical conceptual models have often been assumed to work on well-maintained and high-quality data. With the advancement and expansion of data science, it is no longer the case. The need to model and store data has emerged for settings with lower data quality, which creates the need to update and augment conceptual models to represent lower-quality data. In this paper, we focus on the intersection between data completeness (an important aspect of data quality) and complex class semantics (where a complex class entity represents information that spans more than one simple class entity). We propose a new disaggregation construct to allow the modeling of incomplete information. We demonstrate the use of our disaggregation construct for diverse modeling problems and discuss the anomalies that could occur without this construct. We provide formal definitions and thorough comparisons between various types of complex constructs to guide future application and prove the unique interpretation of our newly proposed disaggregation construct.


ROLEX: A Novel Method for Interpretable Machine Learning Using Robust Local Explanations

June 2022

·

130 Reads

·

11 Citations

MIS Quarterly

Recent developments in big data technologies are revolutionizing the field of healthcare predictive analytics (HPA), enabling researchers to explore challenging problems using complex prediction models. Nevertheless, healthcare practitioners are reluctant to adopt those models as they are less transparent and accountable due to their black-box structure. We believe that instance-level, or local, explanations enhance patient safety and foster trust by enabling patient-level interpretations and medical knowledge discovery. Therefore, we propose the RObust Local EXplanations (ROLEX) method to develop robust, instance-level explanations for HPA models in this study. ROLEX adapts state-of-the-art methods and ameliorates their shortcomings in explaining individual-level predictions made by black-box machine learning models. Our analysis with a large real-world dataset related to a prevalent medical condition called fragility fracture and two publicly available healthcare datasets reveals that ROLEX outperforms widely accepted benchmark methods in terms of local faithfulness of explanations. In addition, ROLEX is more robust since it does not rely on extensive hyperparameter tuning or heuristic algorithms. Explanations generated by ROLEX, along with the prototype user interface presented in this study, have the potential to promote personalized care and precision medicine by providing patient-level interpretations and novel insights. We discuss the theoretical implications of our study in healthcare, big data, and design science.


Discovery of associative patterns between workplace sound level and physiological wellbeing using wearable devices and empirical Bayes modeling

February 2022

·

116 Reads

We conducted a field study using multiple wearable devices on 231 federal office workers to assess the impact of the indoor environment on individual wellbeing. Past research has established that the workplace environment is closely tied to an individual’s wellbeing. Since sound is the most-reported environmental factor causing stress and discomfort, we focus on quantifying its association with physiological wellbeing. Physiological wellbeing is represented as a latent variable in an empirical Bayes model with heart rate variability measures – SDNN and normalized-HF as the observed outcomes and with exogenous factors including sound level as inputs. We find that an individual’s physiological wellbeing is optimal when sound level in the workplace is at 50 dBA. At lower (<50dBA) and higher (>50dBA) amplitude ranges, a 10 dBA increase in sound level is related to a 5.4% increase and 1.9% decrease in physiological wellbeing respectively. Age, body-mass-index, high blood pressure, anxiety, and computer use intensive work are person-level factors contributing to heterogeneity in the sound-wellbeing association.


Deep Learning of Spatiotemporal Patterns for Urban Mobility Prediction Using Big Data

February 2022

·

338 Reads

·

26 Citations

Information Systems Research

Timely and accurate prediction of human movement in urban areas offers instructive insights into transportation management, public safety, and location-based services, to name a few. Yet, modeling urban mobility is challenging and complex because of the spatiotemporal dynamics of movement behavior and the influence of exogenous factors such as weather, holidays, and local events. In this paper, we use bus transportation as a proxy to mine spatiotemporal travel patterns. We propose a deep-learning-based urban mobility prediction model that collectively forecasts passenger flows between pairs of city regions in an origin-destination (OD) matrix. We first process OD matrices in a convolutional neural network to capture spatial correlations. Intermediate results are reconstructed into three multivariate time series: hourly, daily, and weekly time series. Each time series is aggregated in a long short-term memory (LSTM) network with a novel attention mechanism to guide the aggregation. In addition, our model is context-aware by using contextual embeddings learned from exogenous factors. We dynamically merge results from LSTM components and context embeddings in a late fusion network to make a final prediction. The proposed model is implemented and evaluated using a large-scale transportation data set of more than 200 million bus trips with a suite of Big Data technologies developed for data processing. Through performance comparison, we show that our approach achieves sizable accuracy improvements in urban mobility prediction. Our work has major implications for efficient transportation system design and performance improvement. The proposed deep neural network structure is generally applicable for sequential graph data prediction.


Study participants and used models. SVM = support vector machine.
Impact of features on prediction model output. Red and blue colors represent high and low levels of each predictor. The x‐axis represents the SHAP value. A positive SHAP value means likely to have a fracture; a negative value means unlikely to have a fracture. AST = aspartate aminotransferase; TSH = thyroid‐stimulating hormone (thyrotropin); TBS = trabecular bone score; KMMSE = Korean mini‐mental status examination; CRP = C‐reactive protein; K‐GDS = Korean geriatric depression score; SHAP = Shapley additive explanations.
Impact on prediction model output of (A) total hip BMD, (B) lumbar spine BMD, (C) subjective arthralgia score, and (D) homocysteine level. Red and blue colors represent old and young age. The y‐axis represents the SHAP value. A positive SHAP value means likely to have a fracture; a negative value means unlikely to have a fracture. SHAP = Shapley additive explanations.
Performance in AUC of machine learning models
Performance in AUC of machine learning and FRAX score
A Novel Fracture Prediction Model Using Machine Learning in Community‐Based Cohort

February 2020

·

434 Reads

·

63 Citations

JBMR Plus

ABSTRACT The prediction of fracture risk in osteoporotic patients has been a topic of interest for decades, and models have been developed for the accurate prediction of fracture, including the fracture risk assessment tool (FRAX). As machine‐learning methodologies have recently emerged as a potential model for medical prediction tools, we aimed to develop a novel fracture prediction model using machine‐learning methods in a prospective community‐based cohort. In this study, 2227 participants (1257 females) with a baseline bone mineral density (BMD) and trabecular bone score were enrolled from the Ansung cohort. The primary endpoint was the fragility fractures reported by patients or confirmed by X‐rays. We used 3 different models: CatBoost, support vector machine (SVM), and logistic regression. During a mean 7.5‐year follow‐up (range, 2.5 to 10 years), fragility fractures occurred in 537 (25.6%) of participants. In predicting total fragility fractures, the area under the curve (AUC) values of the CatBoost, SVM, and logistic regression models were 0.688, 0.500, and 0.614, respectively. The AUC value of CatBoost was significantly better than that of FRAX (0.663; p


Robust Local Explanations for Healthcare Predictive Analytics: An Application to Fragility Fracture Risk Modeling

December 2019

·

7 Reads

·

1 Citation

With recent advancements in data analytics, healthcare predictive analytics (HPA) is garnering growing interest among practitioners and researchers. However, it is risky to blindly accept the results and users will not accept the HPA model if transparency is not guaranteed. To address this challenge, we propose the RObust Local EXplanations (ROLEX) method, which provides robust, instance-level explanations for any HPA model. The applicability of the ROLEX method is demonstrated using the fragility fracture prediction problem. Analysis with a large real-world dataset demonstrates that our method outperforms state-of-the-art methods in terms of local fidelity. The ROLEX method is applicable to various types of HPA problems beyond the fragility fracture problem. It is applicable to any type of supervised learning model and provides fine-grained explanations that can improve understanding of the phenomenon of interest. Finally, we discuss theoretical implications of our study in light of healthcare IS, big data, and design science.


Citations (66)


... Islam et al. (2022) have provided a systematic review of XAI in terms of different application domains and tasks. XAI has broad applications and profound impacts in sectors including but not limited to healthcare (Kim et al., 2023), precision medicine (Tjoa & Guan, 2020), autonomous driving systems (Dong et al., 2023), decision-making systems (Linardatos et al., 2020;Coussement & Benoit, 2021), emotion analysis for customers in marketing (Li et al., 2024), finance , and broader societal contexts. XAI methods can be either global or local in scope. ...

Reference:

A novel explainable artificial intelligence framework using knockoffs techniques with applications to sports analytics
ROLEX: A Novel Method for Interpretable Machine Learning Using Robust Local Explanations
  • Citing Article
  • June 2022

MIS Quarterly

... Individuals with multimorbidity have poor quality of life, psychological distress, worsening functional capacity, longer hospital stays, and more postoperative complications, leading to higher costs of care [10]. While conventional business intelligence (BI) and data mining methods for analyzing individual disease patterns are well established [11], [12], fewer studies have expressly accounted for multimorbidity in their problem formulation [7]- [9], [13]. ...

Robust Local Explanations for Healthcare Predictive Analytics: An Application to Fragility Fracture Risk Modeling
  • Citing Conference Paper
  • December 2019

... Furthermore, humidity outside the 30%-60% relative humidity (RH) range -less than 30% RH or greater than 60% RH -was associated with a 25% higher stress response (Razjouyan et al., 2020). Finally, noise levels less than 35 or 40 decibels as well as noise levels greater than 45 decibels were associated with higher stress levels (Srinivasan et al., 2023). Personality also plays a role, with persons scoring high on extraversion being happier in open office settings and more focused than those scoring high on neuroticism (introverts) (Baranski et al., 2023) In combination, these findings point to the need for many choices in office design, with a variety of types of spaces for people to gather in different sized groups for different purposes, with options for quiet heads down spaces when needed. ...

Discovery of associative patterns between workplace sound level and physiological wellbeing using wearable devices and empirical Bayes modeling

npj Digital Medicine

... Further, the smart bracelet is deliberately selected as a smart jewelry category in this research in some aspects. First, the smart bracelet is worn on the wrist, and wrist-worn wearables are often utilized in research applications (Srinivasan et al., 2023). Followingly, depending on the body location, smart bracelets are better at collecting specific health data, informing users about notifications, and having social visibility (Inget et al., 2019). ...

A Human-in-the-Loop Segmented Mixed-Effects Modeling Method for Analyzing Wearables Data
  • Citing Article
  • September 2022

ACM Transactions on Management Information Systems

... From a legislative perspective, the principles of EIA were established in 1969 in the United States with the adoption passage of the "National Environmental Policy Act", an act required for the development of an emerging environmental impact statement [47][48][49]. Although the concept led to numerous debates about its role in the implementation process, it is seen as a means to evaluate actions, policies, plans, and programs that contribute to the achievement of objectives and serve as a link between the environment and sustainable development [50]. ...

Toward NEPA performance: A framework for assessing EIAs
  • Citing Article
  • November 2022

Environmental Impact Assessment Review

... In the same vein, an informal, "lightweight" modeling approach can be helpful in early stages of data exploration and analysis (Ambler 2002;Castellanos et al. 2020;Kaur and Rani 2013;Li et al. 2022). Hence, it has the potential to better support informal modeling approaches taken by agile. ...

Data Completeness and Complex Semantics in Conceptual Modeling: The Need for a Disaggregation Construct
  • Citing Article
  • August 2022

Journal of Data and Information Quality

... Despite increasing attention to population dynamics in sustainable urban development, managing urban population flows remains underexplored in planning practice. Recent studies show that deep learning models can enhance predictive analytics for urban mobility, addressing the gaps left by traditional approaches [16], yet the potential of such data remains underutilized, especially in space-constrained cities where infrastructure expansion is limited [17]. As demonstrated in a study [18], integrating dynamic control systems can significantly enhance service levels and optimize the utilization of existing infrastructure. ...

Deep Learning of Spatiotemporal Patterns for Urban Mobility Prediction Using Big Data
  • Citing Article
  • February 2022

Information Systems Research

... In this context, bicycles represent an interesting resource; they are a cost-effective, eco-friendly, versatile, and a healthy means of transportation. Bike-sharing systems [2][3][4] have made it possible for people who cannot or are not willing to own a bicycle to ride in urban areas. These sharing systems are enjoying a good level of success in many cities, but are not free from challenges: they require careful planning and maintenance. ...

SMARTBIKE: Policy making and decision support for bike share systems
  • Citing Conference Paper
  • September 2016

... Car interaction is becoming ubiquitous because of the fast advancement and adoption of new tools of knowledge and interaction. Smart commuter rails have been made possible by several innovations, with every automobile equipped with a GPS device and every driver owning a smartphone, various systems employ GPS data for tracking driving habits and types of transportation [3].Smart health aims to render healthcare accessible to the greatest number of individuals feasible via health treatment [4] and enhanced evaluation support for physicians through the application of machine learning [5]. Only since the increasing prevalence of mobile phones and health monitoring equipment [6] that can record real information regarding inhabitants' lives, in addition to capturing usual tasks and trying to identify motions using motion detectors, has it become critical to utilize network infrastructure to obtain on above data that can enhance health decisions. ...

A big data approach for smart transportation management on bus network
  • Citing Conference Paper
  • September 2016

... In the context of fracture risk prediction, machine learning approaches offer several advantages over traditional statistical methods. Firstly, machine learning algorithms can effectively handle non-linear relationships and high-dimensional data, which are often present in fracture risk assessment scenarios (Kong et al., 2020). Traditional logistic regression models may oversimplify these complex relationships, leading to suboptimal performance. ...

A Novel Fracture Prediction Model Using Machine Learning in Community‐Based Cohort

JBMR Plus