Usman T. Khan’s research while affiliated with New York University and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (57)


Water handling practices documented in the dataset used in this study
Protecting the Safe Water Chain in Refugee Camps: An Exploratory Study of Water Handling Practices, Chlorine Decay, and Household Water Safety in South Sudan, Jordan, and Rwanda
  • Article
  • Full-text available

December 2024

·

19 Reads

The American journal of tropical medicine and hygiene

·

Michael De Santi

·

Georges Monette

·

[...]

·

In refugee and internally displaced person settlements, hygienic water handling and free residual chlorine (FRC) are crucial for protecting water against recontamination after distribution up to the household point-of-consumption. We conducted a secondary analysis of water quality and water handling data collected in refugee camps in South Sudan, Jordan, and Rwanda using statistical and process-based modeling to explore how water handling practices affect FRC decay and household FRC outcomes. The two practices that consistently produced a significant effect on FRC decay and household FRC were storing water in direct sunlight and transferring water between containers during household storage. Samples stored in direct sunlight had 0.22–0.31 mg/L lower household FRC and had FRC decay rates between 2 and 3.7 times higher than samples stored in the shade, and samples that were transferred between containers had 0.031–0.51 mg/L lower household FRC and decay rates 1.65–3 times higher than non-transferred samples in sites in which the effect was significant, suggesting that humanitarian responders should aim to provide additional water storage containers to prevent water transferring in households and encourage water-users not to store water in direct sunlight. By contrast, the effect of the three recommended hygienic water handling behaviors (clean, covered containers and drawing by tap or pouring) was mixed or inconclusive. These inconclusive results were likely due to imbalanced or unreliable approaches to gathering the data, and we recommend that hygienic water handling practices that mechanistically provide a physical barrier against recontamination should always be promoted in humanitarian settings.

Download


LSTM size and training datasets used in Experiment 2. Configurations are grouped by experiment
A diversity centric strategy for the selection of spatio-temporal training data for LSTM-based streamflow forecasting

June 2024

·

27 Reads

Deep learning models are increasingly being applied to streamflow forecasting problems. Their success is in part attributed to the large and hydrologically diverse datasets on which they are trained. However, common data selection methods fail to explicitly account for hydrological diversity contained within training data. In this research, clustering is used to characterise temporal and spatial diversity, in order to better understand the importance of hydrological diversity within regional training datasets. This study presents a novel, diversity-based resampling approach to creating hydrologically diverse datasets. First, the undersampling procedure is used to undersample temporal data, and is used to show how the amount of temporal data needed to train models can be halved without any loss in performance. Next, it is applied to reduce the number of basins in the training dataset. While basins cannot be omitted from training without some loss in performance, we show how hydrologically dissimilar basins are highly beneficial to model performance. This is shown empirically for Canadian basins; models trained to sets of basins separated by thousands of kilometres outperform models trained to localised clusters. We strongly recommend an approach to training data selection that encourages a broad representation of diverse hydrological processes.


The IAHS Science for Solutions decade, with Hydrology Engaging Local People IN one Global world (HELPING)

May 2024

·

968 Reads

·

10 Citations


Flood susceptibility mapping using ANNs: a case study in model generalization and accuracy from Ontario, Canada

February 2024

·

591 Reads

·

1 Citation

Accurate flood susceptibility mapping (FSM) is critical for mitigating the environmental, social and economic consequences of floods. The influence of model generalizability onto new watersheds, and the impact of arbitrarily selecting a small subset of flooded and nonflooded locations are current major knowledge gaps in FSM research impacting predictive accuracy. As such, this study conducts an assessment of machine learning models – (i) an Artificial Neural Network - Synthetic Minority Oversampling Technique (ANN-SMOTE) hybrid ensemble with (ii) knowledge-based Analytical Hierarchy Process (AHP) and (iii) diversity-based Shannon Entropy approaches. The ANN-SMOTE, AHP and Entropy models were trained and tested on the Don River watershed in Ontario, Canada, with Overall Accuracy (OA) results of 0.549, 0.404 and 0.452, respectively. ANN-SMOTE’s predictive accuracy remained high when it was tested on four independent watersheds from southern Ontario, indicating strong generalization ability. To simulate the commonly used flood point inventory approach, the number of training samples was reduced by a factor of a 1000, which resulted in a 28% decrease in accuracy. The high performance and generalization potential of the ANN-SMOTE model demonstrate its utility and versatility for future FSM studies, and as a support tool in flood risk management decision making.


Evaluation of Process-Based Ensemble Models for Forecasting Point-of-Consumption Free Residual Chlorine in Refugee Settlements

January 2024

·

13 Reads

Waterborne illnesses are a leading public health concern in refugee and internally displaced person (IDP) settlements. Controlling the spread of these illnesses can be particularly challenging as pathogens can be introduced into previously-safe drinking water during the post-distribution period of collection, transport, and household storage. Free residual chlorine (FRC) is often used in these settlements to prevent recontamination of drinking water, and thus, it is critical that at least 0.2 mg/L of FRC is available up to the point-of-consumption. Chlorine decay models can be used to determine the chlorine dose required to maintain this residual; however, post-distribution FRC decay is highly uncertain due to many immeasurable factors that vary substantially from user to user within a site. Traditional deterministic FRC decay models are unable to quantify this uncertainty. Therefore, there is a need for improved modelling that quantifies uncertainty in FRC decay. Ensemble forecasting systems, which consist of collections of models as opposed to a single standalone model, can quantify this uncertainty by generating probabilistic forecasts of FRC. This study presents a novel use of ensemble techniques to generate probabilistic forecasts of FRC decay for the post-distribution period in refugee and IDP settlements. The two alternatives considered for determining the decay parameters for the ensembles were a resampling approach with least-squares regression and a quantile regression-based approach, both using six different FRC decay equations. These approaches were tested using a six-month operational water quality dataset collected from a refugee settlement in Bangladesh in 2019. The quantile regression-based ensembles produced more reliable forecasts, and better capture of observed values, as compared to resampling with least-squares. Of the FRC decay equations considered, the parallel first-order decay equation produced the least quantile error when compared to the other decay equations considered. This demonstrates that ensemble forecasting systems effectively quantify uncertainty when modelling post-distribution FRC decay. These findings can be used to develop improved FRC guidance for humanitarian responders working in refugee and IDP settlements.


Propagating Particle Tracking Uncertainty Defined by Fuzzy Numbers in Spatially Variable Velocity Fields

September 2023

·

34 Reads

Journal of Marine Science and Engineering

Accurate prediction of the trajectories of material drifting on the ocean surface is critical for risk assessment and responses to environmental emergencies. Prediction of these trajectories is subject to uncertainty arising from a number of sources, with a primary source being uncertainty in the modelled ocean surface currents and winds used as input to the trajectory model. This article presents a fuzzy number-based algorithm for propagating uncertainty through a particle tracking scheme in a time- and space-varying velocity field. The performance of the algorithm was tested by applying it to idealized, analytical velocity fields and scoring the results against the analytical solution. Both epistemic and aleatoric uncertainty were considered and combined using a fractional Brownian motion model for temporal autocorrelation of the uncertainty. In the evaluation of the algorithm, sensitivity was quantified with respect to parameters such as timestep size, resolution of the forcing velocity field, spatial and temporal gradients in the forcing, and resolution of the applied uncertainty. Parameter values optimizing uncertainty representation and computational cost were identified. The applied uncertainty was found to evolve in agreement with classical relative dispersion relationships.


How might climate change impact water safety and boil water advisories in Canada?

July 2023

·

110 Reads

·

2 Citations

A boil water advisory (BWA) informs the public that there is an increased level of risk associated with their water and that they should boil it before consuming. Studies show that small communities in Canada are particularly likely to experience repeat and long-term BWAs. Climate change has led to changes in precipitation and temperature patterns, leading to region-specific impacts such as increased frequency, severity, or variance in floods, forest fires, droughts, freezing rain, and sea water intrusion. Academic and non-academic “grey” literature was reviewed to establish the most likely impacts of climate change on water treatment and infrastructure. Anonymized data from public drinking water systems in Canada was analyzed to determine the most common causes of BWAs between 2005 and 2020. Most BWAs reported were related to breakdowns/malfunctions along the distribution, though inadequate disinfection residual and turbidity or coliforms in the treated water were also common. Furthermore, statistical analysis of the data showed seasonal trends in some of these parameters. The results of this study suggest that increased precipitation, flooding, permafrost degradation, and forest fires are likely to have significant impacts on water safety in Canada. Highlights Climate change effects are expected to worsen many current water challenges. Climate change will disproportionately impact small, rural, and remote water utilities. Water distribution systems are the main source of water safety risk in Canada. Groundwater-supplied systems experience a disproportionate number of BWAs. Seasonal trends in BWA reasons provide opportunities for targeted mitigation.



Special Issue (Remote Sensing): Artificial Intelligence Applications in Remotely Sensed Hydrologic and Water Systems

November 2022

·

170 Reads

Remote Sensing

Remote Sensing is launching a special issue entitled “Artificial Intelligence Applications in Remotely Sensed Hydrologic and Water Systems.” This issue aims to promote state-of-the-art data-driven and machine learning techniques such as deep learning, ensemble learning, and reinforcement learning, using remote sensing in water research spanning hydro-climatology, hydroinformatics, and hydro-meteorology. Applications of interest include, but not limited to, hazard monitoring, forecasting of extreme events, pollution analysis, mapping of renewables, surface water systems, sociotechnical analysis, hydroinformatics, environment and sustainable agriculture applications. Research featuring advances in statistical modeling approaches is also invited. Consideration will be also given to interdisciplinary methodologies in uncertainty analysis, state-estimation, model interpretability, system identification and relational mapping of remotely sensed systems. https://www.mdpi.com/journal/remotesensing/special_issues/AI_water


Citations (37)


... In a closely related study, delved into the consequences of human intervention and rising sea levels on waterlogging within specific areas, particularly Polders-24 and 25. Their analysis provides insights into how anthropogenic factors and natural climatic changes exacerbate waterlogging, highlighting the need for integrated management strategies (Arheimer et al. 2024). To address potential solutions, Dasgupta et al. (2019) proposed the concept of temporary de-poldering. ...

Reference:

Analysing Land Use and Land Cover Dynamics (1972–2020) and Evaluating Drainage System Efficacy: A Case Study of Bangladesh Polders
The IAHS Science for Solutions decade, with Hydrology Engaging Local People IN one Global world (HELPING)

... The remaining 10% of BWAs issued in 2021 were due to water contamination through pathogens, with 8% due to E. coli alone (Environment and Climate Change Canada, 2022b). Additionally, with water quality suffering due to worsening impacts from global warming, including high temperatures, rising sea levels, flooding, droughts, forest fires, and permafrost melting, an increase in the incidence of BWAs in the future is expected (Moghaddam-Ghadimi et al., 2023). ...

How might climate change impact water safety and boil water advisories in Canada?

... Xu et al. (2023 and applied machine learning and deep learning into runoff estimation and flood forecasting in ungauged basins. With the application of machine learning, the machine learning technologies such as deep learning and neutral network are also integrated with SWMM, which significantly improve the simulation accuracy and make the results more comprehensive and flexible (Snieder and Khan 2023;Zhao et al. 2023;Liu et al. 2024;Szelag et al. 2024). ...

A novel ensemble algorithm based on hydrological event diversity for urban rainfall-runoff model calibration and validation
  • Citing Article
  • February 2023

Journal of Hydrology

... Monitoring and predicting TBM performance are critical for ensuring project timelines and mitigating risks associated with unexpected operational anomalies. Machine learning (ML) techniques, particularly deep learning, have demonstrated promise in analysing TBM operational data for various classification tasks [1][2][3][4][5]. Transfer learning, a subfield of ML, has gained traction for its capability to transfer knowledge from one domain to another, thereby improving the performance of models given limited to no labelled data. ...

Practical recommendations for machine learning in underground rock engineering – On algorithm development, data balancing, and input variable selection
  • Citing Article
  • October 2022

Geomechanik und Tunnelbau

... (Hossain, et al., 2021;Elemam and Eldeeb, 2023). Contamination of originally safe drinking water through transport, and storage has been related to spread of shigellosis, hepatitis E, and cholera in internally displaced populations (IDP) and refugee in South Sudan, Malawi, Kenya, Uganda, and Sudan (Golicha et al., 2018;De Santi et al., 2022). Global drinking water quality guidelines (GDWQG) recommend 0.2 mg/L at least of free residual chlorine (FRC) to be provided throughout the post-distribution period to prevent recontamination by priority pathogens (De Santi et al., 2022). ...

Modelling point-of-consumption residual chlorine in humanitarian response: Can cost-sensitive learning improve probabilistic forecasts?

... The global horizontal irradiance (GHI) and average wind speed data for each sampling location was gathered from Solcast data company (https://solcast.com). All these meteorological variables were summarized to obtain antecedent weather conditions at a resolution of 3 days, 7 days, and 10 days before the sampling event (Blagrave at al., 2022). ...

Heatwaves and storms contribute to degraded water quality conditions in the nearshore of Lake Ontario
  • Citing Article
  • April 2022

Journal of Great Lakes Research

... A value is assigned to each alternative to represent its performance for a certain criterion: from 1 (poor performance) to 5 (excellent performance). The judgment for assigning these values is based on a literature review [68][69][70][71] and professional judgment. ...

A simplified geospatial model to rank LID solutions for urban runoff management
  • Citing Article
  • March 2022

The Science of The Total Environment

... Flood hazard assessment is the first step in flood risk analysis (Oliver et al. 2018;Rincón et al. 2022) and includes modelling the extent of flood propagation, as described in the review of flood conceptual models by Teng et al. (2017). We focus on fluvial flood assessments, which require a hydraulic model of the river system and existing or projected hydraulic structures. ...

Stochastic Flood Risk Assessment under Climate Change Scenarios for Toronto, Canada Using CAPRA

Water

... It is important to note here that overestimation of uncertainty (i.e., predicting too large a possible search area) may have impacts just as detrimental as underestimation. Steps towards accurate accounting of uncertainty have been taken in, amongst others, previous work by Blanken et al. [7], who employed fuzzy numbers [8][9][10] to propagate uncertainty through a drift trajectory model forced by time series measured at a single point. However, this work did not consider spatially variable velocity fields and, therefore, the algorithm used by Blanken et al. [7] is not suitable for use with forcing data obtained from numerical models of the ocean and atmosphere. ...

A Fuzzy-Based Framework for Assessing Uncertainty in Drift Prediction Using Observed Currents and Winds

... In particular, this model comprises three blocks: (1) an ANN, with its ability to map intricate non-linear correlations between historical land use and diverse causes, which has been used to determine the likelihood of suitability of various types of land [36,51]; (2) a discrete and abstract computational system with self-adaptive inertia and a competitive principle based on CA, which makes it possible to accurately predict the long-term spatial pathways of LULC. It may also be utilized to evaluate the particular relationships between various land use types, which are typically represented as interactions and competition, and (3) a Markov model, a stochastic model used to simulate pseudo-random systems and forecast demand in the future using past data [52]. ...

Data Driven Models as A Powerful Tool to Simulate Emerging Bioprocesses: An Artificial Neural Network Model to Describe Methanotrophic Microbial Activity
  • Citing Article
  • March 2021

Journal of Environmental Informatics