Vanessa A. Garayburu-Caruso's research while affiliated with Pacific Northwest National Laboratory and other places

What is this page?


This page lists the scientific contributions of an author, who either does not have a ResearchGate profile, or has not yet added these contributions to their profile.

It was automatically created by ResearchGate to create a record of this author's body of work. We create such pages to advance our goal of creating and maintaining the most comprehensive scientific repository possible. In doing so, we process publicly available (personal) data relating to the author as a member of the scientific community.

If you're a ResearchGate member, you can follow this page to keep up with this author's work.

If you are this author, and you don't want us to display this page anymore, please let us know.

Publications (53)


Supporting Information for "Prediction of Distributed River Sediment Respiration Rates using Community-Generated Data and Machine Learning"
  • Preprint

May 2024

·

10 Reads

·

Stefan F Gary

·

Timothy D Scheibe

·

[...]

·

Share

Genomic fingerprints of the world's soil ecosystems

May 2024

·

31 Reads

mSystems®

mSystems®

Despite the explosion of soil metagenomic data, we lack a synthesized understanding of patterns in the distribution and functions of soil microorganisms. These patterns are critical to predictions of soil microbiome responses to climate change and resulting feedbacks that regulate greenhouse gas release from soils. To address this gap, we assay 1,512 manually curated soil metagenomes using complementary annotation databases, read-based taxonomy, and machine learning to extract multidimensional genomic fingerprints of global soil microbiomes. Our objective is to uncover novel biogeographical patterns of soil microbiomes across environmental factors and ecological biomes with high molecular resolution. We reveal shifts in the potential for (i) microbial nutrient acquisition across pH gradients; (ii) stress-, transport-, and redox-based processes across changes in soil bulk density; and (iii) greenhouse gas emissions across biomes. We also use an unsupervised approach to reveal a collection of soils with distinct genomic signatures, characterized by coordinated changes in soil organic carbon, nitrogen, and cation exchange capacity and in bulk density and clay content that may ultimately reflect soil environments with high microbial activity. Genomic fingerprints for these soils highlight the importance of resource scavenging, plant-microbe interactions, fungi, and heterotrophic metabolisms. Across all analyses, we observed phylogenetic coherence in soil microbiomes—more closely related microorganisms tended to move congruently in response to soil factors. Collectively, the genomic fingerprints uncovered here present a basis for global patterns in the microbial mechanisms underlying soil biogeochemistry and help beget tractable microbial reaction networks for incorporation into process-based models of soil carbon and nutrient cycling. IMPORTANCE We address a critical gap in our understanding of soil microorganisms and their functions, which have a profound impact on our environment. We analyzed 1,512 global soils with advanced analytics to create detailed genetic profiles (fingerprints) of soil microbiomes. Our work reveals novel patterns in how microorganisms are distributed across different soil environments. For instance, we discovered shifts in microbial potential to acquire nutrients in relation to soil acidity, as well as changes in stress responses and potential greenhouse gas emissions linked to soil structure. We also identified soils with putative high activity that had unique genomic characteristics surrounding resource acquisition, plant-microbe interactions, and fungal activity. Finally, we observed that closely related microorganisms tend to respond in similar ways to changes in their surroundings. Our work is a significant step toward comprehending the intricate world of soil microorganisms and its role in the global climate.


Figure 2: Lambda binning to convert raw FTICR-MS into a representative reaction network using the cumulative probability
Lambda-PFLOTRAN 1.0: Workflow for Incorporating Organic Matter Chemistry Informed by Ultra High Resolution Mass Spectrometry into Biogeochemical Modeling
  • Preprint
  • File available

April 2024

·

77 Reads

Organic matter (OM) composition plays a central role in microbial respiration of dissolved organic matter and subsequent biogeochemical reactions. Here, a direct connection of organic carbon chemistry and thermodynamics to reactive transport simulators has been achieved through the newly developed Lambda-PFLOTRAN workflow tool that succinctly incorporates carbon chemistry data generated from Fourier transform ion cyclotron resonance mass spectrometry (FTICR-MS) into reaction networks to simulate organic matter degradation and the resulting biogeochemistry. Lambda-PFLOTRAN is a python-based workflow, executed through a Jupyter Notebook interface, that digests raw FTICR-MS data, develops a representative reaction network based on substrate-explicit thermodynamic modeling (also termed lambda modeling due to its key thermodynamic parameter λ used therein), and completes a biogeochemical simulation with the open source, reactive flow and transport code PFLOTRAN. The workflow consists of the following five steps: configuration, thermodynamic (lambda) analysis, sensitivity analysis, parameter estimation, and simulation output and visualization. Two test cases are provided to demonstrate the functionality of the Lambda-PFLOTRAN workflow. The first test case uses laboratory incubation data of temporal oxygen depletion to fit lambda parameters (i.e., maximum utilization rate and microbial carrying capacity). A slightly more complex second test case fits multiple lambda formulation and soil organic matter release parameters to temporal greenhouse gas generation measured during a soil incubation. Overall, the Lambda-PFLOTRAN workflow facilitates upscaling by using molecular-scale characterization to inform biogeochemical processes occurring at larger scales.

Download

Riverine dissolved organic matter transformations increase with watershed area, water residence time, and Damköhler numbers in nested watersheds

March 2024

·

93 Reads

Quantifying the relative influence of factors and processes controlling riverine ecosystem function is essential to predicting future conditions under global change. Dissolved organic matter (DOM) is a fundamental component of riverine ecosystems that fuels microbial food webs, influences nutrient and light availability, and represents a significant carbon flux globally. The heterogeneous nature of DOM molecular composition and its propensity for interaction (i.e., functional diversity) can characterize riverine ecosystem function across spatiotemporal scales. To investigate fundamental drivers of DOM diversity, we collected seasonal water samples from 42 nested locations within five watersheds spanning multiple watershed sizes (~5 to 30,000 km) across the United States. Patterns in DOM molecular diversity and putative biochemical transformations derived from high-resolution mass spectrometry were assessed across gradients of explanatory variables associated with watershed characteristics (e.g., watershed area, water residence time, land cover). We found that putative biochemical transformations were more strongly related to explanatory variables across watersheds than common bulk DOM parameters and that watershed area, surface water residence time and derived Damköhler numbers representing DOM reactivity timescales were strong predictors of DOM diversity. The data also indicate that catchment-specific land cover factors can significantly influence DOM diversity in diverging directions. Overall, the results highlight the importance of considering water residence time and land cover when interpreting longitudinal patterns in DOM chemistry and the continued challenge of identifying generalizable drivers that are transferable across watershed and regional scales for application in Earth system models. This work also introduces a Findable Accessible Interoperable Reusable (FAIR) dataset (>300 samples) to the community for future syntheses.


Sediment-associated processes drive spatial variation in ecosystem respiration in the Yakima River basin

March 2024

·

33 Reads

Areas where groundwater and surface water mix (i.e., hyporheic zones, HZ) contribute substantially to stream ecosystem respiration (ERtot). We rely on reactive transport models to understand HZ respiration at large scales; however, model outputs have not been evaluated with field estimates of ERtot. Here we evaluate the degree to which spatial variation in model-predicted HZ respiration can explain spatial variation in field-estimated ERtot across 32 sites in the Yakima River basin (YRB). We find that predicted HZ respiration did not explain spatial variation in ERtot. We hypothesize that ERtot is influenced by processes that integrate contributions from sediments, such as benthic algae, submerged macrophytes, and shallow HZ. Our results indicate that sediment-associated processes hydrologically connected to the active channel are primary drivers of spatial variation in ERtot in the YRB. We encourage conceptual and physical models of stream ERtot to integrate shallow hyporheic exchange with sediment associated primary production.


Prediction of Distributed River Sediment Respiration Rates using Community-Generated Data and Machine Learning

March 2024

·

35 Reads

River sediment microbial respiration is a key indicator of ecosystem functioning and the biogeochemical fluxes across this critical zone link surface and subsurface waters. As such, there is tremendous interest in measuring and mapping these respiration rates. Respiration observations are expensive and labor intensive; there is limited data available to the community. An open science, collaborative initiative is collecting samples for respiration rate analysis and multi-scale metadata; this evolving data set is being used for making machine learning (ML) predictions at unsampled sites to help inform continued community engagement. However, it is a challenge to find an optimum configuration for ML models to work with this feature-rich (i.e. 100+ possible input variables) data set. Here, we present results from a two-tiered approach to managing the analysis of this complex data set: 1) a stacked ensemble of models that automatically optimizes hyperparameters and manages the training of many models and 2) feature permutation importance to detect the most important features in the models. The major elements of this workflow are modular, portable, open, and cloud-based thus making this implementation a potential template for other applications. The models developed here predict that sediment organic matter chemistry is one of the most important features for predicting sediment respiration rate. Other larger-scale, important features fall into the categories of climatic, ecological, geological, and fluvial settings. Leveraging these larger-scale features to generate data-driven estimates of river sediment respiration rates reveals spatially consistent but heterogeneous patterns across the river network of the Columbia River Basin.


Correlation plots describing showing the percentage of sediment DOM represented as inferred biochemical classes from high resolution FTICR‐MS analyses (See Methods Section 2.5) as a function of longitude.
Correlation Matrix to show r values for significant correlations (p < 0.05) between inferred biochemical classes derived from high resolution FTICR‐MS analyses (See Methods Section 2.5) with sediment and soil elemental composition as well as land cover. A negative value indicates an inverse relationship.
Redundancy Analysis generated with results from stepwise model building for constrained ordination. Panel (a) is the RDA while panel (b) displays RDA 1 as a function of longitude. For panel a, red colors designate DOM chemistry as FTICR‐MS biochemical classes, which served as independent variables for this analysis. The black colors represent the predictor variable for this analysis, which includes soil and sediment elemental composition and land cover.
Linkages Between Mineral Element Composition of Soils and Sediments With Hyporheic Zone Dissolved Organic Matter Chemistry Across the Contiguous United States

March 2024

·

70 Reads

The hyporheic zone is a hotspot for biogeochemical cycling where interactions with mineral metals preserve the release and biodegradation of organic matter (OM). A small fraction of OM can still be exchanged between localized sediments and the overlying water column, and recent evidence suggests there exists a longitudinal structuring in sediment dissolved OM (DOM) chemistry across the continental United States (CONUS). In this study, we tested a hypothesis that water extractable sediment DOM chemistry could be explained by sediment metal contents and integrative watershed scale features at the CONUS scale. Crowdsourced samples were characterized for high resolution mass spectrometry and coupled with sediment metals determined via x‐ray fluorescence as well as with land cover and soil elemental information obtained from national databases. Our results highlight weak relationships between DOM chemistry and elemental composition at the CONUS scale indicating limited transferability of organo‐metal linkages into multi‐scale hydrobiogeochemical models.



Investigating the impacts of solid phase extraction on dissolved organic matter optical signatures and the pairing with high‐resolution mass spectrometry data across a freshwater stream network

February 2024

·

42 Reads

Advancing our understanding of dissolved organic matter (DOM) chemistry in aquatic systems necessitates the integration of data streams from multiple analytical platforms. Some measurements require pretreatment with solid phase extraction (SPE), while others are performed directly on whole water samples. Evidence has suggested that SPE will be biased against select DOM fractions, leading to concerns over the ability to establish data linkages across platforms with variable needs for SPE pretreatment, such as those from optical measurements and those that provide high‐resolution molecular information. Here, we directly addressed this concern by assessing the impact of SPE on DOM optical properties through excitation–emission matrices with parallel factor analysis (PARAFAC) for 47 samples across a stream network within a single watershed reflective of variable DOM sources. PARAFAC data was further paired with molecular information obtained by Fourier transform ion cyclotron resonance mass spectrometry (FTICR‐MS). A comparison of PARAFAC models first revealed no systematic qualitative differences in major components between whole water DOM and DOM isolated by SPE (SPE‐DOM); however, quantitative biases against select components were observed. Further linkages with FTICR‐MS data revealed that the molecular fingerprint associated with each PARAFAC component was consistent between the whole water DOM and SPE‐DOM. Our results suggest that bulk scale linkages across these analytical platforms could be inferred irrespective of the observed quantitative biases resulting from SPE for samples within this example watershed. This work represents a key step toward the systematic evaluation of linkages between optical and high‐resolution mass spectrometry datasets in freshwater lotic environments.


Yakima River Basin Water Column Respiration is a Minor Component of River Ecosystem Respiration

January 2024

·

64 Reads

·

1 Citation

Aerobic respiration of organic matter is a key metabolic process influencing carbon (C) biogeochemistry in aquatic ecosystems. Anthropogenic and environmental perturbations to stream ecosystem metabolism can have deleterious effects on downstream water quality. Various environmental features of rivers also influence stream metabolism, including physical (e.g., discharge, light, flow regimes) and chemical factors (nutrients, organic matter) and watershed characteristics (e.g., stream size or drainage area, land use). The relative proportion of surface water contact with benthic sediments has been considered the primary driver of ecosystem processes, including ecosystem respiration (ER). While aquatic ecosystem respiration occurs in the water column (ERwc) and in benthic sediments—including surficial and subsurface sediments (ERsed)—ERsed has long been assumed to be the primary contributor to whole-river ecosystem respiration (ERtot). Recent studies show, however, that somewhere along the river continuum (e.g., 5th–9th order), rivers transition from being dominated by benthic processes to being dominated by water column processes. Yet few metabolism studies have parsed contributions from the water column (ERwc) to ERtot, making it difficult to evaluate the relative magnitude and importance of ERwc across the river continuum and across biomes. In this study, we used the Yakima River basin, Washington, USA, to increase our understanding of basin-scale variation in ERwc. We collected ERwc data and water chemistry samples in triplicate at 47 sites in the Yakima River basin distributed across Strahler stream orders 2–7 and different hydrological and biophysical settings during summer baseflow conditions in 2021. We found that observed ERwc rates were consistently slow throughout the basin during baseflow conditions, ranging from −0.11–0.03 mg O2 L⁻1 d⁻1, and were generally at the very slow end of the range of published ERwc literature values. When compared to reach-scale ERtot rates predicted for rivers across the conterminous United States (CONUS), the very slow ERwc rates we observed throughout the Yakima River basin indicate that ERwc is likely a small component of ERtot in this basin. Despite these slow rates, ERwc nonetheless shows spatial variation across the Yakima River basin that was well explained by watershed characteristics and water chemistry. Multiple linear regression model results show that nitrate (NO3-N), dissolved organic carbon (DOC), and temperature together explained 41.5 % of the spatial variation in ERwc. Supporting the findings of other studies, we found that ERwc increased linearly with increasing NO3-N, increasing DOC, and increasing temperature. We hypothesize that low concentrations of nutrients, DOC, and low temperatures in the water column, coupled with low TSS concentrations, likely contribute to the slow ERwc rates observed throughout the Yakima River basin. Because ERtot measurements integrate contributions from water column respiration and sediment-associated respiration (ERsed), estimating ERtot in cold, clear, low nutrient rivers like those in the Yakima River basin with very slow ERwc will essentially measure contributions from ERsed.


Citations (29)


... In fact, the hyporheic zone accounts for the majority of ecosystem metabolism in some aquatic systems (e.g., Naegeli and Uehlinger 1997;Fulton et al. 2024). Although characterizing hyporheic metabolism is key for understanding river corridor biogeochemistry, high spatiotemporal heterogeneity and interacting environmental drivers in the HZ makes it difficult to develop predictive relationships for hyporheic metabolism at reach-to-basin scales (Buser-Young et al. 2023;Stegen et al. 2023;Tureţcaia et al. 2023). ...

Reference:

Allometric scaling of hyporheic respiration across basins in the Pacific Northwest USA
Rethinking Aerobic Respiration in the Hyporheic Zone under Variation in Carbon and Nitrogen Stoichiometry
  • Citing Article
  • October 2023

Environmental Science and Technology

... The bias was possibly due to a weak independence (14 %) of the bottom station relative to the top station, with reach respiration estimates masked by difficulties in removing the effect of upstream processes (Demars et al., 2015). The supersaturation effect has recently been corrected through the use of TDG, travel time and gas exchange rate (Roley et al., 2023). Alternatively, the concomitant use of argon, a noble gas with similar diffusion and solubility to oxygen, allows the estimation of unbiased ER using only one station (Demars and Dörsch, 2023). ...

Coupled primary production and respiration in a large river contrasts with smaller rivers and streams
  • Citing Article
  • October 2023

... A 2008 analysis of PyOM with solid-state nuclear magnetic resonance spectroscopy, near-edge X-ray absorption fine structure spectroscopy, and X-ray photoelectron spectroscopy of Brazilian soils suggested that PyOM remains stable in soils for centuries to millennia 128 . However, thermodynamic calculations using representative PyOM and unburned dissolved organic matter (DOM) compounds published in 2023 demonstrate substantial overlap between the predicted metabolic rates of PyOM and DOM microbial degradation 127 . Thus, PyOM might not be as markedly resistant to microbial degradation as previously presumed, especially when compared with unburned DOM. ...

Potential bioavailability of representative pyrogenic organic matter compounds in comparison to natural dissolved organic matter pools

Biogeosciences

... Although recent studies have shown distinct spatial patterns of DOM within and across streams (Riedel et al., 2016;Garayburu-Caruso et al., 2020;Stadler et al., 2023;Freeman et al., 2024), the intrinsic and/or extrinsic attributes driving such variations are not yet fully understood. Research has shown that the composition of DOM varies across different scales including in-stream compartments, positions in the river networks, and latitude zones (Jaffé et al., 2012;Roth et al., 2013;Hawkes et al., 2018). ...

Applying the core-satellite species concept: Characteristics of rare and common riverine dissolved organic matter

Frontiers in Water

... 16 When deterministic processes are spatiotemporally inconsistent or insufficient to overcome random effects like hydrologic mixing, stochasticity prevails in DOM molecular dynamics. 17,18 Considering the inherent stochastic nature of molecular dynamics, it is necessary to incorporate stochasticity into the mechanistic comprehension of the spatiotemporal variability of organic matter. 19,20 The interplay between deterministic and stochastic factors in shaping DOM molecular assemblages bears resemblance to ecological processes in the microbial community assembly. ...

Organic matter transformations are disconnected between surface water and the hyporheic zone

Biogeosciences

... Here, we use a ML approach to identify patterns and trends in the previously characterized molecular composition of continental-scale river and sediment DOM samples collected under the crowdsourced Frontiers in Water 03 frontiersin.org Worldwide Hydrobiogeochemical Observation Network for Dynamic River Systems (WHONDRS; see for example Barnard et al., 2022;Borton et al., 2022;Dwivedi et al., 2022;Goldman et al., 2022). The data set was created using Fourier Transform Ion Cyclotron Resonance Mass Spectrometry (FTICR-MS) which generates highly dimensional datasets that largely defy 2D or other linear approaches of data manipulation. ...

It Takes a Village: Using a Crowdsourced Approach to Investigate Organic Matter Composition in Global Rivers Through the Lens of Ecological Theory

Frontiers in Water

... Therefore, secondary chlorination is necessary in this type of distribution system. 21,22,25 On the contrary, the mixture of end water from different sources in a dual-source DWDS avoids secondary chlorination due to the continuous complementary in residual chlorine from two separate sources. The attenuation at the junction is also shaped by the water supply amount. ...

Disinfection byproducts formed during drinking water treatment reveal an export control point for dissolved organic matter in a subalpine headwater stream

Water Research X

... Fourier transform ion cyclotron resonance mass spectrometry (FTICR-MS) is another powerful tool capable of resolving thousands of individual molecular formulae in complex DOM mixtures. In this case, direct injection analyses require desalting and extraction of DOM from the original sample matrix (Kujawinski 2002), which can introduce potential biases in the types of DOM identified (Li et al. 2016(Li et al. , 2017Bahureksa et al. 2021;Nelson et al. 2022). ...

Implications of sample treatment on characterization of riverine dissolved organic matter

Environmental Science: Processes and Impacts

... In brief, the transformation analysis allows researchers to identify potential reactions by first looking at the mass difference between all peaks within a sample, and then comparing these differences to a list of common and known reactions. Recent research has demonstrated that results from this analysis are significantly related to ecosystem biogeochemistry (Graham et al., 2018;Buser-Young et al., 2023) and provide insight into both biotic and abiotic processes (Fudyma et al., 2021;Stegen et al., 2022). Given that these transformations are inherently tied to DOM composition changes, we suspect that the number of transformations within a sample should be related to the functional diversity of that sample (e.g., functional diversity calculated using H/C measures degradation state while transformations also partially estimate degradation may be significantly related). ...

Organic Matter Transformations are Disconnected Between Surface Water and the Hyporheic Zone

... To elucidate these complex assembly mechanisms, researchers typically utilize null models and metrics, such as beta nearest taxon index (βNTI). A null-model-based statistical and quantitative tools can distinguish the relative impacts of stochastic versus deterministic processes, providing a quantitative framework for community analysis [13,14]. The neutral community model (NCM) presents an alternative view, positing that stochastic events like birth, death, migration, and drift are pivotal in shaping microbial community structures. ...

Inferring the Contribution of Microbial Taxa and Organic Matter Molecular Formulas to Ecological Assembly