Marie V. Brasseur’s research while affiliated with University of Duisburg-Essen and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (13)


Location of sampling sites with indication of the proportions of G. pulex (grey) and G. fossarum (black) found (barcoded specimens) . Restoration status in 2020 (stream color) and the year of restoration are indicated. Three in‐stream barriers (1 tunnel, 2 weirs) are indicated together with their date of removal in the Boye catchment (brown “x” and stream section). Catchment areas are indicated with dashed lines and catchment names are written in capitals. Furthermore, the location of the sampling area (https://www.umwelt.nrw.de/system/files/media/images/2015‐07/Karte.jpg) in Germany (https://en.m.wikipedia.org/wiki/File:Locator_map_RVR_in_Germany.svg) is shown.
Population structure of G. pulex in the Emscher catchment (A) COI minimum spanning network colored according to catchments. Orthogonal dashes on network branches indicate mutations between haplotypes. (B) COI haplotype map showing the haplotype distribution. The sizes of haplotype pie charts are scaled according to the numbers of sequences per site and haplotypes of Gp‐C and E are indicated by differently colored frames. Below the names of the restored sites, the year of restoration is displayed. (C) Principal component analysis of the ddRAD‐seq data. Individuals are colored according to catchments and different symbols are used for different COI MOTUs. (D) Ancestry estimates from sNMF analysis for k= 7 are displayed on the map, with vertical bars representing individual ancestry coefficients. Where known, year of first occurrence is indicated below the site names.
Population structure of G. fossarum in the Emscher catchment: (A) COI minimum spanning network colored according to catchments. Orthogonal dashes on network branches lines indicate mutations between haplotypes. (B) COI haplotype map showing the haplotype composition. The sizes of haplotype pie charts are scaled according to the numbers of sequences per site. Private haplotypes are indicated by an asterisk. Below the names of the restored sites, the year of restoration is displayed. (C) Principal component analysis of the ddRAD‐seq data. Individuals are colored according to location and different symbols are used for different COI haplotype groups. (D) Ancestry estimates from sNMF analysis for k = 4 are displayed on the map, with vertical bars representing individual ancestry coefficients. Where known, year of first occurrence is indicated below the site names.
Population Genomics Reveals Small‐Scale Metapopulation Structure of Two Strictly Aquatic Keystone Species in a Recently Restored Urban River System (Emscher, Germany)
  • Article
  • Full-text available

April 2025

·

79 Reads

·

Marie V. Brasseur

·

·

Urbanization and the resulting modifications of freshwater ecosystems can play an important role in shaping metapopulation structure and dynamics of aquatic organisms. Ecological restoration aims at improving river ecosystems by reducing or removing anthropogenic stressors and habitat fragmentation, facilitating natural dispersal among population patches. However, the success of such ecological restoration measures is not guaranteed, and for many of the functionally important but smaller organisms, improved connectivity is difficult to assess. Here, genetic markers can help in assessing small‐scale connectivity and in identifying persisting gene flow barriers. In this study used high‐resolution genetic markers to study the metapopulation structure of two ecologically important amphipod species, Gammarus pulex and Gammarus fossarum, in the heavily urbanized Emscher catchment in Germany. This catchment was strongly degraded and polluted for over a century but has been restored over the past two decades. For both strictly aquatic species, we analyzed mitochondrial cytochrome c oxidase I (COI) gene sequences as well as nuclear genome‐wide single nucleotide polymorphism (SNP) data. We detected strong metapopulation structure within both species, which was mainly driven by catchment affiliation, wastewater, large in‐stream barriers, and recent recolonization of restored stream sections. However, population structure was not fully explained by these factors, indicating that eco‐evolutionary factors such as priority effects, adaptation, or biotic interactions play a role in shaping the population structure. Furthermore, our data show a strong mito‐nuclear discordance for both species with regard to detailed population structure and also the presence of possible cryptic species for G. pulex. Here, nuclear data indicate that the diverging mitochondrial lineages of G. pulex (Gp‐C and Gp‐E) represent only one species in this region. Our study shows how genetic markers can support the assessment of population connectivity and thus evaluate the success of ecological restoration.

Download

Chromosome-Level Genome Assembly of the Viviparous Eelpout Zoarces viviparus

July 2024

·

32 Reads

·

1 Citation

Genome Biology and Evolution

Nico Fuhrmann

·

Marie V Brasseur

·

Christina E Bakowski

·

[...]

·

The viviparous eelpout Zoarces viviparus is a common fish across the North-East Atlantic and has successfully colonized habitats across environmental gradients. Due to its wide distribution and predictable phenotypic responses to pollution, Z. viviparus is used as bioindicator organism and has been routinely sampled over decades by several countries to monitor marine environmental health. Additionally, this species is a promising model to study adaptive processes related to environmental change, specifically global warming. Here, we report the chromosome-level genome assembly of Z. viviparus, which has a size of 663 mega base pairs (mbp) and consists of 607 scaffolds (N50 = 26 mbp). The 24 largest represent the 24 chromosomes of the haploid Z. viviparus genome, which harbors 98% of the complete BUSCOs defined for ray-finned fish, indicating that the assembly is highly contiguous and complete. Comparative analyses between the Z. viviparus assembly and chromosome-level genomes of two other eelpout species revealed a high synteny, but also an accumulation of repetitive elements in the Z. viviparus genome. Our reference genome will be an important resource enabling future in-depth genomic analyses of the effects of environmental change in this important bioindicator species.


Fine sediment and the insecticide chlorantraniliprole inhibit organic‐matter decomposition in streams through different pathways

January 2024

·

84 Reads

·

1 Citation

Freshwater Biology

Intensive agriculture drives an ongoing deterioration of stream biodiversity and ecosystem functioning across the planet. Key agricultural stressors include increased deposited fine sediment and insecticides flushed from adjacent land into streams. The individual and combined effects on aquatic biota are increasingly studied, but the functional consequences of biodiversity loss associated with agricultural stressors remain poorly understood. We addressed this knowledge gap by examining the effects of fine sediment and different concentrations of the insecticide chlorantraniliprole on organic‐matter decomposition. We conducted an outdoor stream mesocosm experiment. Mesocosms contained a standardised organic‐matter assay (the cotton‐strip assay), which was used to assess organic‐matter decomposition rates (as tensile‐strength loss of the fabric) and microbial respiration of the cotton strips. The decomposition rate of strips buried under fine sediment was inhibited, a result we attribute to the limited accessibility for invertebrate feeding and microbial activities, as well as the limited nutrient and dissolved oxygen exchange. The insecticide also inhibited decay rates, a finding we relate to reduced invertebrate grazing and associated excessive algal growth. In contrast to decomposition rates, we did not observe stressor effects on microbial respiration. An interaction between fine sediment and chlorantraniliprole was not identified. Our results suggest that stressors induced by agriculture affect functions of stream ecosystems through a variety of pathways and operate by modifying habitats and biotic interactions. By examining a combination of stressors and responses that have not been addressed before, this study gives important insights into the effects of agricultural practices on streams. Understanding the effects of chlorantraniliprole is especially important since it is likely to become more widely used in future agricultural practice due to the increasing ban on neonicotinoid insecticides. Furthermore, most experimental studies address multiple stressor effects on biota. For a comprehensive understanding of complex stressor effects on ecosystems, ecosystem functions also need to be studied, such as the organic‐matter decomposition within streams.


of the study design. ExStream samples were processed using omics-based methods and amplicon sequencing, and HTS data were processed using two clustering methods (only applicable to amplicon sequencing), six taxonomic levels, two data types, with or without feature selection, and eight machine learning algorithms, for a total of 1,536 evaluated combinations of sequencing and data-processing methods. KNN, k-Nearest Neighbors; Lasso, Logistic Lasso Regression; Ridge, Logistic Ridge Regression; LSVC, Linear Support Vector Classification; MLP, Multilayer Perceptron; RF, Random Forest; SVC, Support Vector Classification; XGB, XGBoost.
Number of total, unique, and overlapping taxa for each taxonomic dataset on the phylum, genus, and species level (chord diagrams), as well as the distribution of taxonomic groups within each taxonomic dataset (pie charts). In the chord diagrams, the size of the outer bars represents the total number of detected taxa, the size of the connections between taxonomic datasets represents the number of overlapping taxa, and the fraction of outer bars with no connection to other taxonomic datasets represents the number of unique taxa detected only in that taxonomic dataset.
MCC as a proxy for SPP across all combinations of sequencing and data-processing methods tested. Since data types had no significant impact on SPP (see Figures 4, 5), only P–A-based SPPs are shown.
Correlation between MCC as a proxy for SPP and sequencing and data-processing methods.
Correlation between MCC as a proxy for SPP and data-processing methods for individual taxonomic datasets.
Predicting environmental stressor levels with machine learning: a comparison between amplicon sequencing, metagenomics, and total RNA sequencing based on taxonomically assigned data

November 2023

·

137 Reads

·

5 Citations

Introduction Microbes are increasingly (re)considered for environmental assessments because they are powerful indicators for the health of ecosystems. The complexity of microbial communities necessitates powerful novel tools to derive conclusions for environmental decision-makers, and machine learning is a promising option in that context. While amplicon sequencing is typically applied to assess microbial communities, metagenomics and total RNA sequencing (herein summarized as omics-based methods) can provide a more holistic picture of microbial biodiversity at sufficient sequencing depths. Despite this advantage, amplicon sequencing and omics-based methods have not yet been compared for taxonomy-based environmental assessments with machine learning. Methods In this study, we applied 16S and ITS-2 sequencing, metagenomics, and total RNA sequencing to samples from a stream mesocosm experiment that investigated the impacts of two aquatic stressors, insecticide and increased fine sediment deposition, on stream biodiversity. We processed the data using similarity clustering and denoising (only applicable to amplicon sequencing) as well as multiple taxonomic levels, data types, feature selection, and machine learning algorithms and evaluated the stressor prediction performance of each generated model for a total of 1,536 evaluated combinations of taxonomic datasets and data-processing methods. Results Sequencing and data-processing methods had a substantial impact on stressor prediction. While omics-based methods detected a higher diversity of taxa than amplicon sequencing, 16S sequencing outperformed all other sequencing methods in terms of stressor prediction based on the Matthews Correlation Coefficient. However, even the highest observed performance for 16S sequencing was still only moderate. Omics-based methods performed poorly overall, but this was likely due to insufficient sequencing depth. Data types had no impact on performance while feature selection significantly improved performance for omics-based methods but not for amplicon sequencing. Discussion We conclude that amplicon sequencing might be a better candidate for machine-learning-based environmental stressor prediction than omics-based methods, but the latter require further research at higher sequencing depths to confirm this conclusion. More sampling could improve stressor prediction performance, and while this was not possible in the context of our study, thousands of sampling sites are monitored for routine environmental assessments, providing an ideal framework to further refine the approach for possible implementation in environmental diagnostics.


Overview of the field experiment. A Stream water was pumped into header tanks and supplied to 64 mesocosms. The macroinvertebrate communities living in the mesocosms were exposed to the stressors in a 4 × 2 factorial design (4 pesticide levels, 2 fine sediment levels) with 8 replicates per treatment. B Time course of the experiment. A 21-day colonization period was followed by a 21-day stressor period. For this study, specimens were sampled after the insecticide pulse (day 4, red square). C Stream water entered the mesocosms via the inflow, flowed in a clockwise orientation, and left the system through the central circular opening. The sieve in the outflow allowed sampling of drifting organisms to estimate stressor effects for the macroinvertebrate community
Biplot of the first two principal component axes for AL. basale and BG. pulex
Heatmap of differentially expressed genes in L. basale (A) and G. pulex (B). Gray bars represent genes with no significant differences in expression between the treatment and control conditions (adjusted p-value ≥ 0.05). Red and blue numbers represent up- and downregulated genes, respectively. Fold changes are reported at the binary logarithmic scale (Log2FC)
Clustering of insecticide-responsive genes according to their expression profile along an increasing insecticide concentration gradient. A–F Differentially expressed genes in L. basale under high insecticide concentration. G–J Differentially expressed genes in L. basale under high insecticide concentration combined with increased fine sediment. Z-scores represent the scaled and centered gene expression value. K Cluster identities of differentially expressed genes in the single (purple arc, right) and the two-stressor treatment (red arc, left). Black lines indicate groups of genes for which the average expression pattern is similar between the two stressor scenarios. The purple and red arcs only contain the genes that are differentially expressed in both treatments
Multiple stressor effects of insecticide exposure and increased fine sediment deposition on the gene expression profiles of two freshwater invertebrate species

September 2023

·

224 Reads

·

1 Citation

Environmental Sciences Europe

Background Freshwater ecosystem degradation and biodiversity decline are strongly associated with intensive agricultural practices. Simultaneously occurring agricultural stressors can interact in complex ways, preventing an accurate prediction of their combined effects on aquatic biota. Here, we address the limited mechanistic understanding of multiple stressor effects of two globally important stressors, an insecticide (chlorantraniliprole), and increased fine sediment load and assessed their impact on the transcriptomic profile of two stream macroinvertebrates: the amphipod Gammarus pulex and the caddisfly Lepidostoma basale. Results We identified mainly antagonistic stressor interactions at the transcriptional level, presumably because the insecticide adsorbed to fine sediment particles. L. basale, which is phylogenetically more closely related to the insecticide’s target taxon Lepidoptera, exhibited strong transcriptional changes when the insecticide stressor was applied, whereas no clear response patterns were observed in the amphipod G. pulex. These differences in species vulnerability can presumably be attributed to molecular mechanisms determining the cellular affinity toward a stressor as well as differential exposure patterns resulting from varying ecological requirements between L. basale and G. pulex. Interestingly, the transcriptional response induced by insecticide exposure in L. basale was not associated with a disruption of the calcium homeostasis, which is the described mode of action for chlorantraniliprole. Instead, immune responses and alterations of the developmental program appear to play a more significant role. Conclusions Our study shows how transcriptomic data can be used to identify multiple stressor effects and to explore the molecular mechanisms underlying stressor-induced physiological responses. As such, stressor effects assessed at the molecular level can inform about modes of action of chemicals and their interplay with non-chemical stressors. We demonstrated that stressor effects vary between different organismic groups and that insecticide effects are not necessarily covered by their described mode of action, which has important implications for environmental risk assessment of insecticides in non-target organisms.


Transcriptomic sequencing data illuminate insecticide-induced physiological stress mechanisms in aquatic non-target invertebrates

August 2023

·

44 Reads

·

8 Citations

Environmental Pollution

Pesticides are major agricultural stressors for freshwater species. Exposure to pesticides can disrupt the biotic integrity of freshwater ecosystems and impair associated ecosystem functions. Unfortunately, physiological mechanisms through which pesticides affect aquatic organisms are largely unknown. For example, the widely-used insecticide chlorantraniliprole is supposed to be highly selective for target pest species, i.e. Lepidoptera (butterflies), but its effect in aquatic non-target taxa is poorly studied. Using RNA-sequencing data, we quantified the insecticide effect on three aquatic invertebrate species: the caddisfly Lepidostoma basale, the mayfly Ephemera danica and the amphipod Gammarus pulex. Further, we tested how the insecticide-induced transcriptional response is modulated by biotic interaction between the two leaf-shredding species L. basale and G. pulex. While G. pulex was only weakly affected by chlorantraniliprole exposure, we detected strong transcriptional responses in L. basale and E. danica, implying that the stressor receptors are conserved between the target taxon Lepidoptera and other insect groups. We found in both insect species evidence for alterations of the developmental program. If transcriptional changes in the developmental program induce alterations in emergence phenology, pronounced effects on food web dynamics in a cross-ecosystem context are expected.


Exploring macroinvertebrate biodiversity in the dynamic southern Balkan stream network of the Vjosa using preservative-based DNA metabarcoding

April 2023

·

250 Reads

·

5 Citations

Aquatic Sciences

Reliable biodiversity data are crucial for environmental research and management. Unfortunately, data paucity prevails for many regions and organismal groups such as aquatic invertebrates. High-throughput DNA-based identification, in particular DNA metabarcoding, has accelerated biodiversity data generation. However, in the process of metabarcoding, specimens are usually destroyed, precluding later specimen-based analyses. Metabarcoding of DNA released into the preservative ethanol has been proposed as a non-destructive alternative, but proof-of-principle studies have yielded ambiguous results, reporting variance in detection probability for various taxa and methodological biases. In this study, we tested the performance of preservative-based metabarcoding of aquatic invertebrates in comparison to a standard morpho-taxonomic assessment based on samples from one of Europe’s last free-flowing rivers, the Vjosa. Multi-habitat samples were collected at 43 sites in two seasons and stored in ethanol, after fixation in formaldehyde for morpho-taxonomic analyses. Preservative-based DNA metabarcoding detected three times more taxa but failed to detect other taxa found using the standard method. In addition to incomplete reference data and primer bias that likely precluded the detection of specific taxa, preservative-based DNA metabarcoding cannot provide accurate abundance estimates. However, the metabarcoding data revealed distinct small-scale and large-scale community patterns in the Vjosa river network, which were also recovered by quantitative data of the standard approach. Overall, our results indicate that preservative-based metabarcoding provides important biodiversity data, which could be further improved by quantitative validation. The method is robust and reliable, even though samples were taken under harsh field-conditions and stored without cooling. Further, our results emphasise the need for reliable DNA barcoding reference libraries. Building those may be supported by preservative-based metabarcoding that maintains intact vouchers for subsequent specimen-based analyses.


Illustration of MitoGeneExtractor algorithm. Exonerate is called to translate and align the DNA sequence reads to the amino acid reference (a). Then, MitoGeneExtractor creates the MSA of reads (b) and infers a gene consensus sequence (c). The degeneracy of the genetic code allows a considerable DNA sequence variation between the reads and the reference (d).
Runtime differences between MitoGeneExtractor (MGE) and MitoFinder. Total runtime needed to reconstruct only COI, or all PCGs with MitoGeneExtractor compared to the runtime for the MitoFinder assembly and annotation, depending on various dataset sizes.
COI (left) and ND5 (right) reconstruction success with MitoGeneExtractor (blue) and MitoFinder (red). Density plots indicate the probability density curve. Dots show the number of nucleotides in individual sequences. Diamonds indicate the median nucleotide recovery obtained with MitoGeneExtractor (COI = 1545, ND5 = 1755) and MitoFinder (COI = 0, ND5 = 0). The avian COI and ND5 gene typically comprise 1551 and 1818 nucleotides respectively.
MitoGeneExtractor: Efficient extraction of mitochondrial genes from next‐generation sequencing libraries

February 2023

·

120 Reads

·

6 Citations

Mitochondrial DNA (mtDNA) sequences are often found as byproducts in next‐generation sequencing (NGS) datasets that were originally created to capture genomic or transcriptomic information of an organism. These mtDNA sequences are often discarded, wasting this valuable sequencing information. We developed MitoGeneExtractor, an innovative tool which allows to extract mitochondrial protein coding genes (PCGs) of interest from NGS libraries through multiple sequence alignments of sequencing reads to amino acid references. General references, for example on order level are sufficient for mining mitochondrial PCGs. In a case study, we applied MitoGeneExtractor to recently published genomic datasets of 1993 birds and were able to extract complete or nearly complete sequences for all 13 mitochondrial PCGs for a large proportion of libraries. Compared to an existing assembly guided sequence reconstruction algorithm, MitoGeneExtractor was faster and substantially more sensitive. We compared COI sequences mined with MitoGeneExtractor to COI databases. Mined sequences show a high sequence similarity and correct taxonomic assignment between the recovered sequence and the assigned morphospecies in most samples. In some cases of incongruent taxonomic assignments, we found evidence for contamination in NGS libraries. MitoGeneExtractor allows a fast extraction of mitochondrial PCGs from a wide range of NGS datasets. We recommend to routinely harvest and curate mitochondrial sequence information from genomic resources. MitoGeneExtractor output can be used to identify contaminated NGS libraries and to validate the species identity of the sequenced animal based on the extracted COI sequences.


Heatmap showing the differential expression of genes in response to single and combined stressor exposure. Grey bars indicate changes in expression, which are not significant (adjusted p-values ≥ 0.1). Numbers in red and blue denote upregulated and downregulated genes, respectively
UpSet plots, summarising unique and shared differentially expressed genes due to exposure to different treatments. Black dots in panel matrices correspond to different Venn diagram sections: connected dots indicate intersections of sets of differentially expressed genes; singular dots represent unique sets of differentially expressed genes. Vertical barplots summarise the number of upregulated (a) and downregulated (b) genes. Horizontal bars indicate the total number of regulated genes in each treatment combination. The set union of all upregulated and downregulated genes detected in salinity treatments represents 443 salinity genes, of which one gene was upregulated when increased salinity was combined with reduced flow but downregulated when all three stressors were applied (= 442 exclusively regulated genes)
Clustering of genes, which showed an interaction between two stressors. The expression significantly differed from expectations based on the single stressor effects due to the interaction between salinity and flow (a-c), sediment and salinity (d) or sediment and flow (e). The z-score represents the scaled and centred gene expression. Positive and negative z-scores indicate expression values above and below the average across samples, respectively. Pale lines connect expression values of individual genes, bold lines the average expression values
Number of genes, which have an additional LFC due to the interaction between stressors
Impacts of multiple anthropogenic stressors on the transcriptional response of Gammarus fossarum in a mesocosm field experiment

December 2022

·

189 Reads

·

6 Citations

BMC Genomics

Background Freshwaters are exposed to multiple anthropogenic stressors, leading to habitat degradation and biodiversity decline. In particular, agricultural stressors are known to result in decreased abundances and community shifts towards more tolerant taxa. However, the combined effects of stressors are difficult to predict as they can interact in complex ways, leading to enhanced (synergistic) or decreased (antagonistic) response patterns. Furthermore, stress responses may remain undetected if only the abundance changes in ecological experiments are considered, as organisms may have physiological protective pathways to counteract stressor effects. Therefore, we here used transcriptome-wide sequencing data to quantify single and combined effects of elevated fine sediment deposition, increased salinity and reduced flow velocity on the gene expression of the amphipod Gammarus fossarum in a mesocosm field experiment. Results Stressor exposure resulted in a strong transcriptional suppression of genes involved in metabolic and energy consuming cellular processes, indicating that G . fossarum responds to stressor exposure by directing energy to vitally essential processes. Treatments involving increased salinity induced by far the strongest transcriptional response, contrasting the observed abundance patterns where no effect was detected. Specifically, increased salinity induced the expression of detoxification enzymes and ion transporter genes, which control the membrane permeability of sodium, potassium or chloride. Stressor interactions at the physiological level were mainly antagonistic, such as the combined effect of increased fine sediment and reduced flow velocity. The compensation of the fine sediment induced effect by reduced flow velocity is in line with observations based on specimen abundance data. Conclusions Our findings show that gene expression data provide new mechanistic insights in responses of freshwater organisms to multiple anthropogenic stressors. The assessment of stressor effects at the transcriptomic level and its integration with stressor effects at the level of specimen abundances significantly contribute to our understanding of multiple stressor effects in freshwater ecosystems.


Figure 3: MCC, i.e., SPP across all combinations of sequencing and data-processing methods tested. Since data types had no significant impact on SPP (see Figures 4 and 5), only P-A-based SPPs are shown. SPPs varied substantially between combinations, indicated by the wide range and uneven distribution of MCC values.
Figure 4: Correlation between MCC, i.e., SPP and sequencing and data-processing methods.
Figure 5: Correlation between MCC, i.e., SPP and data-processing methods for individual taxonomic datasets.
Predicting environmental stressor levels with machine learning: a comparison between amplicon sequencing, metagenomics, and total RNA sequencing based on taxonomically assigned data

November 2022

·

226 Reads

·

1 Citation

Background: Microbes are increasingly (re)considered for environmental assessments because they are powerful indicators for the health of ecosystems. The complexity of microbial communities necessitates powerful novel tools to derive conclusions for environmental decision-makers, and machine learning is a promising option in that context. While amplicon sequencing is typically applied to assess microbial communities, metagenomics and total RNA sequencing (herein summarized as omics-based methods) can provide a more holistic picture of microbial biodiversity at sufficient sequencing depths. Despite this advantage, amplicon sequencing and omics-based methods have not yet been compared for taxonomy-based environmental assessments with machine learning. In this study, we applied 16S and ITS-2 sequencing, metagenomics, and total RNA sequencing to samples from a stream mesocosm experiment that investigated the impacts of two aquatic stressors, insecticide and increased fine sediment deposition, on stream biodiversity. We processed the data using similarity clustering and denoising (only applicable to amplicon sequencing) as well as multiple taxonomic levels, data types, feature selection, and machine learning algorithms and evaluated the stressor prediction performance of each generated model for a total of 1,536 evaluated combinations of taxonomic datasets and data-processing methods. Results: Sequencing and data-processing methods had a substantial impact on stressor prediction. While omics-based methods detected much more taxa than amplicon sequencing, 16S sequencing outperformed all other sequencing methods in terms of stressor prediction based on the Matthews Correlation Coefficient. However, even the highest observed performance for 16S sequencing was still only moderate. Omics-based methods performed poorly overall, but this was likely due to insufficient sequencing depth. Data types had no impact on performance while feature selection significantly improved performance for omics-based methods but not for amplicon sequencing. Conclusion: Amplicon sequencing might be a better candidate for machine-learning-based environmental stressor prediction than omics-based methods, but the latter require further research at higher sequencing depths to confirm this conclusion. More sampling could improve stressor prediction performance, and while this was not possible in the context of our study, thousands of sampling sites are monitored for routine environmental assessments, providing an ideal framework to further refine the approach for possible implementation in environmental diagnostics.


Citations (9)


... These traits appeared far after the origin of AFP III and between periods of extensive glaciation in the Eocene-Oligocene and Pliocene-Pleistocene [106][107][108][109][110][111] that shaped the evolutionary history of other cold-adapted [13,18,19] and deep-sea [42,79] fishes, but largely precede the invasion of extreme habitats in many lineages (figures 1, 3 and 4A). Innovations include facultative air-breathing, which we infer convergently evolved in gunnels (Pholidae), monkeyface prickleback Cebidichthys violaceus [36,37,84], viviparous eelpout Zoarces viviparus [81], and twice in Stichaeidae [37,112], ovoviviparity, which evolved once in a subclade of eelpouts (Zoarces) [82,83,113,114], the extensive association of juveniles with scyphozoans by the prowfish Zaprora silenus [85], the multiple origins of fangs across the Zoarcoidea (figure 3) and the evolution of molariform dentition in wolffishes (Anarhichadidae) [89,90]. These results exemplify the diverse ecologies and life history strategies of zoarcoids and show that, apart from AFP III, many innovations evolved in a pulse (figure 3) prior to a high frequency of invasions into deep sea and polar habitats (figures 1, 3 and 4). ...

Reference:

The many origins of extremophile fishes
Chromosome-Level Genome Assembly of the Viviparous Eelpout Zoarces viviparus
  • Citing Article
  • July 2024

Genome Biology and Evolution

... In this study, we compare the performance of amplicon sequencing, metagenomics, and total RNA-Seq to predict environmental stressor levels based on taxonomically assigned data using machine learning. We used samples obtained from an ExStream system [48] consisting of stream mesocosms that were exposed to fine sediment and an insecticide to investigate the impact of these aquatic key stressors on stream biodiversity and decomposition of organic [49] [note to reviewers: this citation is in submission and will be updated once it is published]. For amplicon sequencing, we used the two marker genes ITS-2 and 16S both with an operational taxonomic unit (OTU) clustering and an exact sequence variant (ESV) denoising method. ...

Fine sediment and the insecticide chlorantraniliprole inhibit organic‐matter decomposition in streams through different pathways
  • Citing Article
  • January 2024

Freshwater Biology

... These results support that RFE adapts well to different microbial strategies, offering flexibility depending on the spatial structure and ecological behavior of each taxonomic group. Similar conclusions were reached by Hempel et al. (2023), who demonstrated that machine learning models combined with RFE could successfully predict environmental stressor levels using taxonomically assigned microbial data across sequencing approaches. The effectiveness of RFE, when paired with high-resolution environmental covariates, reinforces its value for microbial spatial modeling in heterogeneous agroecosystems, especially those shaped by topographic and grazing gradients (King et al., 2010;Wilhelm et al., 2022). ...

Predicting environmental stressor levels with machine learning: a comparison between amplicon sequencing, metagenomics, and total RNA sequencing based on taxonomically assigned data

... The full-factorial ExStream system which has been applied for organisms of multiple trophic 111 levels in streams worldwide is an example of targeted stressor testing by manipulating abiotic 112 factors [42,43,[51][52][53][54][55]. In studies based on this mesocosm system and focusing on the 113 preprint (which was not certified by peer review) is the author/funder. ...

Multiple stressor effects of insecticide exposure and increased fine sediment deposition on the gene expression profiles of two freshwater invertebrate species

Environmental Sciences Europe

... The tested aquatic community consists of species having important key roles in a freshwater system and each of the species is regularly considered for ecotoxicological studies (Brasseur et al., 2023;Brettschneider et al., 2019;Cold and Forbes, 2004;Jourdan et al., 2019;Nørum et al., 2011;OECD, 2007;OECD, 2016;Rasmussen et al., 2013a;Rosa et al., 2016;Ruppert et al., 2017;Sardo and Soares, 2010;Soose et al., 2023). For this reason, adverse effects on the tested representatives of an aquatic community are strong indicators of a negative impact on the overall trophic structure of the aquatic biocoenosis and thus on the entire aquatic ecosystem. ...

Transcriptomic sequencing data illuminate insecticide-induced physiological stress mechanisms in aquatic non-target invertebrates
  • Citing Article
  • August 2023

Environmental Pollution

... [53] and processed using phrynomics [54]. COI sequences were extracted with MitoGeneExtractor [55] using the generated Sanger reference and aligned with MAFFT [44]. Poorly aligned bases were trimmed (alignment = 612 bp). ...

MitoGeneExtractor: Efficient extraction of mitochondrial genes from next‐generation sequencing libraries

... It is likely that terrestrial invertebrate communities are overall more difficult to detect with preservative-based DNA metabarcoding than soft-body dominated communities, e.g. benthic invertebrate communities (e.g., Brasseur et al., 2023). However, we also acknowledge that species pools were different for each trap content and species numbers per sample varied substantially. ...

Exploring macroinvertebrate biodiversity in the dynamic southern Balkan stream network of the Vjosa using preservative-based DNA metabarcoding

Aquatic Sciences

... For example, at higher temperatures, increased metabolic activity exacerbates the impacts of salinity on zooplankton of the genus Cladocera (Mart ınez-Meg ıas and Rico, 2022). Conversely, in the freshwater amphipod Gammarus fossarum, pressures interact antagonistically at the physiological level, as seen where the effect induced by fine sediment is compensated for by reduced flow velocity (Brasseur et al., 2022). ...

Impacts of multiple anthropogenic stressors on the transcriptional response of Gammarus fossarum in a mesocosm field experiment

BMC Genomics

... Additionally, as an external outgroup, we included the mitochondrial genome sequences of five Syrphidae species belonging to the subfamily Eristalinae, which is phylogenetically close to the subfamily Syrphinae [5,35] (Supplementary Table S1). ...

Systematics and evolution of predatory flower flies (Diptera: Syrphidae) based on exon‐capture sequencing
  • Citing Article
  • October 2022

Systematic Entomology