About
182
Publications
48,354
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
16,675
Citations
Citations since 2017
Introduction
Björn Grüning currently works at the Inst. of Computer Science, University of Freiburg and is leading the Freiburg Galaxy team.
Additional affiliations
August 2013 - present
November 2009 - July 2013
Pharmaceutical Sciences
Position
- University of Freiburg
November 2009 - July 2013
Publications
Publications (182)
Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonst...
The Coronavirus disease 2019 (COVID-19) pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) resulted in a major health crisis worldwide with its continuously emerging new strains, resulting in new viral variants that drive “waves” of infection. PCR or antigen detection assays have been routinely used to detect clinic...
Highly multiplexed tissue imaging (MTI) is accomplished by applying powerful antibody-based spatial proteomics technologies that characterize tissues in situ at single-cell and potentially subcellular resolution and enable the creation of two-dimensional tissue maps. Example MTI methods include cyclic immunofluorescence (CyCIF), multiplex immunohis...
Galaxy is a mature, browser accessible workbench for scientific computing. It enables scientists to share, analyze and visualize their own data, with minimal technical impediments. A thriving global community continues to use, maintain and contribute to the project, with support from multiple national infrastructure providers that enable freely acc...
There are thousands of well-maintained high-quality open-source software utilities for all aspects of scientific data analysis. For over a decade, the Galaxy Project has been providing computational infrastructure and a unified user interface for these tools to make them accessible to a wide range of researchers. In order to streamline the process...
Background
Sepsis is associated with high platelet turnover and elevated levels of immature platelets. Changes in the platelet transcriptome and the specific impact of immature platelets on the platelet transcriptome remain unclear. Thus, this study sought to address whether and how elevated levels of immature platelets affect the platelet transcri...
The amount of public proteomics data is increasing at an extraordinary rate. Hundreds of datasets are submitted each month to ProteomeXchange repositories, representing many types of proteomics studies, focusing on different aspects such as quantitative experiments, post-translational modifications, protein-protein interactions, or subcellular loca...
The COVID-19 pandemic is shifting teaching to an online setting all over the world. The Galaxy framework facilitates the online learning process and makes it accessible by providing a library of high-quality community-curated training materials, enabling easy access to data and tools, and facilitates sharing achievements and progress between studen...
The COVID-19 pandemic is the first global health crisis to occur in the age of big genomic data. Although data generation capacity is well established and sufficiently standardized, analytical capacity is not. To establish analytical capacity it is necessary to pull together global computational resources and deliver the best open source tools and...
Hypomethylating agents (HMA) have become the backbone of nonintensive acute myeloid leukemia/myelodysplastic syndrome (AML/MDS) treatment, also by virtue of their activity in patients with adverse genetics, for example, monosomal karyotypes, often with losses on chromosome 7, 5, or 17. No comparable activity is observed with cytarabine, a cytidine...
This paper is a tutorial developed for the data analysis platform Galaxy. The purpose of Galaxy is to make high-throughput computational data analysis, such as molecular dynamics, a structured, reproducible and transparent process. In this tutorial we focus on 3 questions: How are protein-ligand systems parameterized for molecular dynamics simulati...
Update of BioContainers and BioConda projects.
Motivation:
The correct prediction of bacterial sRNA homologs is a prerequisite for many downstream analyses based on comparative genomics, but it is frequently challenging due to the short length and distinct heterogeneity of such homologs. GLASSGo is an efficient tool for the prediction of sRNA homologs from a single input query. To make the alg...
Abstract Here, we introduce the ChemicalToolbox, a publicly available web server for performing cheminformatics analysis. The ChemicalToolbox provides an intuitive, graphical interface for common tools for downloading, filtering, visualizing and simulating small molecules and proteins. The ChemicalToolbox is based on Galaxy, an open-source web-base...
Background
Reticulated platelets (RP) are the youngest circulating platelets in blood. An increased amount of this subpopulation is associated with higher cardiovascular risk and mortality.
Objectives
It is unknown to what extent intrinsic properties of RP contribute to their hyperreactive features. This study is the first providing a multifactori...
This paper is a tutorial developed for the data analysis platform Galaxy. The purpose of Galaxy is to make high-throughput computational data analysis, such as molecular dynamics, a structured, reproducible and transparent process. In this tutorial we focus on 3 questions: How are protein-ligand systems parameterized for molecular dynamics simulati...
The Omics Discovery Index is an open source platform that can be used to access, discover and disseminate omics datasets. OmicsDI integrates proteomics, genomics, metabolomics, models and transcriptomics datasets. Using an efficient indexing system, OmicsDI integrates different biological entities including genes, transcripts, proteins, metabolites...
Background
Infinium Human Methylation BeadChip is an array platform for complex evaluation of DNA methylation at an individual CpG locus in the human genome based on Illumina’s bead technology and is one of the most common techniques used in epigenome-wide association studies. Finding associations between epigenetic variation and phenotype is a sig...
The Galaxy HiCExplorer provides a web service at https://hicexplorer.usegalaxy.eu. It enables the integrative analysis of chromosome conformation by providing tools and computational resources to pre-process, analyse and visualize Hi-C, Capture Hi-C (cHi-C) and single-cell Hi-C (scHi-C) data. Since the last publication, Galaxy HiCExplorer has been...
Bioinformatics software development has become a cornerstone in modern biology research. Large-scale quantitative biology studies have created a demand for more complex workflows and data analysis pipelines. Challenges in reproducing bioinformatics analyses are compounded by the fact that the programs themselves are difficult to install on computer...
Chromatin loops are an important factor in the structural organization of the genome. The detection of chromatin loops in Hi-C interaction matrices is a challenging and compute intensive task. The presented approach shows a chromatin loop detection algorithm which applies a strict candidate selection based on continuous negative binomial distributi...
Single-cell Hi-C interaction matrices are high dimensional and very sparse. To cluster thousands of single-cell Hi-C interaction matrices they are flattened and compiled into one matrix. This matrix can, depending on the resolution, have a few millions or even billions of features and any computation with it is therefore memory demanding. A common...
PouV and SoxB1 family transcription factors (TFs) have emerged as master regulators of cell fate transitions. To investigate the genetic interactions between Pou5f3 and Sox19b in zebrafish embryos passing through Zygotic Genome Activation (ZGA), we combined time-resolved mutant transcription analysis using the novel tool RNA-sense, chromatin state...
To gain a thorough appreciation of microbiome dynamics, researchers characterize the functional role of expressed microbial genes/proteins. This can be accomplished through metaproteomics, which characterizes the protein complement of the microbiome. Several software tools exist for analyzing microbiomes at the functional level by measuring their c...
Abstract Bioinformaticians and biologists rely increasingly upon workflows for the flexible utilization of the many life science tools that are needed to optimally convert data into knowledge. We outline a pan-European enterprise to provide a catalogue (https://bio.tools) of tools and databases that can be used in these workflows. bio.tools not onl...
Background
RNA plays essential roles in all known forms of life. Clustering RNA sequences with common sequence and structure is an essential step towards studying RNA function. With the advent of high-throughput sequencing techniques, experimental and genomic data are expanding to complement the predictive methods. However, the existing methods do...
Background
Mass spectrometry imaging is increasingly used in biological and translational research because it has the ability to determine the spatial distribution of hundreds of analytes in a sample. Being at the interface of proteomics/metabolomics and imaging, the acquired datasets are large and complex and often analyzed with proprietary softwa...
Galaxy is a web-based and open-source scientific data-processing platform. Researchers compose pipelines in Galaxy to analyse scientific data. These pipelines, also known as workflows, can be complex and difficult to create from thousands of tools, especially for researchers new to Galaxy. To make creating workflows easier, faster and less error-pr...
The German Network for Bioinformatics Infrastructure (de.NBI) is a national and academic infrastructure funded by the German Federal Ministry of Education and Research (BMBF). The de.NBI provides (i) service, (ii) training, and (iii) cloud computing to users in life sciences research and biomedicine in Germany and Europe and (iv) fosters the cooper...
Epigenetic variations alter gene expression among various diseases and histone modifications play an important role in controlling gene expression. Predicting gene expression from histone modification signals can be useful for designing epigenetic drugs to combat diseases. Multiple computational models have been proposed to use histone modification...
Ribosome profiling (ribo-seq) provides a means to analyze active translation by determining ribosome occupancy in a transcriptome-wide manner. The vast majority of ribosome protected fragments (RPFs) resides within the protein-coding sequence of mRNAs. However, commonly reads are also found within the transcript leader sequence (TLS) (aka 5' untran...
New methods in drug discovery, such as combinatorial chemistry and high-throughput screening, have resulted in the creation of huge amounts of new chemical data. The need to analyse this data effectively has spawned the field of cheminformatics, which applies computational methods to chemical problems. Numerous toolkits and applications have been d...
BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelin...
RNA has become one of the major research topics in molecular biology. As a central player in key processes regulating gene expression, RNA is in the focus of many efforts to decipher the pathways that govern the transition of genetic information to a fully functional cell. As more and more researchers join this endeavour, there is a rapidly growing...
Background: Mass spectrometry imaging is increasingly used in biological and translational research as it has the ability to determine the spatial distribution of hundreds of analytes in a sample. Being at the interface of proteomics/metabolomics and imaging, the acquired data sets are large and complex and often analyzed with proprietary software...
The increasing complexity of data and analysis methods has created an environment where scientists, who may not have formal training, are finding themselves playing the impromptu role of software engineer. While several resources are available for introducing scientists to the basics of programming, researchers have been left with little guidance o...
Background
Human retinal microvascular endothelial cells (HRMVECs) are involved in the pathogenesis of retinopathy of prematurity. In this study, the microRNA (miRNA) expression profiles of HRMVECs were investigated under resting conditions, angiogenic stimulation (VEGF treatment) and anti-VEGF treatment.
Materials and methods
The miRNA profiles o...
Chorismate constitutes a branch‐point intermediates in the biosynthesis of many aromatic metabolites in microorganisms and plants. To obtain unnatural compounds, we modified the route to menaquinone in E. coli. We propose a model for the binding of isochorismate to the active site of MenD (SEPHCHC synthase) that explains the outcome of the native r...
RNA plays essential regulatory roles in all known forms of life. Clustering RNA sequences with common sequence and structure is an essential step towards studying RNA function. With the advent of high-throughput sequencing techniques, experimental and genomic data are expanding to complement the predictive methods. However, the existing methods do...
Background
Epigenome-wide association studies (EWAS) analyse genome-wide activity of epigenetic marks in cohorts of different individuals to find associations between epigenetic variation and phenotype. One of the most common technique used in EWAS studies is the Infinium Methylation Assay, which quantifies the DNA methylation level of over 450k lo...
Motivation
The pathway from genomics through proteomics and onto a molecular description of biochemical processes make the discovery of drugs and biomaterials possible. A research framework common to genomics and proteomics is needed to conduct biomolecular simulations that will connect biological data to the dynamic molecular mechanisms of enzymes...
The zebrafish embryo is transcriptionally mostly quiescent during the first 10 cell cycles, until the main wave of zygotic genome activation (ZGA) occurs, accompanied by fast chromatin remodeling. At ZGA, homologs of the mammalian stem cell transcription factors (TFs) Pou5f3, Nanog, and Sox19b bind to thousands of developmental enhancers to initiat...
Making reproducible, auditable and scalable data-processing analysis workflows is an important challenge in the field of bioinformatics. Recently, software containers and cloud computing introduced a novel solution to address these challenges. They simplify software installation, management and reproducibility by packaging tools and their dependenc...
moFF is a modular and operating system independent tool for quantitative analysis of label-free mass-spectrometry based proteomics data. The moFF workflow, comprising matching-between-runs and apex quantification, can be applied to any upstream search engine’s output, along with the corresponding Thermo or mzML raw file. We here present moFF 2.0, w...
Introduction: All-trans retinoic acid (ATRA, RA) has powerful activity in acute promyelocytic leukemia (APL); its efficacy in non-APL acute myeloid leukemia (AML) is still unclear, but may be enhanced by epigenetic drugs such as azanucleoside DNMT inhibitors (Blagitko-Dorfs et al. PLoS ONE 2013). In a randomized phase II study (DECIDER trial, NCT00...
Background:
Diabetes mellitus (DM) has been associated with increased platelet reactivity as well as increased levels of platelet RNAs in plasma. Here, we sought to evaluate whether the platelet transcriptome is altered in the presence of uncontrolled DM.
Methods:
Next-generation sequencing (NGS) was performed on platelet RNA for 5 patients with...
Motivation:
This paper presents Parkour, a software package for sample processing and quality management of next generation sequencing data and samples.
Results:
Starting with user requests, Parkour allows tracking and assessing samples based on predefined quality criteria through different stages of the sample preparation workflow. Ideally suit...
Ribosome profiling (ribo-seq) provides a means to analyze active translation by determining ribosome occupancy in a transcriptome-wide manner. The vast majority of ribosome protected fragments resides within the protein-coding sequence of mRNAs. However, commonly reads are also found within the transcript leader sequence (TLS) (aka 5' untranslated...
Protein/peptide-level quantification (either labeled or label-free) is routinely used in shotgun proteomics data analysis for determining the abundance of proteins in a given sample. However, accurate, rapid and robust label-free quantification is still a major challenge in the field of quantitative proteomics. Label-free quantification (LFQ) based...
Many areas of research suffer from poor reproducibility, particularly in computationally intensive domains where results rely on a series of complex methodological decisions that are not well captured by traditional publication approaches. Various guidelines have emerged for achieving reproducibility, but implementation of these practices remains d...
The primary problem with the explosion of biomedical datasets is not the data, not computational resources, and not the required storage space, but the general lack of trained and skilled researchers to manipulate and analyze these data. Eliminating this problem requires development of comprehensive educational resources. Here we present a communit...
Galaxy HiCExplorer is a web server that facilitates the study of the 3D conformation of chromatin by allowing Hi-C data processing, analysis and visualization. With the Galaxy HiCExplorer web server, users with little bioinformatic background can perform every step of the analysis in one workflow: mapping of the raw sequence data, creation of Hi-C...
The zebrafish embryo remains transcriptionally quiescent during the first 10 cell cycles. Only then Zygotic Genome Activation (ZGA) occurs and is accompanied by fast chromatin remodeling. At ZGA, homologs of mammalian stem cell transcription factors (TFs) Pou5f3/Oct4, Nanog and Sox19b bind to thousands of developmental enhancers to initiate transcr...
This paper presents Parkour, a software package for sample processing and quality management of next generation sequencing data and samples. Starting with user requests, Parkour allows tracking and assessing samples based on predefined quality criteria through different stages of the sample preparation workflow. Ideally suited for academic core lab...
Galaxy (homepage: https://galaxyproject.org, main public server: https://usegalaxy.org) is a web-based scientific analysis platform used by tens of thousands of scientists across the world to analyze large biomedical datasets such as those found in genomics, proteomics, metabolomics and imaging. Started in 2005, Galaxy continues to focus on three k...
RNA centric research is of growing importance for medicine and molecular biology. Increasing amounts of data from deep sequencing experiments create a demand for automatic analysis and interpretation solutions.
The RNA-Workbench offers a wide range of tools covering classic RNA-bioinformatics as well as RNA-seq fields. Predefined workflows for the...
Mass Spectrometry (MS) based quantitative proteomics provides information regarding protein expression and abundance in a given sample. Protein / Peptide level quantitation (either labeled or label-free) is routinely used in analysis of shotgun proteomics data. For multi-omics studies such as proteogenomics and metaproteomics, peptide-detection and...
The impact of microbial communities, also known as the microbiome, on human health and the environment is receiving increased attention. Studying translated gene products (proteins) and comparing metaproteomic profiles may elucidate how microbiomes respond to specific environmental stimuli, and interact with host organisms. Characterizing proteins...