Björn Andreas Grüning

University of Freiburg | Albert-Ludwigs-Universität Freiburg · Bioinformatics - Inst. of Computer Science

About

182
Publications
48,354
Reads
16,675
Citations
Citations since 2017
133 Research Items
15,722 Citations
[Chart: citations per year, 2017 to 2023; vertical axis 0 to 3,000]
Introduction
Björn Grüning currently works at the Institute of Computer Science, University of Freiburg, and leads the Freiburg Galaxy team.
Additional affiliations
August 2013 - present
University of Freiburg
Position
  • PhD
November 2009 - July 2013
Pharmaceutical Sciences
Position
  • University of Freiburg
November 2009 - July 2013
University of Freiburg
Position
  • PhD

Publications (182)
Preprint
Full-text available
Improvements in genome sequencing and assembly are enabling high-quality reference genomes for all species. However, the assembly process is still laborious, computationally and technically demanding, lacks standards for reproducibility, and is not readily scalable. Here we present the latest Vertebrate Genomes Project assembly pipeline and demonst...
Article
Full-text available
The Coronavirus disease 2019 (COVID-19) pandemic caused by the severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) resulted in a major health crisis worldwide with its continuously emerging new strains, resulting in new viral variants that drive “waves” of infection. PCR or antigen detection assays have been routinely used to detect clinic...
Preprint
Full-text available
Highly multiplexed tissue imaging (MTI) is accomplished by applying powerful antibody-based spatial proteomics technologies that characterize tissues in situ at single-cell and potentially subcellular resolution and enable the creation of two-dimensional tissue maps. Example MTI methods include cyclic immunofluorescence (CyCIF), multiplex immunohis...
Article
Full-text available
Galaxy is a mature, browser accessible workbench for scientific computing. It enables scientists to share, analyze and visualize their own data, with minimal technical impediments. A thriving global community continues to use, maintain and contribute to the project, with support from multiple national infrastructure providers that enable freely acc...
Preprint
Full-text available
There are thousands of well-maintained high-quality open-source software utilities for all aspects of scientific data analysis. For over a decade, the Galaxy Project has been providing computational infrastructure and a unified user interface for these tools to make them accessible to a wide range of researchers. In order to streamline the process...
Article
Full-text available
Background Sepsis is associated with high platelet turnover and elevated levels of immature platelets. Changes in the platelet transcriptome and the specific impact of immature platelets on the platelet transcriptome remain unclear. Thus, this study sought to address whether and how elevated levels of immature platelets affect the platelet transcri...
Preprint
Full-text available
The amount of public proteomics data is increasing at an extraordinary rate. Hundreds of datasets are submitted each month to ProteomeXchange repositories, representing many types of proteomics studies, focusing on different aspects such as quantitative experiments, post-translational modifications, protein-protein interactions, or subcellular loca...
Article
Full-text available
The COVID-19 pandemic is shifting teaching to an online setting all over the world. The Galaxy framework facilitates the online learning process and makes it accessible by providing a library of high-quality community-curated training materials, enabling easy access to data and tools, and facilitating the sharing of achievements and progress between studen...
Preprint
Full-text available
The COVID-19 pandemic is the first global health crisis to occur in the age of big genomic data. Although data generation capacity is well established and sufficiently standardized, analytical capacity is not. To establish analytical capacity it is necessary to pull together global computational resources and deliver the best open source tools and...
Article
Hypomethylating agents (HMA) have become the backbone of nonintensive acute myeloid leukemia/myelodysplastic syndrome (AML/MDS) treatment, also by virtue of their activity in patients with adverse genetics, for example, monosomal karyotypes, often with losses on chromosome 7, 5, or 17. No comparable activity is observed with cytarabine, a cytidine...
Article
Full-text available
This paper is a tutorial developed for the data analysis platform Galaxy. The purpose of Galaxy is to make high-throughput computational data analysis, such as molecular dynamics, a structured, reproducible and transparent process. In this tutorial we focus on 3 questions: How are protein-ligand systems parameterized for molecular dynamics simulati...
Article
Full-text available
Motivation: The correct prediction of bacterial sRNA homologs is a prerequisite for many downstream analyses based on comparative genomics, but it is frequently challenging due to the short length and distinct heterogeneity of such homologs. GLASSGo is an efficient tool for the prediction of sRNA homologs from a single input query. To make the alg...
Article
Full-text available
Here, we introduce the ChemicalToolbox, a publicly available web server for performing cheminformatics analysis. The ChemicalToolbox provides an intuitive, graphical interface for common tools for downloading, filtering, visualizing and simulating small molecules and proteins. The ChemicalToolbox is based on Galaxy, an open-source web-base...
Article
Full-text available
Background Reticulated platelets (RP) are the youngest circulating platelets in blood. An increased amount of this subpopulation is associated with higher cardiovascular risk and mortality. Objectives It is unknown to what extent intrinsic properties of RP contribute to their hyperreactive features. This study is the first providing a multifactori...
Preprint
Full-text available
This paper is a tutorial developed for the data analysis platform Galaxy. The purpose of Galaxy is to make high-throughput computational data analysis, such as molecular dynamics, a structured, reproducible and transparent process. In this tutorial we focus on 3 questions: How are protein-ligand systems parameterized for molecular dynamics simulati...
Article
Full-text available
The Omics Discovery Index is an open source platform that can be used to access, discover and disseminate omics datasets. OmicsDI integrates proteomics, genomics, metabolomics, models and transcriptomics datasets. Using an efficient indexing system, OmicsDI integrates different biological entities including genes, transcripts, proteins, metabolites...
Article
Full-text available
Background Infinium Human Methylation BeadChip is an array platform for complex evaluation of DNA methylation at an individual CpG locus in the human genome based on Illumina’s bead technology and is one of the most common techniques used in epigenome-wide association studies. Finding associations between epigenetic variation and phenotype is a sig...
Article
Full-text available
The Galaxy HiCExplorer provides a web service at https://hicexplorer.usegalaxy.eu. It enables the integrative analysis of chromosome conformation by providing tools and computational resources to pre-process, analyse and visualize Hi-C, Capture Hi-C (cHi-C) and single-cell Hi-C (scHi-C) data. Since the last publication, Galaxy HiCExplorer has been...
Chapter
Bioinformatics software development has become a cornerstone in modern biology research. Large-scale quantitative biology studies have created a demand for more complex workflows and data analysis pipelines. Challenges in reproducing bioinformatics analyses are compounded by the fact that the programs themselves are difficult to install on computer...
Preprint
Chromatin loops are an important factor in the structural organization of the genome. The detection of chromatin loops in Hi-C interaction matrices is a challenging and compute intensive task. The presented approach shows a chromatin loop detection algorithm which applies a strict candidate selection based on continuous negative binomial distributi...
Preprint
Single-cell Hi-C interaction matrices are high dimensional and very sparse. To cluster thousands of single-cell Hi-C interaction matrices they are flattened and compiled into one matrix. This matrix can, depending on the resolution, have a few millions or even billions of features and any computation with it is therefore memory demanding. A common...
Preprint
Full-text available
PouV and SoxB1 family transcription factors (TFs) have emerged as master regulators of cell fate transitions. To investigate the genetic interactions between Pou5f3 and Sox19b in zebrafish embryos passing through Zygotic Genome Activation (ZGA), we combined time-resolved mutant transcription analysis using the novel tool RNA-sense, chromatin state...
Preprint
Full-text available
To gain a thorough appreciation of microbiome dynamics, researchers characterize the functional role of expressed microbial genes/proteins. This can be accomplished through metaproteomics, which characterizes the protein complement of the microbiome. Several software tools exist for analyzing microbiomes at the functional level by measuring their c...
Article
Full-text available
Bioinformaticians and biologists rely increasingly upon workflows for the flexible utilization of the many life science tools that are needed to optimally convert data into knowledge. We outline a pan-European enterprise to provide a catalogue (https://bio.tools) of tools and databases that can be used in these workflows. bio.tools not onl...
Article
Full-text available
Background RNA plays essential roles in all known forms of life. Clustering RNA sequences with common sequence and structure is an essential step towards studying RNA function. With the advent of high-throughput sequencing techniques, experimental and genomic data are expanding to complement the predictive methods. However, the existing methods do...
Article
Full-text available
Background Mass spectrometry imaging is increasingly used in biological and translational research because it has the ability to determine the spatial distribution of hundreds of analytes in a sample. Being at the interface of proteomics/metabolomics and imaging, the acquired datasets are large and complex and often analyzed with proprietary softwa...
Preprint
Galaxy is a web-based and open-source scientific data-processing platform. Researchers compose pipelines in Galaxy to analyse scientific data. These pipelines, also known as workflows, can be complex and difficult to create from thousands of tools, especially for researchers new to Galaxy. To make creating workflows easier, faster and less error-pr...
Article
Full-text available
The German Network for Bioinformatics Infrastructure (de.NBI) is a national and academic infrastructure funded by the German Federal Ministry of Education and Research (BMBF). The de.NBI provides (i) service, (ii) training, and (iii) cloud computing to users in life sciences research and biomedicine in Germany and Europe and (iv) fosters the cooper...
Conference Paper
Epigenetic variations alter gene expression among various diseases and histone modifications play an important role in controlling gene expression. Predicting gene expression from histone modification signals can be useful for designing epigenetic drugs to combat diseases. Multiple computational models have been proposed to use histone modification...
Article
Full-text available
Ribosome profiling (ribo-seq) provides a means to analyze active translation by determining ribosome occupancy in a transcriptome-wide manner. The vast majority of ribosome protected fragments (RPFs) resides within the protein-coding sequence of mRNAs. However, reads are also commonly found within the transcript leader sequence (TLS) (aka 5' untran...
Conference Paper
Full-text available
New methods in drug discovery, such as combinatorial chemistry and high-throughput screening, have resulted in the creation of huge amounts of new chemical data. The need to analyse this data effectively has spawned the field of cheminformatics, which applies computational methods to chemical problems. Numerous toolkits and applications have been d...
Presentation
Full-text available
BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelin...
Article
Full-text available
RNA has become one of the major research topics in molecular biology. As a central player in key processes regulating gene expression, RNA is in the focus of many efforts to decipher the pathways that govern the transition of genetic information to a fully functional cell. As more and more researchers join this endeavour, there is a rapidly growing...
Preprint
Full-text available
Background: Mass spectrometry imaging is increasingly used in biological and translational research as it has the ability to determine the spatial distribution of hundreds of analytes in a sample. Being at the interface of proteomics/metabolomics and imaging, the acquired data sets are large and complex and often analyzed with proprietary software...
Article
Full-text available
The increasing complexity of data and analysis methods has created an environment where scientists, who may not have formal training, are finding themselves playing the impromptu role of software engineer. While several resources are available for introducing scientists to the basics of programming, researchers have been left with little guidance o...
Article
Full-text available
Background Human retinal microvascular endothelial cells (HRMVECs) are involved in the pathogenesis of retinopathy of prematurity. In this study, the microRNA (miRNA) expression profiles of HRMVECs were investigated under resting conditions, angiogenic stimulation (VEGF treatment) and anti-VEGF treatment. Materials and methods The miRNA profiles o...
Article
Full-text available
Chorismate constitutes a branch-point intermediate in the biosynthesis of many aromatic metabolites in microorganisms and plants. To obtain unnatural compounds, we modified the route to menaquinone in E. coli. We propose a model for the binding of isochorismate to the active site of MenD (SEPHCHC synthase) that explains the outcome of the native r...
Preprint
Full-text available
RNA plays essential regulatory roles in all known forms of life. Clustering RNA sequences with common sequence and structure is an essential step towards studying RNA function. With the advent of high-throughput sequencing techniques, experimental and genomic data are expanding to complement the predictive methods. However, the existing methods do...
Preprint
Full-text available
Background Epigenome-wide association studies (EWAS) analyse genome-wide activity of epigenetic marks in cohorts of different individuals to find associations between epigenetic variation and phenotype. One of the most common techniques used in EWAS is the Infinium Methylation Assay, which quantifies the DNA methylation level of over 450k lo...
Article
Full-text available
Motivation The pathway from genomics through proteomics and onto a molecular description of biochemical processes makes the discovery of drugs and biomaterials possible. A research framework common to genomics and proteomics is needed to conduct biomolecular simulations that will connect biological data to the dynamic molecular mechanisms of enzymes...
Article
Full-text available
The zebrafish embryo is transcriptionally mostly quiescent during the first 10 cell cycles, until the main wave of zygotic genome activation (ZGA) occurs, accompanied by fast chromatin remodeling. At ZGA, homologs of the mammalian stem cell transcription factors (TFs) Pou5f3, Nanog, and Sox19b bind to thousands of developmental enhancers to initiat...
Preprint
Full-text available
Making reproducible, auditable and scalable data-processing analysis workflows is an important challenge in the field of bioinformatics. Recently, software containers and cloud computing introduced a novel solution to address these challenges. They simplify software installation, management and reproducibility by packaging tools and their dependenc...
Article
moFF is a modular and operating system independent tool for quantitative analysis of label-free mass-spectrometry based proteomics data. The moFF workflow, comprising matching-between-runs and apex quantification, can be applied to any upstream search engine’s output, along with the corresponding Thermo or mzML raw file. We here present moFF 2.0, w...
Article
Introduction: All-trans retinoic acid (ATRA, RA) has powerful activity in acute promyelocytic leukemia (APL); its efficacy in non-APL acute myeloid leukemia (AML) is still unclear, but may be enhanced by epigenetic drugs such as azanucleoside DNMT inhibitors (Blagitko-Dorfs et al. PLoS ONE 2013). In a randomized phase II study (DECIDER trial, NCT00...
Article
Full-text available
Background: Diabetes mellitus (DM) has been associated with increased platelet reactivity as well as increased levels of platelet RNAs in plasma. Here, we sought to evaluate whether the platelet transcriptome is altered in the presence of uncontrolled DM. Methods: Next-generation sequencing (NGS) was performed on platelet RNA for 5 patients with...
Article
Motivation: This paper presents Parkour, a software package for sample processing and quality management of next generation sequencing data and samples. Results: Starting with user requests, Parkour allows tracking and assessing samples based on predefined quality criteria through different stages of the sample preparation workflow. Ideally suit...
Preprint
Ribosome profiling (ribo-seq) provides a means to analyze active translation by determining ribosome occupancy in a transcriptome-wide manner. The vast majority of ribosome protected fragments resides within the protein-coding sequence of mRNAs. However, reads are also commonly found within the transcript leader sequence (TLS) (aka 5' untranslated...
Poster
Full-text available
Protein/peptide-level quantification (either labeled or label-free) is routinely used in shotgun proteomics data analysis for determining the abundance of proteins in a given sample. However, accurate, rapid and robust label-free quantification is still a major challenge in the field of quantitative proteomics. Label-free quantification (LFQ) based...
Article
Many areas of research suffer from poor reproducibility, particularly in computationally intensive domains where results rely on a series of complex methodological decisions that are not well captured by traditional publication approaches. Various guidelines have emerged for achieving reproducibility, but implementation of these practices remains d...
Article
The primary problem with the explosion of biomedical datasets is not the data, not computational resources, and not the required storage space, but the general lack of trained and skilled researchers to manipulate and analyze these data. Eliminating this problem requires development of comprehensive educational resources. Here we present a communit...
Article
Full-text available
Galaxy HiCExplorer is a web server that facilitates the study of the 3D conformation of chromatin by allowing Hi-C data processing, analysis and visualization. With the Galaxy HiCExplorer web server, users with little bioinformatic background can perform every step of the analysis in one workflow: mapping of the raw sequence data, creation of Hi-C...
Preprint
Full-text available
The zebrafish embryo remains transcriptionally quiescent during the first 10 cell cycles. Only then Zygotic Genome Activation (ZGA) occurs and is accompanied by fast chromatin remodeling. At ZGA, homologs of mammalian stem cell transcription factors (TFs) Pou5f3/Oct4, Nanog and Sox19b bind to thousands of developmental enhancers to initiate transcr...
Preprint
This paper presents Parkour, a software package for sample processing and quality management of next generation sequencing data and samples. Starting with user requests, Parkour allows tracking and assessing samples based on predefined quality criteria through different stages of the sample preparation workflow. Ideally suited for academic core lab...
Article
Full-text available
Galaxy (homepage: https://galaxyproject.org, main public server: https://usegalaxy.org) is a web-based scientific analysis platform used by tens of thousands of scientists across the world to analyze large biomedical datasets such as those found in genomics, proteomics, metabolomics and imaging. Started in 2005, Galaxy continues to focus on three k...
Presentation
Full-text available
RNA centric research is of growing importance for medicine and molecular biology. Increasing amounts of data from deep sequencing experiments create a demand for automatic analysis and interpretation solutions. The RNA-Workbench offers a wide range of tools covering classic RNA-bioinformatics as well as RNA-seq fields. Predefined workflows for the...
Poster
Full-text available
Mass Spectrometry (MS) based quantitative proteomics provides information regarding protein expression and abundance in a given sample. Protein / Peptide level quantitation (either labeled or label-free) is routinely used in analysis of shotgun proteomics data. For multi-omics studies such as proteogenomics and metaproteomics, peptide-detection and...
Article
Full-text available
The impact of microbial communities, also known as the microbiome, on human health and the environment is receiving increased attention. Studying translated gene products (proteins) and comparing metaproteomic profiles may elucidate how microbiomes respond to specific environmental stimuli, and interact with host organisms. Characterizing proteins...