About
367
Publications
227,339
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
36,932
Citations
Introduction
Louise Jespensgaard just defended a very nice MSc report:
"Evaluation of an Electronic Tongue in the Pharmaceutical Industry – Performance Evaluation and Sensory Validation of Taste Masking".
She did the project together with the pharmaceutical company Lundbeck
Current institution
Additional affiliations
January 2013 - December 2015
Publications
Publications (367)
When a small electric bias is applied to a single-molecule junction, current will flow through the molecule via a tunneling mechanism. In molecules with a cyclic or helical structure there may be circular currents, giving rise to a uni-directional magnetic field. Here, we implement the Biot Savart law and calculate the magnetic field resulting from...
1 2 Shift Invariant Tri-linearity (SIT) for fast, flexible blind source separation in hyphenated chromatography. Ideally, hyphenated chromatography measurements follow the parallel factor analysis (PAFAFAC) model. However, measurements that exhibit shifting in the chromatogram elution profiles do not follow the PARAFAC model and researchers have in...
Binding free energies are a key element in understanding and predicting the strength of protein--drug interactions. While classical free energy simulations yield good results for many purely organic ligands, drugs including transition metal atoms often require quantum chemical methods for an accurate description. We propose a general and automated...
We present a quantum-in-quantum embedding strategy coupled to machine learning potentials to improve on the accuracy of quantum-classical hybrid models for the description of large molecules. In such hybrid models, relevant structural regions (such as those around reaction centers or pockets for binding of host molecules) can be described by a quan...
The properties and dynamics of gold nanowires have been studied for decades as an important testbed for several physical phenomena. Gold nanowires forming at contacts are an integral part of molecular junctions used to study the electronic and thermal properties of single molecules. However, the huge discrepancy in timescales between experiments an...
Thermal management in molecular systems presents challenges that require a deeper understanding of phonon transport, an essential aspect of heat conduction in single-molecule junctions. Our work introduces the use of heavy atoms as a strategy for suppressing phonon transport in organic molecules. Starting with a one-dimensional (1D) force-constant...
When a small electric bias is applied to a single- molecule junction, current will flow through the molecule via a tunneling mechanism. In molecules with a cyclic or helical structure there may be circular currents, giving rise to a uni-directional magnetic field. Here, we implement the Biot-Savart law and calculate the magnetic field resulting fro...
This paper introduces a novel deconvolution algorithm, shift‐invariant multi‐linearity (SIML), which significantly enhances the analysis of data from two‐dimensional gas chromatography instruments coupled to a time‐of‐flight mass spectrometer (GC × GC‐TOFMS). Designed to address the challenges posed by retention time shifts and high noise levels, S...
One key aspect for the development of functional molecular electronic devices is the ability to precisely tune and reversibly switch the conductance of individual molecules in electrode‐molecule‐electrode junctions in response to external stimuli. In this work, we present a new approach to access molecular switches by deliberately controlling the f...
One key aspect for the development of functional molecular electronic devices is the ability to precisely tune and reversibly switch the conductance of individual molecules in electrode‐molecule‐electrode junctions in response to external stimuli. In this work, we present a new approach to access molecular switches by deliberately controlling the f...
This paper introduces a novel deconvolution algorithm, shift-invariant multi-linearity (SIML), which significantly enhances the analysis of data from a comprehensive two-dimensional gas chromatograph coupled to a mass spectrometric detector (GC$\times$GC-TOFMS). Designed to address the challenges posed by retention time shifts and high noise levels...
Thermal management in molecular systems presents challenges that require a deeper understanding of phonon transport, an essential aspect of heat conduction in single molecule junctions. Our work introduces the use of heavy atoms as a strategy for suppressing phonon transport in organic molecules. Starting with a 1D force-constant model and density...
Thermal management in molecular systems presents challenges that require a deeper understanding of phonon transport, an essential aspect of heat conduction in single molecule junctions. Our work introduces the use of heavy atoms as a strategy for suppressing phonon transport in organic molecules. Starting with a 1D force-constant model and density...
Thermal management in molecular systems presents challenges that require a deeper understanding of phonon transport, an essential aspect of heat conduction in single molecule junctions. Our work introduces the use of heavy atoms as a strategy for suppressing phonon transport in organic molecules. Starting with a 1D force-constant model and density...
Thermal management in molecular systems presents challenges that require a deeper understanding of phonon transport, an essential aspect of heat conduction in single molecule junctions. Our work introduces the use of heavy atoms as a strategy for suppressing phonon transport in organic molecules. Starting with a 1D force-constant model and density...
The proposed method is a shift-invariant tri-linear decomposition that can effectively model peak shape changes in gas chromatography coupled mass spectrometry (GC-MS). The method automatically adapts to the complexity of the data, allowing for a fine-tuned flexibility in the modeling process. Performance on real GC-MS data indicates that the propo...
Single-molecule experiments offer a unique means to probe molecular properties of individual molecules–yet they rest upon the successful control of background noise and irrelevant signals. In single-molecule transport studies, large amounts of data that probe a wide range of physical and chemical behaviors are often generated. However, due to the s...
There is an increasing interest in the development of new data-driven models useful to assess the performance of communication networks. For many applications, like network monitoring and troubleshooting, a data model is of little use if it cannot be interpreted by a human operator. In this paper, we present an extension of the Multivariate Big Dat...
Chemistry of the Au–S interface at the nanoscale is one of the most complex systems to study, as the nature and strength of the Au–S bond change under different experimental conditions. In this study, using mechanically controlled break junction technique, we probed the conductance and analyzed Flicker noise for several aliphatic and aromatic thiol...
Bloodstains are commonly encountered at crime scenes, especially on floor tiles, and can be deposited over different periods and intervals. Therefore, it is crucial to develop techniques that can accurately identify bloodstains deposited at different times. This study builds upon a previous investigation and aims to enhance the performance of three...
While the use of molecular orbitals (MOs) and their isosurfaces to explain physical phenomena in chemical systems is a time-honored tool, we show that the nodes are an equally important component for understanding the current density through single-molecule junctions. We investigate three different model systems consisting of an alkane, alkene, and...
While the use of molecular orbitals (MOs) and their isosurfaces to explain physical phenomena in chemical systems are a time-honored tool, we show that the nodes are an equally important component for understanding the current density through single-molecule junctions. We investigate three dif- ferent model systems consisting of an alkane, alkene a...
This study aimed to prepare a novel colorimetric indicator film from virtually pure (99 %) amylose (AM) and anthocyanins extracted from red cabbage (RCA). The AM used was a unique engineered bulk material extracted from transgenic barley grains. Films produced by solution casting were compared to normal barely starch (NB) and pure barley amylopecti...
While the use of molecular orbitals (MOs) and their isosurfaces to explain physical phenomena in chemical systems are a time-honored tool, we show that the nodes are an equally important component for understanding the current density through single-molecule junctions. We investigate three dif- ferent model systems consisting of an alkane, alkene a...
Multi‐way data analysis is popular in chemometrics for the decomposition of, for example, spectroscopic or chromatographic higher‐order tensor datasets. Parallel factor analysis (PARAFAC) and its extension, PARAFAC2, are extensively employed methods in chemometrics. Applications of PARAFAC2 for untargeted data analysis of hyphenated gas chromatogra...
Cell-based sensors and assays have great potential in bioanalysis, drug discovery screening, and biochemical mechanisms research. The cell viability tests should be fast, safe, reliable, and time- and cost-effective. Although methods stated as "gold standards", such as MTT, XTT, and LDH assays, usually fulfill these assumptions, they also show some...
Quantum interference effects in conjugated molecules have been well-explored, with benzene frequently invoked as a pedagogical example. These interference effects have been understood through a quantum interference map in which the electronic transmission is separated into interfering and non-interfering terms, with a focus on the π-orbitals for co...
The present protocol provides general guidelines for users working with PARADISe, a deconvolution and identification system for processing GC-MS data. This tool allows users to perform untargeted analysis of large datasets efficiently and minimizes inter-user variability. The final output is a peak table, in excel format, containing the peak area a...
Quantum interference effects in conjugated molecules have been well explored, with benzene frequently invoked as a pedagogical example. These interference effects have been understood through a quantum interference map in which the electronic transmission is separated into interfering and non-interfering terms, with a focus on the π-orbitals for co...
When building classification models of complex systems with many classes, the traditional chemometric approaches such as discriminant analysis or soft independent modelling of class analogy often fail. Some people resort to advanced deep neural network, but this is only an option if there is access to very many samples. Another alternative often us...
In this tutorial review, we will describe crucial aspects related to the application of machine learning to help users avoid the most common pitfalls. The examples we present will be based on data from the field of molecular electronics, specifically single-molecule electron transport experiments, but the concepts and problems we explore will be su...
Unlike other food products, virgin olive oil must undergo an organoleptic assessment that is currently based on a trained human panel, which presents drawbacks that might affect the efficiency and robustness. Therefore, disposing of instrumental methods that could serve as screening tools to support sensory panels is of paramount importance. The pr...
Tensor decompositions, such as CANDECOMP/PARAFAC (CP), are widely used in a variety of applications, such as chemometrics, signal processing, and machine learning. A broadly used method for computing such decompositions relies on the Alternating Least Squares (ALS) algorithm. When the number of components is small, regardless of its implementation,...
The Canonical Polyadic (CP) tensor decomposition is frequently used as a model in applications in a variety of different fields. Using jackknife resampling to estimate parameter uncertainties is often desirable but results in an increase of the already high computational cost. Upon observation that the resampled tensors, though different, are nearl...
Higher-order tensor data analysis has been extensively employed to understand complicated data, such as multi-way GC-MS data in untargeted/targeted analysis. However, the analysis can be complicated when one of the modes shifts e.g., the elution profiles of specific compounds often with respect to retention time; something which violates the assump...
A total of 56 key volatile compounds present in natural and alkalized cocoa powders have been rapidly evaluated using a non-target approach using stir bar sorptive extraction gas chromatography mass spectrometry (SBSE-GC-MS) coupled to Parallel Factor Analysis 2 (PARAFAC2) automated in PARADISe. Principal component analysis (PCA) explained 80% of t...
While the use of molecular orbitals (MOs) and their isosurfaces to explain physical phenomena in chemical systems are a time-honored tool, we show that the nodes are an equally important component for understanding the current density through single-molecule junctions. We investigate three different model systems consisting of an alkane, alkene and...
The Canonical Polyadic (CP) tensor decomposition is frequently used as a model in applications in a variety of different fields. Using jackknife resampling to estimate parameter uncertainties is often desirable but results in an increase of the already high computational cost. Upon observation that the resampled tensors, though different, are nearl...
Increasing awareness of the ability to transform data into knowledge has steered more focus on data science within the educational system as well as the development of machine learning methods capable of handling complex problems with minimal or no human interaction. In principle, this raises the question on where human–computer interaction is supe...
Analyzing multi-way measurements with variations across one mode of the dataset is a challenge in various fields including data mining, neuroscience and chemometrics. For example, measurements may evolve over time or have unaligned time profiles. The PARAFAC2 model has been successfully used to analyze such data by allowing the underlying factor ma...
Gas chromatography – mass spectrometry (GC-MS) is an important tool in contemporary untargeted chemical analysis, where the batch analysis of sample series and subsequent generation of peak tables are still commonly subject to software-uncertainty leading to issues in reproducibility and hypothesis testing.
Using tensor-based modelling in combinati...
One of the most important types of evidence in certain criminal investigations is traces of human blood. For a detailed investigation, blood samples must be identified and collected at the crime scene. The present study aimed to evaluate the potential of the identification of human blood in stains deposited on different types of floor tiles (five t...
PARAFAC2 is a useful algorithm for decomposing tensors that do not have low-rank variation such as e.g. PARAFAC requires. It has been applied in analyzing different types of multi-way data, such as GC-MS data. Since the optimization in fitting the loss function of PARAFAC2 is non-convex, the PARAFAC2 model suffers from local minima. In this paper,...
The frontier molecular orbital (MO) topology of linear carbon molecules, such as polyynes, can be visually identified as helices. However, there is no clear way to quantify the helical curvature of these π-MOs, and it is thus challenging to quantify correlations between the helical curvature and molecular properties. In this paper, we develop a met...
PARAFAC2 is a well-established method for specific type of tensor decomposition problems, for example when observations have different lengths or measured profiles slightly change position in the multi-way data. Most commonly used PARAFAC2-ALS algorithms are very slow. In this paper, we propose novel implementations of extrapolation-based PARAFAC2...
The frontier molecular orbital (MO) topology of linear carbon molecules, such as polyynes, can be visually identified as helices. However, there is no clear way to quantify the helical curvature of these π-MOs and it is thus challenging to quantify correlations between the helical curvature and molecular properties. In this paper, we develop a meth...
The frontier molecular orbital (MO) topology of linear carbon molecules, such as polyynes, can be visually identified as helices. However, there is no clear way to quantify the helical curvature of these π-MOs and it is thus challenging to quantify correlations between the helical curvature and molecular properties. In this paper, we develop a meth...
Calibration model maintenance is often overlooked but is a significant part of successful use of multivariate calibration models, for example, in process monitoring and optimization. In some cases, companies are maintaining tens or even hundreds of calibration models. This could be partial least squares (PLS) calibration models pertaining to differ...
Elevated levels of particulate matter (PM) in urban atmospheres are one of the major environmental challenges of the Anthropocene. To effectively lower those levels, identification and quantification of sources of PM is required. Biomonitoring methods are helpful tools to tackle this problem but have not been fully established yet. An example is th...
The consumers’ interest towards beer consumption has been on the rise during the past decade: new approaches and ingredients get tested, expanding the traditional recipe for brewing beer. As a consequence, the field of “beeromics” has also been constantly growing, as well as the demand for quick and exhaustive analytical methods. In this study, we...
This work describes the development of methodology based on the hierarchical soft classification method by combining multivariate analysis techniques and Hyperspectral Near Infrared Images (HSI-NIR) to confirm identification of bloodstains on colored and printed fabrics. The term hierarchical is used to designate that the classification is done seq...
Sparse Principal Component Analysis (sPCA) is a popular matrix factorization approach based on Principal Component Analysis (PCA). It combines variance maximization and sparsity with the ultimate goal of improving data interpretation. A main application of sPCA is to handle high-dimensional data, for example biological omics data. In Part I of this...
The one-dimensional carbon allotrope carbyne, a linear chain of sp-hybridized carbon atoms, is predicted to exist in a polyynic and a cumulenic structure. While molecular forms of carbyne have been extensively characterized, the structural nature is hard to determine for many linear carbon wires that are made in-situ during pulling experiments. Her...
In this paper, we discuss the validity of using score plots of component models such as partial least squares regression, especially when these models are used for building classification models, and models derived from partial least squares regression for discriminant analysis (PLS-DA). Using examples and simulations, it is shown that the currentl...
Matrix factorization methods are extensively employed to understand complex data. In this paper, we introduce the cross-product penalized component analysis (X-CAN), a matrix factorization based on the optimization of a loss function that allows a trade-off between variance maximization and structural preservation, with a focus on highlighting diff...
Laser-induced breakdown spectroscopy (LIBS) was used to characterize base (Al and Cu) and noble (Au and Ag) elements on a printed circuit board (PCB) from hard disk (HD). A PCB was cut in 77 fragments and, a matrix of 4 rows and 4 columns with 10 laser pulses in each point of the matrix was acquired in each fragment by LIBS. For each element, a spe...
p>The one-dimensional carbon allotrope carbyne, a linear chain of sp -hybridized carbon atoms, is predicted to exist in a polyynic and a cumulenic structure. While molecular forms of carbyne have been extensively characterized, the structural nature is hard to determine for many linear carbon wires that are made in-situ during pulling experiments....
In this section, multilinear models for multi-way arrays requiring iterative fitting algorithms are outlined. Among them: the PARAFAC (PARAllel FACtor analysis) model and one of its variants (the PARAFAC2 model); Tucker models in which one or more modes are reduced (viz., the N-way Tucker-N and Tucker-m models); hybrid models having intermediate pr...
We propose networkmetrics, a new data-driven approach for monitoring, troubleshooting and understanding communication networks using multivariate analysis. Networkmetric models are powerful machine-learning tools to interpret and interact with data collected from a network. In this paper, we illustrate the application of Multivariate Big Data Analy...
Matrix factorization methods are extensively employed to understand complex data. In this paper, we introduce the cross-product penalized component analysis (XCAN), a sparse matrix factorization based on the optimization of a loss function that allows a trade-off between variance maximization and structural preservation. The approach is based on pr...
Analysis of untargeted gas-chromatographic data is time consuming. With the earlier introduction of the PARAFAC2 (PARAllel FACtor analysis 2) based PARADISe (PARAFAC2 based Deconvolution and Identification System) approach in 2017, this task was made considerably more time-efficient. However, there are still a number of manual steps in the analysis...
Multivariate exploratory data analysis allows revealing patterns and extracting information from complex multivariate data sets. However, highly complex data may not show evident groupings or trends in the principal component space, e.g. because the variation of the variables are not grouped but rather continuous. In these cases, classical explorat...
The spectra responsible for natural dissolved organic matter (DOM) fluorescence in 90 peer-reviewed studies have been compared using new similarity metrics. Numerous spectra cluster in specific wavelength regions. The emerging...
Single-molecule conductance generally decays exponentially with the length of the molecule when the transport mechanism is a coherent tunneling process. However, it was recently found that this length dependence can be reversed in linear conjugated molecules if the bond-length alternation is reversed. In this work we show that even-carbon cumulenes...
NMR is one of the most powerful analytical techniques of our time. It allows detailed investigation of qualitative and quantitative characteristics of complex chemical and biological samples. The resulting NMR data provides a wealth of information about the samples, but the NMR data analysis has been and still is suffering from oversimplified appro...
Modeling variability in tensor decomposition methods is one of the challenges of source separation. One possible solution to account for variations from one data set to another, jointly analysed, is to resort to the PARAFAC2 model. However, so far imposing constraints on the mode with variability has not been possible. In the following manuscript,...
PARAFAC2 is a powerful decomposition method which is ideally suited for modeling gas chromatography-mass spectrometry (GC-MS) data. However, the most widely used fitting algorithms (alternating least squares, ALS) are very slow which hinders use of the model. In this paper, an iterative method called geometric search is proposed to fit the PARAFAC2...
Modeling variability in tensor decomposition methods is one of the challenges of source separation. One possible solution to account for variations from one data set to another, jointly analysed, is to resort to the PARAFAC2 model. However, so far imposing constraints on the mode with variability has not been possible. In the following manuscript,...
Parallel factor analysis (PARAFAC) of food fluorescence has found many applications in food science, such as in non-contact and non-destructive food characterization, the detection of food adulteration, and the authentication of geographical and botanical origins of food products. This Chapter presents a theoretical background of the PARAFAC method...
This IUPAC Technical Report describes and compares the currently applied methods for the calibration and standardization of multi-dimensional fluorescence (MDF) spectroscopy data as well as recommendations on the correct use of chemometric methods for MDF data analysis. The paper starts with a brief description of the measurement principles for the...
It has become easy to obtain multivariate chemical data of high dimensions. However, it may be expensive or time consuming to obtain a large number of samples or to acquire reference measures, so the number of samples available for multivariate calibration modelling may be limited. If data contains nonlinear relationships, nonlinear methods are req...
Demonstrates the use of PARAllel FACtor analysis for excitation emission fluorescence.
Foodomics is a newly developed discipline that has become more and more important in the last years where focus on food and the understanding of food systems has increased significantly. In this review, the flow of a typical foodomics study will be followed with a focus on the core components, where chemometric expertise is more deeply involved. Th...
Flavour matching can be viewed as trying to reproduce a specific flavour. This is a time consuming task and may lead to flavour mixtures that are too complex or too expensive to be commercialized. In order to facilitate the matching, we have developed a new mathematical model, called Prioritizer. Based on the chemical composition of a mixture of vo...
Data fusion, i.e., extracting information through the fusion of complementary data sets, is a topic of great interest in metabolomics since analytical platforms such as Liquid Chromatography - Mass Spectrometry (LC-MS) and Nuclear Magnetic Resonance (NMR) spectroscopy commonly used for chemical profiling of biofluids provide complementary informati...
Evaluation of GC–MS data may be challenging due to the high complexity of data including overlapped, embedded, retention time shifted and low S/N ratio peaks. In this work, we demonstrate a new approach, PARAFAC2 based Deconvolution and Identification System (PARADISe), for processing raw GC–MS data. PARADISe is a computer platform independent free...
Multi-way data arrays are becoming more common in several fields of science. For instance, analytical instruments can sometimes collect signals at different modes simultaneously, as e.g. fluorescence and LC/GC-MS. Higher order data can also arise from sensory science, were product scores can be reported as function of sample, judge and attribute. A...
Flavour matching can be viewed as trying to reproduce a specific flavour. This is a time consuming task and may lead to flavour mixtures that are too complex or too expensive to be commercialized. In order to facilitate the matching, we have developed a new mathematical model, called Prioritizer. Based on the chemical composition of a mixture of vo...
NMR is one of the most powerful analytical techniques of our time. It allows detailed investigation of qualitative and quantitative characteristics of complex chemical and biological samples. The resulting NMR data provides a wealth of information about the samples, but the NMR data analysis has been and still is suffering from oversimplified appro...
Significant improvements can be realized by converting conventional batch processes into continuous ones. The main drivers include reduction of cost and waste, increased safety, and simpler scale-up and tech transfer activities. Re-designing the process layout offers the opportunity to incorporate a set of process analytical technologies (PAT) embr...
In many areas of science multiple sets of data are collected pertaining to the same system. Examples are food products which are characterized by different sets of variables, bio-processes which are on-line sampled with different instruments, or biological systems of which different genomics measurements are obtained. Data fusion is concerned with...
In many areas of science multiple sets of data are collected pertaining to the same system. Examples are food products which are characterized by different sets of variables, bio-processes which are on-line sampled with different instruments, or biological systems of which different genomics measurements are obtained. Data fusion is concerned with...
The focus of the present paper is to propose and discuss different procedures for performing variable selection in a multi-block regression context. In particular, the focus is on two multi-block regression methods: Multi-Block Partial Least Squares (MB-PLS) and Sequential and Orthogonalized Partial Least Squares (SO-PLS) regression. A small simula...
The aim was to investigate the effects of increased water or dairy intake on total intake of energy, nutrients, foods and dietary patterns in overweight adolescents in the Milk Components and Metabolic Syndrome (MoMS) study (n= 173). Participants were randomly assigned to consume 1l/d of skim milk, whey, casein or water for 12 weeks. A decrease in...
With a goal of identifying biomarkers/patterns related to certain conditions or diseases, metabolomics focuses on the detection of chemical substances in biological samples such as urine and blood using a number of analytical tech- niques, including nuclear magnetic resonance (NMR) spectros- copy, liquid chromatography-mass spectrometry (LC-MS), an...
Little is known about the development of dietary patterns during toddlerhood and the relation to growth and health. The study objective was to characterise the development of dietary patterns from 9-36 mo of age and investigate the association to body size, body composition and metabolic risk markers at 36 mo. Food records were filled out at 9, 18...
We consider factoring low-rank tensors in the presence of outlying slabs.
This problem is important in practice, because data collected in many
real-world applications, such as speech, fluorescence, and some social network
data, fit this paradigm. Prior work tackles this problem by iteratively
selecting a fixed number of slabs and fitting, a proced...
Questions
Question (1)
Would it be the number of components, how to handle scattering, fixing outliers or something completely different?