About
23
Publications
6,506
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
165
Citations
Introduction
Marcus Pinnecke currently works at the Institute of Technical and Business Information Systems (ITI), Otto-von-Guericke-Universität Magdeburg. Marcus does research in Data Structures, Artificial Intelligence and Databases. Their current project is 'COOPeR: Cross-device OLTP/OLAP PRocessing'.
Publications
Publications (23)
Heterogeneous Hybrid Transactional Analytical Processing ( $$\mathrm{H}^{2}$$ H 2 TAP) database systems have been developed to match the requirements for low latency analysis of real-time operational data. Due to technical challenges, these systems are hard to architect, non-trivial to engineer, and complex to administrate. Current research has pro...
Finding the right vertical partitioning scheme to match a workload is one of the essential database optimization problems. With the proper partitioning, queries and management tasks can skip unnecessary data, improving their performance. Algorithmic approaches are common for determining a partitioning scheme, with solutions being shaped by their ch...
Scholarly network analysis is the study of a scientific research network aiming to discover meaningful insights and making data-driven research decisions. Analyzing such networks has become increasingly challenging, due to the amount of scientific research that is added every day. Furthermore, online resources often include information from other o...
This presentation presents the insights into the scientific publication paper Codd's World: Topics and their Evolution in the Database Community Publication Graph.
This scientific poster presents early insights into the scientific publication paper Codd's World: Topics and their Evolution in the Database Community Publication Graph. (The full-text paper can be found at http://ceur-ws.org/Vol-2367/paper_3.pdf)
In this interactive demonstration, we show the current state of Protobase, our main-
memory analytic document store that is designed from scratch to enable rapid prototyping of efficient
microservices that perform analytics and explorations on (third-party) JSON-like documents stored in
a novel columnar binary-encoded format, called the Cabin file...
In this interactive demonstration, we show the current state of Protobase, our main-
memory analytic document store that is designed from scratch to enable rapid prototyping of efficient
microservices that perform analytics and explorations on (third-party) JSON-like documents stored in
a novel columnar binary-encoded format, called the Cabin file...
In this interactive demonstration, we show the current state of Protobase, our main-
memory analytic document store that is designed from scratch to enable rapid prototyping of efficient
microservices that perform analytics and explorations on (third-party) JSON-like documents stored in
a novel columnar binary-encoded format, called the Cabin file...
In this paper we take a close look into the role of GPUs for executing OLTP workloads, with a focus on CRUD operator-based processing, as opposed to more complex OLTP transactions. To this end we develop a prototype system supporting GPU and CPU variants of DSM and NSM processing , with a delegation-based approach that uses a single-thread schedule...
Grouped aggregation is a commonly used analytical function. The common implementation of the function using hashing techniques suffers lower throughput rate due to the collision of the insert keys in the hashing techniques. During collision, the underlying technique searches for an alternative location to insert keys. Searching an alternative locat...
In this paper we define a research agenda to develop a general framework supporting online autonomous tuning of data partitioning and layouts with a reinforcement learning formulation. We establish the core elements of our approach: agent, environment, action space and supporting components. Externally predicted workloads and the current physical d...
Every year, the number of scientific publications increases, adding complexity to the networks of collaborations, citations, and topics , in which papers are embedded. Analyzing these networks with efficient tools is important to help researchers identify relevant works and understand scientific impact. However, available tools face several limitat...
FeatureIDE is a popular open-source tool for modeling, implementing, configuring, and analyzing software product lines. However, FeatureIDE's initial design was lacking mechanisms that facilitate extension and reuse of core implementations. In current releases, we improve these traits by providing a modular concept for core data structures and func...
Software product lines enable developers to derive similar products from a common code base. Existing implementation techniques can be categorized as composition-based and annotation-based approaches, with both approaches promising complementary benefits. However, annotation-based approaches are commonly used in practice despite composition allowin...
Employing special-purpose processors (e.g., GPUs) in database systems has been studied throughout the last decade. Research on heterogeneous database systems that use both general- and special-purpose processors has addressed either transaction- or analytic processing, but not the combination of them. Support for hybrid transaction- and analytic pr...
The analysis of networks, either at a single point in time or through their evolution, is an increasingly important task in modern data management. Graph databases are uniquely suited to improve static network analysis. However, there’s still no consensus on how to best model data evolution with
these databases. In our work we propose an elementary...
Management of graph-shaped data gained a momentum to both industry and research. Traversal queries through a graph-shaped dataset are easy to express, and can be efficiently executed using graph databases. High-performance traversals through graph-shaped data is claimed to be enabled by native graph storage (i.e., encoding data using graph data str...
Continuous processing of event streams evolved to an important class of data management over the last years and will become even more important due to novel applications such as the Internet of Things. Because systems for data stream and event processing have been developed independent of each other, often in competition and without the existence o...
In recent years, the need for continuous processing and analysis of data streams has increased rapidly. To achieve high throughput-rates, stream-applications make use of operator-parallelization, batching-strategies and distribution. Another possibility is to utilize co-processors capabilities per operator. Further, the database community noticed,...