
Daniel Staegemann- Research Associate at Otto-von-Guericke University Magdeburg
Daniel Staegemann
- Research Associate at Otto-von-Guericke University Magdeburg
About
102
Publications
23,723
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
478
Citations
Introduction
Skills and Expertise
Current institution
Publications
Publications (102)
Big data projects have become increasingly important in today's data-driven world, significantly influencing sectors such as healthcare, finance, and retail. However, these projects often face high failure rates, with estimates suggesting that between 80% and 87% fail to produce sustainable solutions. This systematic literature review aims to inves...
App reviews provide crucial feedback for software maintenance and evolution, but manually extracting useful reviews from vast volumes is time-consuming and challenging. This study investigates the effectiveness of six Naïve Bayes variants for automatically filtering useful app reviews. We evaluated these variants on datasets from five popular apps,...
This research explores the intricate landscape arising from the integration of big data and large language models (LLMs) across sectors, unveiling intellectual property (IP) challenges requiring careful scrutiny. The transformative impact of big data and the ascendancy of LLMs in artificial intelligence have precipitated complex inquiries into data...
Through the increasing adoption of managed services and cloud computing, new options for the realization and provisioning of backup services emerge. One of these possibilities is the option to utilize managed backups, which are a form of managed service where a service provider is responsible for the data protection of a whole business and also pro...
303 304 D. Staegemann et al. and their general nature is outlined. Then, subsequently, suitable approaches for the components' testing are discussed. In doing so, researchers and practitioners alike shall be provided with a starting point when it comes to the discussion of big data quality assurance and how it can be implemented in their own endeav...
The growing digitalization influences almost all areas of our modern life. Companies are aligning themselves with this movement already. However, public institutions in particular still have problems that are primarily originating out of high costs. Serverless computing is a new trend in cloud computing that enables modern software development to b...
In the era of Big Data, the successful completion of Data Science (DS) projects is crucial. However, DS project management is quite challenging due to its interdisciplinary nature. Existing DS process models, such as CRISP-DM, have limitations, resulting in low success rates for these undertakings. To address this issue, a novel methodology for the...
As the complexity and diversity of big data systems reaches a new level, testing the solutions developed is becoming increasingly difficult. In this study, a systematic literature review is conducted on the role of testing and related quality assurance techniques in current big data systems in terms of applied strategies and design guidelines. Afte...
Aged information systems are commonly referred to as legacy information systems or just merely as legacy systems. Typically, these are mission-critical systems developed years ago that significantly resist evolution. Many organizations are confronted with these systems and consider them a burden because they cement their businesses and cause unreas...
This systematic literature review explores how Big Data Analytics (BDA) adoption impacts operational efficiency and innovation capacity within the technology sector. Employing PRISMA and related guidelines, the study finds BDA offers significant potential to enhance operational efficiency, competitiveness, and decision-making, leading to improved b...
Big data (BD) and the systems used for its harnessing heavily impact many aspects of today’s society and it has been repeatedly shown that they can positively impact the operations of organizations that incorporate them. However, creating and maintaining these applications is extremely challenging. Therefore, it is necessary to pay additional atten...
The role of data as a valuable resource has caused significant transformations in various areas of life. Data Science (DS) aims to extract knowledge from data and thus, has gained attraction from organizations aiming to optimize existing processes and uncover previously unknown potentials. DS can be beneficially integrated into the business process...
Data Science (DS) has gained increased relevance due to the potential to extract useful insights from data. Quite commonly, this involves the utilization of Machine Learning (ML). The challenging pursuit of developing and productionizing ML models can be supported and automated through MLOps, a specialization of the DevOps paradigm from software de...
With society’s increasing data production and the corresponding demand for systems that are capable of utilizing them, the big data domain has gained significant importance. However, besides the systems’ actual implementation, their testing also needs to be considered. For this, oftentimes, proper test data sets are necessary. This publication disc...
Citizens are a country’s greatest resource. As today’s world is quite knowledge-driven, encouraging citizens to facilitate their competences becomes essential to the future and success of any country. Manifestly, providing high-quality education is necessary and important. However, it is not as simple in practice to provide a high level of educatio...
The extensive use of information and, thereby, also the application of big data (BD) technologies, are some of the biggest influencing factors in today’s society. However, due to the sheer deluge of data, it is not feasible to turn them into usable information in a manual fashion. Instead, automated approaches are required, which makes machine lear...
As a consequence of the necessities of the digital age, agility is becoming more and more important in software development. Consequently, agile change management is increasingly coming into focus and many projects are undergoing a transformation process from classic software development to agile software development. Through this, managers are con...
Data has become an important asset for businesses, and it is crucial to understand its value. The valuation of data is an important step in leveraging the potential of data as an asset. It helps companies to use data effectively and adapt their strategies accordingly. A distinction must be made between valuation and pricing. Valuation refers to the...
Big data analytics have claimed an important role in today’s society. Consequently, ways of improving the design and development of the corresponding applications are highly sought after. One rather current proposition is the application of test driven development (TDD) in the big data domain. The idea behind it is to increase the quality and the f...
Big data applications have gained widespread
usage across various fields, including healthcare, business, and
education. The effectiveness and accuracy of these applications
heavily rely on the availability of a large volume of data.
However, the collected and generated data for these
applications often suffer from incompleteness, inaccuracy, and
l...
Data and their analysis are one of the driving forces of today’s society. Therefore, the concept of big data (BD) is also heavily discussed by researchers and practitioners alike. Besides many other aspects, this also includes the corresponding quality assurance. The publication at hand contributes to this by further examining the rather recent pro...
In a world where the amount of generated and stored data is steadily increasing, Data Science (DS) has emerged as an integral discipline for organizations to extract knowledge and value from these resources. Due to the high failure rate in these undertakings, new approaches to support the project management are urgently needed. The use of appropria...
Retail is expected to be one of the industries that will benefit most from advances in artificial intelligence (AI) in the future. One branch of AI is video analytics, which is used to analyze the behavior, flow, and interactions of customers in a store. Properly implementing features to optimize store operations, prevent theft, or provide targeted...
The technological evolution is dominating the social change. There is a constant increase in the number of networked devices and the use of smart meters has become more relevant. A required feature of smart meter architectures is reliability, which ensures the continuous provision of the services of the system. In this article, the research questio...
The panorama of data is ever evolving, and big data has emerged to become one of the most hyped terms in the industry. Today, users are the perpetual producers of data that if gleaned and crunched, have the potential to reveal game-changing patterns. This has introduced an important shift regarding the role of data in organizations and many strive...
Microservice architectures have emerged as counter design to traditional monolithic applications. While monoliths are single executable applications, microservice architectures consist of several smaller units. Advantages of microservice architectures are their development speed, lower costs of change, and dynamic scaling ability. However, this pat...
As Decentralised Autonomous Organisation (DAO) is a new emerging form of organisation with unrevealed characteristics, this study examines how DAO can be classified in terms of organisational design theory and provides an overview of its characteristics. The investigation could further provide guidance on what types of organisations can be easily t...
The trend of adopting container-based systems become increasingly
relevant for companies and their IT departments. Improved scalability and shorter deployment cycles in IT production are the most mentioned benefits of the technology. The adoption of container-based technologies in an existing IT system landscape requires a consideration of migratio...
With the significance, ubiquity, and complexity of information technology continuously rising, the corresponding quality assurance becomes increasingly important and challenging. Consequently, numerous tools, techniques, paradigms, and strategies that facilitate the quality assurance of the developed applications have emerged. Test-driven developme...
A new emerging form of organisation, the decentralised autonomous organisation (DAO), built on blockchain technology and smart contracts, offers the potential for transforming social interaction. The basic regulations for running an association under German law indicate that transforming the traditional form of an association into a decentralised a...
The future of retail will be shaped by the rapidly evolving digital landscape. New technologies such as bigdata analytics, artificial intelligence, virtual reality, and cloud computing are expected to play a crucial rolein advertising products, making personalized offers tailored precisely to customers' needs, and thus meetingrising expectations in...
Business Simulation Games (BSGs) aim to simulate reality and impart knowledge as well as skills in a playful way. To be able to verify the goal attainment, the first steps towards an evaluation concept were taken in this paper. With the exemplary evaluation of Global Bike Go, a series of mini BSGs for SAP ERP teaching, initial indications could be...
Big data (BD) is one of the major technological trends of today and finds application in numerous domains and contexts. However, while there are huge potential benefits, there are also considerable challenges. One of these is the difficulty to make sure the respective applications have the necessary quality. For this purpose, the application of tes...
Today’s society is heavily driven by data intensive systems, whose application promises immense benefits. However, this only applies when they are utilized correctly. Yet these types of applications are highly susceptible to errors. Consequently, it is necessary to test them comprehensively and rigorously. One method that has an especially high foc...
The decarbonization and resulting energy transition has a lot of challenges for the future. Decentralized power generation and its stabilization of the power grid is a problem that can be solved by more accurate monitoring of power consumption using smart meters. The overall objective is to prevent blackouts. Therefore, the reliability of the used...
Robotic process automation (RPA) is a key technology for automating mundane, repetitive back-office tasks that are typically performed by human workers. Because RPA instantiations, known as software robots, operate partially with the same graphical user interfaces as humans and can only replicate the business processes for which they were previousl...
The significant increase in the amount of generated data provides potential for organizations to improve performance. Accordingly, Data Science (DS), which encompasses the methods to extract knowledge from data, has increased in popularity. Nevertheless, enterprises often fail to reap the benefits from data as they suffer from high failure rates in...
With the growing energy hunger of today’s society and the ongoing transition from fossil fuels to renewable energies, the demands on the electrical power grids are growing. Consequently, grid operators are seeking for ways to improve their performance, flexibility, and reliability. One of these avenues is the use of artificial intelligence. However...
Through knowledge extraction from data with various methods, Data Science (DS) allows organizations to achieve improvements in performance. The execution of these projects is mainly supported by DS process models such as CRISP-DM. As a high percentage of DS undertakings are failing, revisions to current DS project management practices become necess...
An overview of common process models for the implementation of data science is presented in this article. Since the development of KDD and CRISP-DM, the central ideas have been examined from broader perspectives, and further frameworks have been created. In addition to the core activities that are conducted in the individual process phases, typical...
Due to the ongoing trend of digitalization, the importance of software for today’s society is continuously increasing. Naturally, there is also a huge interest in improving its quality, which led to a highly active research community dedicated to this aim. Consequently, a plethora of propositions, tools, and methods emerged from the corresponding e...
Data Science projects aim to methodologically extract knowledge and value from data to help organizations to improve performance. Dedicated process models are applied to support the management of these endeavors. However, high failure rates in the execution highlight the need for improvements in Data Science project management. Therefore, in this p...
Design Science Research (DSR) has emerged as a methodological approach for conducting research whose overarching goal is to develop new means, referred to as artifacts, in the form of constructs, models, methods, and instantiations to improve reality. Due to their context dependent nature and the growing interest in the rapid development of new tec...
The idea of gaining new insights about business processes from data is driving companies to invest in the area of data analytics. In particular, the opportunities to improve typical business metrics such as revenue, profit, but also market share are pivotal for the motivation. In most data analytics projects, it is a challenge to determine to what...
To enable reliable software releases, automated concepts are increasingly being sought as part of the necessary test processes to be able to detect potential errors at an early stage. Static code analysis tools (SCATs) are especially suited for this purpose, as these testing tools perform their checks without actually executing the software. Thus,...
In times of crisis, at which many people experience insecurity, fear, and uncer-tainty, the government plays a critical role when it comes to civil protection and the distribution of vital information. However, in many cases, misleading or con-tradictory information are spread by various sources, exacerbating the desperation of the local population...
Within the last decade, big data became a promising trend for many application areas, offering immense potential and a competitive edge for various organizations. As the technical foundation for most of today´s data-intensive projects, not only corresponding infrastructures and facilities but also the appropriate knowledge is required. Currently, s...
In a world that is more and more driven by data, decision makers are
provided with a huge amount of information. However, while this appears to be a good development, they also face the challenge of getting through those masses to get to the actually important insights. To ease this task for managers that oversee highly complex situations, manageme...
With the increasing relevance of decentralisation for the software development process, being aware of the possible challenges and corresponding solutions has become more relevant than ever. The scientific body of knowledge is currently containing many publications about specific aspects of decentralisation, but is lacking in collections that cover...
As a consequence of the ongoing digitalization in today’s society, the amount of data that is being produced is rapidly increasing. Moreover, not only the volume of the data is growing, but there are also more complex types of data and, depending on the use case, it is also necessary to integrate heterogenous data into one analysis. Since tradition...
The trend of connectivity dominates the technological progress. The number of networked devices is constantly increasing and the use of smart meters has become more societally relevant. For that reason, reliability is an important attribute of related architectures. To calculate reliability, it is required to do a specific analysis for the entire s...
In today’s knowledge-driven world, the competences of its citizens have turned into one of the most valuable resources a country can build its success upon. Hence, it is of extremely high importance to not only provide high-quality education, but to also make life-long learning as convenient as possible. This is the main idea behind the national ed...
Knowledge, information, and modern technologies have become some of the most influential drivers of today’s society, consequently leading to a high popularity of the concepts of big data (BD). However, their actual harnessing is a demanding task that is accompanied by many barriers and challenges. To facilitate the realization of the corresponding...
For almost a decade now, big data has become the foundation of today's data-intensive systems used for various disciplines, such as data science or artificial intelligence. Although a certain level of maturity has been reached since then, not only in the domain itself but also in the engineering of interconnected systems, many problems still exist...
The concept of big data hugely impacts today’s society and promises immense benefits when utilized correctly, yet the corresponding applications are highly susceptible to errors. Therefore, testing should be performed as much and rigorous as possible. One of the solutions proposed in the literature is the test driven development (TDD) approach. TDD...
Big data has emerged to be one of the driving factors of today’s society. However, the quality assurance of the corresponding applications is still far from being mature. Therefore, further work in this field is needed. This includes the improvement of existing approaches and strategies as well as the exploration of new ones. One rather recent prop...
For many years now, the domain of big data has received lots of attention, as numerous studies, reports, and research articles reveal. Up to this date, a multitude of different definitions, guidelines, technologies, architectures, and best practices appeared that were supposed to provide clarity in the jungle of existing solutions. Instead, it led...
The ongoing digitization of our world leads to many areas of our lives being more pleasant and improved. New technologies and paradigms are emerging to support the development of software and systems. Their proliferation not only leads to higher complexity of potential solutions, but also to the problem of finding qualified people. Especially enter...
With a continuously increasing amount and complexity of data being produced and captured, traditional ways of dealing with their storing, processing, analysis and presentation are no longer sufficient, which has led to the emergence of the concept of big data. However, not only the implementation of the corresponding applications is a challenging t...
The approach of copying business models to create a successful company is discredited as non-innovative and propagated as a low-risk variant of entrepreneurship although a simple so-called copycat would only increase competition in the market and not guarantee success. Hence, the question of which characteristics of a business model enable success...
For many years now big data presents itself as a technical starting point for a multitude of different trends, including data science, artificial intelligence, the internet of things, industry 4.0, and others. However, many decision makers are still challenged by the realization of related projects. One of the major problems arises from the multitu...
Digitalization move has motivated companies to adopt software for their business processes. This led to increase projects' complexity. Project management process plays a significant role in reaching a successful completion of IT-projects. It has been seen that management methodologies like waterfall and scrum are still not sufficient for every proj...
Due to the ever increasing amount of data that is produced and captured in today’s world, the concept of big data has risen to prominence. However, implementing the respective applications is still a challenging task. This holds especially true, since a high degree of flexibility is desirable. One potential approach is the utilization of novel dece...
Big data has evolved to a ubiquitous part of today’s society. However, despite its popularity, the development and testing of the corresponding applications are still very challenging tasks that are being actively researched in pursuit of ways for improvement. One newly introduced proposition is the application of test driven development (TDD) in t...
Microservices and Big Data are renowned hot topics in computer science that have gained a lot of hype. While the use of microservices is an approach that is used in modern software development to increase flexibility, Big Data allows organizations to turn today’s information deluge into valuable insights. Many of those Big Data architectures have r...
In the recent years, the term big data has attracted a lot of attention. It refers to the processing of data that is characterized mainly by 4Vs, namely volume, velocity, variety and veracity. The need for collecting and analysing big data has increased manifolds these days as organizations want to derive meaningful information out of any data that...
Motivation has been observed to be crucial for learning success. In computer science education, new approaches for knowledge transfer that create more engagement by their users seem to be a desirable solution. This can be facilitated, inter alia, through business simulation games (BSG). Within this paper, a preliminary literature review is conducte...
Due to the constantly increasing amount and variety of data produced, big data and the corresponding technologies have become an integral part of daily life, influencing numerous domains and organizations. However, because of its diversity and complexity, the necessary testing of the corresponding applications is a highly challenging task that lack...
Der vorliegende Beitrag gibt eine grundlegende Einführung zu dem Begriff Big Data. Nach einer kurzen Darstellung der Relevanz und Aktualität des Themas, wird im Folgenden auf den Begriff selbst, und die ihm zugrunde liegenden Charakteristiken der Daten eingegangen. Im Anschluss erfolgt eine Vorstellung technischer Grundlagen, wobei ausgewählte Konz...
Der Beitrag stellt den Einfluss der Beschaffung auf den Unternehmenserfolg dar, schildert wie hierbei durch die Nutzung mobiler Endgeräte ein Mehrwert geschaffen werden kann und welche Herausforderungen dabei zu beachten sind. Zusätzlich werden mögliche Anwendungsszenarien und zwei exemplarische Ansätze aus der Praxis vorgestellt. Zum Abschluss wer...
While the emerging technology of robotic process automation is primarily suitable for back office processes, companies use traditional chatbots to support customer interaction in the front office. However, customer requests that require more than written information usually demand an employee to execute an internal process. This paper summarizes th...
Big data analytics and the according applications have gained huge importance in daily life. This results on the one hand from their versatility and on the other hand from their capability to greatly improve an organization’s performance when utilized appropriately. However, despite their prevalence and the corresponding attention through practitio...
The goal of this work is to obtain a framework that represents the technological core aspects of blockchain, separated into components, their subcategories and related basic technologies. In order to gain a holistic view of blockchain, with the help of the framework, technologies constructs should be made identifiable as blockchain. For this purpos...
The amount of semi-structured textual data being generated all over the world wide web has placed the limelight on the fields of natural language processing and text mining. With the continuous growth of IT companies, there are a lot of new and diverse projects getting introduced. Hence, the need for project management tools that provide robust, ef...
Due to the growing amount of data that are produced, the concept of big data analytics gained plenty of interest among researchers and practitioners. However, there are many challenges related to its use. One of those is the data storing and subsequently the corresponding extraction of required data. Since this process is prone to error, it require...
Big data attracts researchers and practitioners around the globe in their desire to effectively manage the data deluge resulting from the ongoing evolution of the information systems domain. Consequently, many decision makers attempt to harness the potentials arising with the use of those modern technologies in a multitude of application scenarios....
Although the term of big data and related technologies received lots of attention in recent years, many projects are less successful than anticipated. One of the most crucial steps in the planning of a system includes the modeling of the underlying architecture. However, as of now, no standardized approach exists that facilitates the modeling of bi...
Big Data is a term that gained popularity due to its potential benefits in various fields, and is progressively being used. However, there are still many gaps and challenges to overcome, especially when it comes to the selection and handling of relevant technologies. A consequence of the huge number of manifestations in this area, growing each year...
Big data has proved to be one of the most promising trends in recent years. However, many challenges and barriers still exist, especially when it comes to the strategic planning and realization of those kinds of projects. Most of all, the selection and combination of the domain–related technologies represents a sophisticated endeavor that increases...
The utilization of data in general and big data in particular offers large opportunities, but is at the same time accompanied by a huge number of potential causes for failure. To avoid those pitfalls when realizing such undertakings, at the beginning, it is necessary to develop an in-depth understanding of those causes. This contribution analyses t...
The copycat approach - replicating proven business models - presents an alternative to following the classic entrepreneurial path in venture creation which is highly contested due to the common perception of being non-innovative and of low-risk. However, existing literature contradicts this conception. This paper features a case study of German e-c...
For many years now, researchers as well as practitioners are harnessing well-known data mining processes, such as the CRISP-DM or KDD, to realize their data analytics projects. In times of big data and data science, at which not only the volume, variety and velocity of the data increases, but also the complexity to process, store and manage them, c...
As big data is a rather young, but growing discipline, lots of confusion about the general nature of this term exists. Consequently, multiple research endeavours to discover unique characteristics, technologies, techniques and their interconnections were conducted, resulting in comprehensive classification approaches. For this purpose, various taxo...
In order to ensure adequate education and training in a statistics-driven field, large sets of content-compliant training data (CCTD) are required. Within the context of practical orientation, such data sets should be as realistic as possible concerning the content in order to improve the learning experience. While there are different data generato...
Der vorliegende Beitrag gibt eine grundlegende Einführung zu dem Begriff Big Data. Nach einer kurzen Darstellung der Relevanz und Aktualität des Themas, wird im Folgenden auf den Begriff selbst, und die ihm zugrunde liegenden Charakteristiken der Daten eingegangen. Im Anschluss erfolgt eine Vorstellung technischer Grundlagen, wobei ausgewählte Konz...
Big data is considered as one of the most promising technological advancements in the last decades. Today it is used for a multitude of data intensive projects in various domains and also serves as the technical foundation for other recent trends in the computer science domain. However, the complexity of its implementation and utilization renders i...
Today, the amount and complexity of data that is globally produced increases continuously, surpassing the abilities of traditional approaches. Therefore, to capture and analyze those data, new concepts and techniques are utilized to engineer powerful big data systems. However, despite the existence of sophisticated approaches for the engineering of...
Alongside the fourth industrial revolution and many other emerging technologies and markets, IT sectors are obligated to achieve a much higher level of efficiency in managing IT infrastructure through automation. Recent studies reported various statistical analysis that suggests tremendous growth in the energy consumption of data center operation....
The amount of data to be produced and analyzed is increasing year by year. As a result, the concept of big data gained interest among researchers and practitioners. However, a plethora of challenges and potentials require the attention from researchers and practitioners to enhance the future development. Apart from the pure processing of the data a...
In today’s world, digitization has reached an important role. While more and more enterprises are interested in a realization and aware of its importance, it rather lacks on the implementation. This applies most of all for those with a low budget for new investments, such as small and medium enterprises (SME), as well as even smaller forms (VSE). F...
Big Data is a crucial pillar for many of today’s newly emerging business models. Areas of application range from consumer analysis over medicine to fraud detection. All of those domains require reliable software. Even though imperfect results are accepted in Big Data software, bugs and other defects can have drastic consequences. Therefore, in this...
Der Beitrag stellt den Einfluss der Beschaffung auf den Unternehmenserfolg dar, schildert wie hierbei durch die Nutzung mobiler Endgeräte ein Mehrwert geschaffen werden kann und welche Herausforderungen dabei zu beachten sind. Zusätzlich werden mögliche Anwendungsszenarien und zwei exemplarische Ansätze
aus der Praxis vorgestellt. Zum Abschluss wer...