Available via license: CC BY 4.0
Content may be subject to copyright.
Page 1/16
Towards a taxonomy for business capabilities determining data
value
Markus Hafner ( markus.hafner@tecnico.ulisboa.pt )
Instituto Superior Técnico
Miguel Mira da Silva
Instituto Superior Técnico
Research Article
Keywords: Data value, data valuation, business capability, taxonomy
Posted Date: February 22nd, 2023
DOI: https://doi.org/10.21203/rs.3.rs-2609006/v1
License: This work is licensed under a Creative Commons Attribution 4.0 International License. Read Full License
Page 2/16
Abstract
Data and its valuation become increasingly crucial for enterprises and academia, which coincides a multitude of data valuation
approaches including numerous affected focus areas, dimensions, and characteristics. Therefore, this paper analyzes different
approaches to determine data value from a business capability perspective according to the TOGAF standard. Specically, this paper
deals with (a) the development of a taxonomy for data valuation business capabilities (DVBC) as well as (b) the taxonomy validation
by the use of existing data valuation approaches. The applied methodologies are taxonomy development techniques for information
systems, which are based on a previously executed systematic literature review with a sample size of 67 articles. Further, the data
valuation business capability taxonomy is validated through applying two recent data valuation approaches from academia. As a
result, the taxonomy developed consists of four business capability layers, nine dimensions, and 36 characteristics. The
characteristics are of exclusive or non-exclusive nature, depending on their meaningfulness, and are validated by the successful
application of two data valuation approaches. Compiled ndings meet both objective and subjective quality standards. With the
developed DVBC taxonomy, scholars and professionals are equipped with a tool to classify and structure their data valuation
endeavors. In addition, the DVBC taxonomy bridges the domains of information systems, enterprise architecture management, and
data management to serve as a foundation for interdisciplinary value generation with and through data.
1 Introduction
Generating value from and with data is not just a short-term hype, but a paradigm that has become rmly established in a wide range
of industries and enterprises [1–7]. For this, it is particularly relevant to determine the value of data, the associated data initiatives, and
data-driven use cases in a systematic and standardized manner [2, 8, 9].
The current solutions for determining the data value in science and practice are scattered over a broad range with a wide variety of
content depth. While several enterprises do not determine their data value at all [2], others use estimation techniques based on
knowledge and gut feeling of subject matter experts [10]. Furthermore, a previous and currently submitted literature review unveiled,
that there are data valuation concepts from a multitude of research areas, especially computer science, decision science, as well as
business, management, and accounting. It is noticeable that these data valuation approaches pursue different if not opposing targets
and often occur encapsulated from each other. Consequently, it is challenging for enterprises to identify, classify, and relate data
valuation approaches as well as to implement them in their ecosphere [11–15].
In order to streamline the ideas, concepts, and approaches regarding data value determination and make them applicable to
enterprises, this paper denes data valuation as a holistic business capability. This not only includes the pure determination of the
data value, but also expands the focus to include processes, people, resources, and information [16–18].Furthermore, we hypothesize
that it is benecial for professionals and scholars to obtain a classication and differentiation key for data valuation business
capabilities (DVBC) [19, 20]. Consequently, within the scope of this paper, a DVBC taxonomy is developed, which supports practitioners
in analyzing and understanding the complex topic of data valuation and deriving conclusions. For developing the DVBC taxonomy,
following research question serves as compass:
RQ: What are the main dimensions and characteristics of a data valuation business capability?
For the purpose of contributing to the creation of value from and with data as well as closing the existing research gap, we answer the
research question by describing the scientic background in Section 2. In particular, section 2 discusses the concepts of business
capabilities and data value including their relationship to one another and describes related taxonomies. Section 3 elaborates on the
systematic literature review (SLR) as foundation for the subsequent taxonomy development process tailored to information systems
according to [19]. The nal DVBC taxonomy is presented in Section 4 before being tested in Section 5 using two data valuation
approaches provided by academia. In Section 6 the ndings are discussed before Section 7 completes with a conclusion.
2 Research Background
This section elaborates on the concepts of business capabilities and data value. Furthermore, adjacent and related taxonomies are
described.
2.1 Business capabilities
Page 3/16
The denition of a business capability is not consistently expressed in academia. A literature analysis that has been carried out by [18]
underlines this conclusion and therefore bundles the various denitions of a business capability as follows: “A particular ability that a
business may possess or exchange to achieve a specic corporate goal.” [18].
Since the above-mentioned denition is rather generic, we use the more detailed and practical denition of a business capability
according to
TOGAF
.
TOGAF
is considered as scientically sound [18, 21, 22] and is practically in use [23], which is crucial for the
applicability of a business capability taxonomy in practice. According to
TOGAF
, a business capability consists of the components
information, processes, roles
, and
resources
.
Information
, meaning the knowledge associated with a business capability [16, 17], is
crucial to perform various
processes
[16, 24] within a business capability. These processes may be executed by people [17] associated
with
roles
[16] using tangible and intangible
resources
[16, 24].
2.2 Data valuation business capability
In order to understand what is meant by a data valuation business capability, the delimitation of the term
data value
serves as
foundation. Data and the associated information imply a value in diverse shades that can be generated through the implementation of
data-driven use cases and the direct or indirect sale of data [2–5]. This data value can be of social-ecological, economical [25, 26],
functional and/or symbolic nature [10, 26] with the purpose of adding a measurable business value [13]. Thereby, data value is
determined by a multitude of value drivers [27, 28], underlying theories [29–33], as well as frameworks [34, 35].
The contents and purpose of the above-mentioned data value denitions suggests the application of the business capability concept
in order to achieve the purpose of determining the data value. Thus, the idea of bundling isolated approaches from the eld of data
value emerges, including the data value drivers, theories, as well as frameworks and expanding them in the context of a business
capability. The resulting data valuation business capabilities therefore comprise information, processes, roles, and resources (see
Section 2.1).
2.3 Related taxonomies
In academic literature, there have already been attempts to explain the traits of data value and its determination approaches, for
example in the form of taxonomies. These existing concepts have been elaborated especially in the two domains (IT) business value
and data value.
One taxonomy for classifying value catalogues was developed by [36] to intensify the linkage between enterprise performance and IT
investments. Value catalogs are therefore dened as reference lists that determine the economic impact of the utilization of
information technology. In the value catalogue taxonomy, dimensions are introduced that are used as a foundation for the DVBC
taxonomy. Specically, the dimension relating to
methods for quantifying the value
via value catalogues is particularly close to the
DVBC taxonomy. Thereby, methods under certainty and uncertainty encompassing static, dynamic, and qualitative characteristics are
distinguished. However, the development of additional IT business value assessment approaches are noted as open research gaps,
towards the DVBC taxonomy can contribute.
[37] also adapted the concept of business value and concretized it to articial intelligence (AI) use cases. The resulting taxonomy aims
to identify dimensions and characteristics that enable the value contribution of AI use cases at an organizational scope. The
dimensions
source of business value improvement
and
benet to business value
are particularly relevant for the DVBC taxonomy. The
source of business value improvement
classies the effect implied by an AI use case. This effect, either of automation,
transformation, or information nature, can serve as a basis for formulating potential value drivers and theories for determining the
value. Furthermore, the
benet to business value
dimension distinguishes which type of improvement as a result of an AI use case
contributes to the increase in business value. The characteristics of
cost, quality, revenue
, as well as
risk and complianceperformance
can all impact business value. Thus, these characteristics are declared as non-exclusive since several performance improvements can
be covered simultaneously by an AI use case. A similar logic is used by the DVBC taxonomy in Chapter 4.
In addition to taxonomies in the area of business value and information technology, [38] have developed a taxonomy regarding data
value in the context of decision making. Two main dimensions of data value are dened, namely
data quality
and
data utility
, which
are also included as non-exclusive characteristics in the DVBC taxonomy under the dimension
data value driver
. In the decision-
making data value taxonomy, the data value dimensions and their characteristics are subdivided into more ne-grained metrics that
promote the applicability of the taxonomy. The decision-making data value taxonomy thus serves to promote an understanding of the
Page 4/16
scope and limits of data value but does not address the classication of data valuation approaches in the context of business
capabilities in an enterprise architecture. As a further research step, it is proposed to concretize data value assessment frameworks, to
which the DVBC taxonomy also contributes.
3 Research Design
In the following subsections, the methods used for developing the DVBC taxonomy are described. In addition, relevant literature for the
taxonomy design is analyzed.
3.1 Literature review
The rst step of our research design is adhering to [39] recommendations for systematic literature reviews (SLR) combined with the
forward- and backward-search according to [40].
As a basis for our SLR, we have developed a search query with reference to our research question. The search query ("Compan*" OR
"Enterprise*") AND ("Data Pric*" OR "Data Valu*") AND ("Approach*" OR "Architecture*" OR "Capabilit*" OR "Method*" OR "Model*") NOT
("Data Values") combines data valuation approaches in combination with enterprise architectures and capabilities. Furthermore,
related wording and synonyms are covered, while data values, which refer to technical values of data storages, are explicitly excluded.
To consider a complete set of relevant published full-text access literature (journal articles, conference proceedings, and book
chapters) in English language from 2012 onwards, the search query was applied to the databases
ACM Digital Library, AIS eLibrary
Ebsco, Emerald Insight, IEEE Xplore, ScienceDirect, Scopus, Web of Science,
and
Wiley Online Library
.
In total 306 scientic contributions were recognized as raw sample. Out of the raw sample, 133 duplicates were removed. A thorough
abstract reading was used for the ensuing 173 scientic contributions, which led to the deletion of another 140 articles and a sample
size of 33 articles. According to [40], both backward (+24) and forward search (+10) were applied to ll in any potential gaps in the
search query. We were able to expand the size of the nal sample using these techniques from 33 to 67 scientic contributions. Figure
1 illustrates the systematic literature process and related ndings.
3.2 Taxonomy development process
After the rst foundational step, the systematic literature review, the taxonomy development method according to [19] was applied,
which is tailored to the research area information systems. As the research topic data value is addressed by a wide range of domains
(see Chapter 3.1), the iterative approach of the method is particularly well suited to encompass the complexity and interdisciplinarity
of the topic.
[19] propose two taxonomy development approaches. While the conceptual-to-empirical approach is more suitable when few data is
available on the objects to be classied (in this case, DVBC), the empirical-to-conceptual approach is more suitable for a taxonomy
with a broad data foundation. In this paper, a combination of both approaches was used. In the rst iteration, the conceptual-to-
empirical approach was applied in order to establish guiderails for the proper denition of business capabilities. The depth of the
content of a DVBC in relation to dimensions and characteristics was created using the empirical-to-conceptual approach in further
iterations, as many data valuation approaches are available due to the SLR carried out. Figure 2 below shows the applied method.
The rst step in taxonomy development according to [19] is to determine the meta-characteristics of the taxonomy. It is of particular
relevance that the meta-characteristic is aligned with the purpose of the taxonomy (see Chapter 1). Therefore, the meta-characteristic
may be described as the identication of the especially pertinent layers, dimensions, and characteristics of a DVBC.
In addition, the applied methodology in step 2 provides for dening ending conditions of subjective as well as objective nature. In the
context of this paper,
conciseness
(consider a maximum of ve to nine dimensions [19, 41]),
robustness
,
comprehensiveness
,
extendibility
, and
explainability
are regarded as subjective ending conditions.
In order to meet scientic requirements, [19] further specify eight objective ending conditions. These ending conditions have been
condensed into the four applicable ending conditions
completeness
,
granularity
,
uniqueness
, and
stability
to ensure a simplied
understanding of this taxonomy development without signicantly changing their content. Figure 3 lists and explains the ending
conditions and shows the development of the degree of fulllment of the ending conditions after each iteration.
Page 5/16
3.2.1 Conceptual-to-empirical iteration
In the rst iteration in the taxonomy development process, the framework for detailing a DVBC was created by using the conceptual-to-
empirical approach (see steps three to ve in Figure 2). Specically, the framework serves as a logical bracket of the underlying
dimensions as well as characteristics. The technique of determining a framework by using layers has been applied in various
taxonomies already [42–44] and is therefore dened as state of the art.
The dened layers
information
,
resources
,
roles
, and
processes
are based on the components of a business capability according to
TOGAF
(see Section 2.1) [16, 45]. From this the hypothesis is derived, that every DVBC includes the four dimensions of
information
,
resources
,
roles
, and
processes
.
3.2.2 Empirical-to-conceptual iterations
The second iteration is based on the systematic literature review and therefore follows the empirical-to-conceptual approach in the
taxonomy development process. Based on the SLR, we found that the literature on data valuation is examined from a variety of
angles, especially from the research areas
business, management, and accounting
as well as
computer science
according to [46]. The
high granularity level in the literature regarding approaches and concepts to determine the data value was lifted to a higher level of
abstraction and therefore formed the three taxonomy dimensions
data value driver
,
data valuation theory
, as well as
data valuation
tooling
.
In the third iteration, the
information
layer was completed. In this context, the
purpose
dimension is particularly relevant in order to
adequately frame a business capability [16, 18, 24, 47] as well as to serve as a starting point for determining the data value [27]. In
addition, a DVBC requires information on objects, which are going to be classied in the taxonomy. Consequently, the
data valuation
object
dimension was included.
The fourth iteration sought to complete the
roles
layer. The analysis of the SLR ndings illustrates that two main groups particularly
stood out with regard to their frequency of mentions with reference to past and future research. The rst, most frequently and deeply
discussed group includes all stakeholders involved in determining the data value and is therefore summarized under the
value
determination stakeholder
dimension in this taxonomy [29, 48]. The second relevant group is derived particularly from the relevance
for future research and can be subsumed in the dimension
value auditing stakeholder
[48–51].
The fth iteration deepened the functional content and outcomes of the
process
layer of a DVBC. From this the dimension
component
is formedwhich details the content of a DVBC [52]. Further the outcomes are considered in the
result
dimension [35, 49,
53].
In the sixth iteration, all of the ending conditions of a subjective and objective nature (see Figure 3) were met and thus the taxonomy
development process was terminated. The outcome of the last sixth iteration is declared as a nal DVBC taxonomy and is further
described below.
4 Taxonomy: Data Valuation Business Capability
The nal taxonomy for DVBC is described in the following sections. In total the developed taxonomy in Figure 4 has four layers, nine
dimensions and a subset of 36 characteristics.
We learned during the taxonomy development process that it is impossible to limit the choice of characteristics to be mutually
exclusive for some dimensions since important information would be lost. The dimensions of
data value driver
,
data valuation theory,
and
component
in particular imply the most diverse forms of expression per scientic study, per enterprise, and per data valuation
approach. Consequently, non-exclusive characteristics in these dimensions are supported in accordance with other recently published
taxonomies in the research areas of digital transformation and information systems (Berger et al., 2018; Engel et al., 2022; Gelhaar et
al., 2021; Jöhnk et al., 2017; Lis and Otto, 2021). Further, the visualization style of a morphological box is chosen in order to increase
the intuitiveness and usability of the taxonomy [55, 56].
4.1 Layer 1 – Information
Page 6/16
The rst layer of a business capability, and therefore also of this taxonomy, comprises information or so-called knowledge that a
business capability requires and consumes in order to determine a data value [16]. Specically, the information layer encompasses the
three dimensions
purpose
,
data valuation object,
and
data value driver
.
4.1.1 Purpose
The
purpose
dimension characterizes the goal of the data valuation and thus sets the guide rails for the detailing of a DVBC. Two
characteristics and their combination can therefore be exclusively dened.
While
qualitative data valuation
focuses on generating contextual knowledge about the data value [34, 38, 50],
quantitative data
valuation
concentrates on numerical information [34, 35, 57]. The existing literature shows that a
combination
[34] of both
characteristics to different extents is also possible.
4.1.2 Data valuation object
After the question why data valuation should take place (purpose), the dimension
data valuation object
aims on the question whose
value should be determined.
From this, the two exclusive characteristics
bundled information
and
non-bundled information
can be dened. Bundled information
refers to data clusters that can be logically grouped according to their content in each use case. Examples of this are data products,
data assets, and datasets in various forms. In contrast, non-bundled information are isolated raw data points that have not yet
undergone any logical clustering [15, 58–60].
4.1.3 Data value driver
The third dimension
data value driver
poses the question of which parameters affecting the data value are considered for the data
value determination. Since data valuation is an interdisciplinary topic with an arbitrarily high degree of complexity, it is evident that the
data value driver
dimension cannot have exclusive characteristics. Rather, DVBC can consider a variety of data value drivers. At this
point, it is hypothesized that the more data value drivers are considered, the more accurate the determined data value will be. This
hypothesis underlines the non-exclusive nature of the dimension, although the hypothesis needs to be validated in another study.
We distinguish six data value drivers and, to underline the non-exclusive nature of this dimension, add the characteristic
other
. The
other
characteristic includes both proprietary data value drivers and potential additional data value drivers outside the analyzed
literature.
The characteristic
business utility
includes the impact of a data-driven use case on a process or an enterprise [12, 26, 61]. In addition,
the characteristic
cost
may include expenses associated with the data being evaluated, such as data collection, processing, analysis,
and management costs [29, 34, 62–64] as well as opportunity costs [35, 61, 65]. Furthermore, we complement data management
related data value drivers and tailor them into the characteristics
data durability and lifetime
[66, 67],
data quality
[51, 61, 64, 68, 69],
and
data security and privacy
[64, 70–72]. As a nal characteristic, data value drivers are considered which imply a certain subjectivity.
The
sentiment and perception
characteristic includes, for example, the perceived data value and thus the willingness-to-pay [28, 32,
73–75] as well as risks associated with the valuation and monetization of data [76].
4.2 Layer 2 – Resources
The second layer in the DVBC taxonomy considers the associated resources of tangible and intangible types, which are required for
determining the value of data [16]. As an intangible resource, the dimension
data valuation theory
is introduced. Furthermore, the
dimension
data valuation tooling
represents the tangible dimension of the resource layer.
4.2.1 Data valuation theory
Similar to the data value drivers in Section 4.1.3, the dimension
data valuation theory
is dened as a cluster of non-exclusive
characteristics, since DVBC can be based on numerous theories to determine the data value. Here, we distinguish seven data valuation
theories.
Page 7/16
The rst characteristic
economic
comprises all theories that determine the data value based on price-quantity diagrams, cost curves
and conventional hardware-oriented pricing (cost, competition, customer) [29, 32, 57]. In addition,
game theory
can be used as a data
valuation theory, which can be divided into two characteristics,
cooperative
[30, 31] and
non-cooperative game theory
[63, 75]. The
fourth characteristic
decision theory
summarizes approaches, e.g., analytic hierarchy process [51, 67] or fair knapsack [77], that assess
the data value while taking uncertainty and vagueness into account. A more technical data valuation theory deals with the valuation
of database queries of different types and can therefore be subsumed under the term
query-based
[29, 64, 78–80]. Furthermore, the
sixth characteristic
index-based
deals with the indexation of data value drivers for the determination of an indexed data value [62, 67].
In addition to the aforementioned data valuation theories, which represent certain paradigms in data value determination, the seventh
characteristic clusters all
proprietary
theories that have not been considered in the taxonomy dimension so far or are based on expert’s
gut feeling only [10].
4.2.2 Data valuation tooling
To combine and apply data valuation theories and data value drivers, scholars propose different rather personal as well as rather
application-based approaches. Consequently, we dene three exclusive characteristics in the dimension
data valuation tooling
in our
taxonomy.
The rst characteristic in the data valuation tooling dimension is
interpersonal elaboration
. Interpersonal elaboration is a vehicle to
support data value determination used by multiple scholars. The assessment of the value of data and their use cases by domain
experts is particularly relevant for this characteristic [12, 35, 50].
In contrast, there are
models and applications
that support and facilitate the determination of data value. Models and applications can
occur in various forms. More theoretical constructs such as economic cost or price curves are considered, as are ecosystem-oriented
intermediary solutions, for example in the form of data marketplaces [48, 66, 81].
As a third characteristic in the data valuation tooling dimension, a
combination
of interpersonal elaboration as well as models and
applications is introduced, since particularly practical research approaches suggest these combined data valuation approaches [34,
62].
4.3 Layer 3 – Roles
The third layer of the DVBC taxonomy focuses on the roles and responsibilities that stakeholders, individuals, and organizational units
play in order to determine the value of data [16]. In concrete terms, the roles layer consists of two roles dimensions which deal with the
determination of data value (
value determination stakeholder
) on the one hand and its auditing (
value auditing stakeholder
) on the
other.
4.3.1 Value determination stakeholder
The stakeholders included in the determination of data value play a central role in a DVBC, since they affect the ecosystem of an
enterprise to be addressed. Four exclusive characteristics in the form of a continuum are established in this taxonomy, ranging from
the inclusion of purely
internal
stakeholders to the inclusion of purely
external
stakeholders. Mixed forms of internal and external
stakeholders are dened as intermediate characteristics. These mixed forms include direct collaboration
without an intermediary
as
well as
with an intermediary
such as a data broker or data marketplace [29, 48, 76]. Regardless of the internal or external nature of
data valuation stakeholders, such as data providers [29, 48, 82] or data buyers [29, 48, 82], all are relevant to some extend in a variety
of data valuation approaches, which underscores their necessity in this taxonomy.
4.3.2 Value auditing stakeholder
The second dimension of the roles layer containing of three exclusive characteristics is
value auditing stakeholder
. Auditing the data
value is relevant for validating its determination process and result. At this point,
internal data value auditors
or
third-party data value
auditors
can occur in science and practice [48, 50, 51]. Moreover, no audit can be performed at all, leading to the third characteristic
not existing
.
4.4 Layer 4 – Processes
Page 8/16
The fourth layer of the DVBC taxonomy focuses on related processes and patterns in order to accomplish a certain output of a
business capability [16]. To be more precise, in this case
components
and
results
are dened as dimensions under the process layer.
4.4.1 Component
The components dimension describes the main practices of a business capability, which in turn include individual sub-processes,
activities, and functional modules. The results of the executed SLR suggest that four main characteristics of a DVBC can be formed.
The components and thus characteristics in this dimension can also occur in a combined manner. Consequently, the components
dimension including its characteristics is also dened as non-exclusive.
A predominant number of data valuation approaches focus on the data value assessment, i.e., the pure determination of the data
value, which is why the
data value assessment
is recorded as a characteristic in this taxonomy [12, 50, 83]. In addition, there are
approaches that assign the data value to dedicated entities. The resulting characteristic is described as
data value allocation
[52]. Two
further components include the temporal dimension in their scope of functions. While the
data value prediction
characteristic
forecasts a future data value [50, 52], e.g., via customer lifetime value [34], the
data value monitoring
characteristic compares the
planned and actual data value [52, 83].
4.4.2 Result
The results dimension contrasts three exclusive characteristics that describe the outcome of the DVBC. On the one hand, a
relative
data value
can be determined [35], which compares individual data values, data initiatives, and use cases with each other in relative
terms and outputs them, for example, in order of the corresponding value. On the other hand, an absolute data value can be dened as
the result of a data valuation. A distinction can be made between a
specic absolute data value
and an
approximate absolute data
value
. While the specic absolute data value aims at an exact determination of the data value [34, 53], the approximate absolute data
value focuses on a corresponding estimation of the data value, for example, considering uncertainty or the necessary computing
power [79, 84].
5 Application Of The Dvdc Taxonomy
This section assesses the DVBC taxonomy's utility and exemplies its intended usage, as suggested by [19]. In order to test our DVBC
taxonomy, we have selected two contemporary data value determination approaches that follow a more technology-supported
approach by [35] on the one hand and a more ecosystem-supported multilayered approach by [34] on the other hand. Figure 5
illustrates the ndings of the application, which are further described in Sections 5.1 and 5.2.
5.1 Approach 1– Automatic data value analysis method for relational databases
Approach 1 by [35] is an automated, metric-based data value assessment technique that provides a scoring mechanism for
automatically determining the business value of bundled data aka datasets data in relational databases.
To calculate the data value, a multitude of metrics are considered that are represented as data value drivers in the DVBC taxonomy,
which legitimates the non-exclusive denition of its data value driver dimension. In concrete terms, the metrics
utility
,
volume
, and
usage
of approach 1 can be assigned to the taxonomy characteristic
business utility
. The approach of [35] also reects the
characteristic
data quality
by the metrics
timeliness
and
quality
. The
cost
characteristic is embodied by Bendechache et al. via the
metric
replacement costs
. In addition, [35] also reect the characteristic
sentiment and perception
by including the metrics of
legal risk
and
competitive advantage
.
The resource layer of the DVBC taxonomy is characterized by a combination of
interpersonal elaboration
as well as
models and
applications
for determining the data value for the example of approach 1. On the one hand, Bendechache et al. apply a survey-based
questionnaire, which they is validated using a query-based approach and assign corresponding data value indices in the form of a
scoring system to the datasets. Therefore,
internal stakeholders
are required for determining the data value, even though
no data value
audit
takes place.
As a result of the approach 1 data valuation approach, a
relative data value
is issued by the scoring system, which serves solely for
the
data value assessment
.
Page 9/16
5.2 Approach 2– From qualitative to quantitative data valuation
In contrast to approach 1, the data valuation approach of [34] combines both qualitative and quantitative elements. The object of
valuation in approach 2 is bundled data, described as datasets, which are based on use cases. In addition to considering
business
utility
as a data value driver, data attributes in general are considered, which are represented by the characteristics
data durability and
lifetime
,
data quality
, as well as
data security and privacy
in the DVBC taxonomy. Furthermore,
costs
relating to the data value are
considered, which are processed, among other things, in a data valuation theory with economic characteristics. This is the so-called
cost-based
and
transaction-based data valuation
approach. To complete the data valuation framework,
proprietary
theories are
included, such as dening threshold metrics, which increase or decrease the data value to a dedicated percentage.
To enable data value determination, [34] apply both
interpersonal elaboration,
e.g., in the form of an initial segmentation of relevant
use cases, as well as models and applications, e.g., data marketplaces as vehicles for transaction-based data valuation.
The value determination stakeholders can be
internal and external
, considering intermediaries, e.g., data marketplaces. Similar to
approach 1, there is no data value audit in approach 2
As a result of the data valuation framework according to Stein et al., a
specic absolute data value
is to be ascertained, which is used
for
data value assessment
on the one hand and for
data value prediction
on the other, for example in the form of the customer lifetime
value.
6 Discussion
After the completion of the DVBC taxonomy, the raison d'être requires to be challenged regarding quality and content of the taxonomy.
First of all, it must be noted that although data valuation approaches and concepts are described in this literature sample, they are not
yet dened as business capabilities. Rather, this DVBC taxonomy also serves to raise the multitude of data valuation approaches to
the level of abstraction of a business capability. Therefore, it is crucial to generalize ne-grained concepts and to detail generic
concepts.
For this taxonomy the TOGAF standard served as frame construct providing the layers
information, processes, roles
, and
resources.
At
this point, however, it is important to note that other notions for describing a business capability [17, 18] also have their reason for
existence and would possibly result in a different structuring of the taxonomy.
Regardless of the framework construct for describing a business capability, the quality of the DVBC taxonomy based on it must be
discussed. Therefore, scholars have shown that the taxonomy evaluation criteria
(a) usefulness, (b) applicability, (c)
comprehensiveness, (d) robustness, (e) conciseness, (f) extensibility
, and
(g) explanatory
are of particular relevance [85]. As Figure 3
shows, the evaluation criteria (c) – (g) are explicitly fullled for the development of the DVBC taxonomy. Further, evaluation criteria (a)
– (b) are implicitly fullled by the application of the taxonomy to the two approaches in Chapter 5.
In addition, [85] dene six guidelines to ensure the compliance of excellence standards regarding the development of taxonomies.
Guideline 1
(scoping of taxonomy evaluation),
guideline 2
(justication of objective ending conditions),
guideline 3
(justication of
subjective ending conditions) and
guideline 4
(demonstration of DVBC taxonomy applicability) have been successfully carried out on
the basis of this paper. Further,
guideline 5
(evaluation of DVBC taxonomy usefulness) and
guideline 6
(long-term re-evaluation) are
planned for future work.
Consequently, the excellence standards for the content and quality of the developed DVBC taxonomy are considered as given with the
note of future eld testing and long-term validation.
7 Conclusion
Our research is motivated by several contributions that highlight the signicance of data valuation and demand for more in-depth
study in this area [11–15, 38]. Therefore, this paper focuses on the development of a data valuation business capability in order to
provide a concept to classify as well as structure existing and emerging data valuation approaches from the perspective of a business
capability. To achieve this, the taxonomy development method for information systems by [19] is applied, tested against two
contemporary data valuation approaches [34, 35], and the taxonomy is reviewed for its raison d'être in terms of content [86] and
Page 10/16
quality based on the evaluation criteria by [85]. The result is an excellence-assured data valuation business capability taxonomy
consisting of four layers, nine dimensions, and 36 characteristics.
Our research has limitations. One limitation relates to the structure of the taxonomy, which is based on the practice-approved and
scientically sound
TOGAF
standard, knowing that other structuring concepts exist for describing a business capability. Another
limitation of content relates to the subject of the dimensions and characteristics based on the results of a systematic literature review.
It is possible that particular concepts in the literature may not be covered by this taxonomy. Due to the breadth and scope of existing
data valuation approaches and their meaningful integration in this taxonomy, some taxonomy dimensions have been dened as non-
exclusive, which may dilute the strict delineation of data valuation business capabilities to some degree.
With a view to future scientic work, three thematic areas can be formulated. On the one hand, it is recommended to validate the
developed taxonomy in a eld study of long-term nature. In addition, a data value ontology, which follows scientically sound
standards such as
OntoClean
[87] could be developed on the basis of the taxonomy. Furthermore, the concept for dening data
valuation as a business capability should be tested and rened in real-world scenarios following the design science research paradigm
[86].
Declarations
No research funds and cooperations inuencing the research are to be declared. Further, no competing interests are directly or
indirectly related to the paper.
References
1. KPMG International Cooperative (2020) KPMG’s Global Automotive Executive Survey 2020
2. Thieullent A-L, Jiang Z, Perhirin V, et al (2020) The data-powered enterprise
3. Kaufmann M (2019) Big data management canvas: A reference model for value creation from data. Big Data and Cognitive
Computing 3:1–18. https://doi.org/10.3390/bdcc3010019
4. Faroukhi AZ, el Alaoui I, Gahi Y, Amine A (2020) Big data monetization throughout Big Data Value Chain: a comprehensive review.
J Big Data 7:. https://doi.org/10.1186/s40537-019-0281-5
5. Gartner Inc (2022) Denition of Data Monetization - IT Glossary I Gartner. In: Gartner Glossary.
https://www.gartner.com/en/information-technology/glossary/data-monetization. Accessed 16 Feb 2023
. Eggers J, Hein A (2020) Turning Big Data into Value: A Literature Review on Business Value from Process Mining. In: European
Conference on Information Systems (ECIS). pp 1–21
7. Hafner M, Hirsch M, Willemsen M, et al (2022) Transformation towards a Data-Driven Business.
https://www2.deloitte.com/content/dam/Deloitte/de/Documents/consulting/Deloitte_Transformation%20towards%20a%20Data-
Driven%20Business.pdf. Accessed 16 Feb 2023
. Leitner-Hanetseder S, Lehner OM (2022) AI-powered information and Big Data: current regulations and ways forward in IFRS
reporting. Journal of Applied Accounting Research. https://doi.org/10.1108/JAAR-01-2022-0022
9. Pei J (2022) A Survey on Data Pricing: from Economics to Data Science. IEEE Trans Knowl Data Eng 34:4586–4608.
https://doi.org/10.1109/TKDE.2020.3045927
10. Grover V, Chiang RHL, Liang TP, Zhang D (2018) Creating Strategic Business Value from Big Data Analytics: A Research
Framework. Journal of Management Information Systems 35:388–423. https://doi.org/10.1080/07421222.2018.1451951
11. Elia G, Polimeno G, Solazzo G, Passiante G (2020) A multi-dimension framework for value creation through big data. Industrial
Marketing Management 90:508–522
12. Brennan R, Attard J, Petkov P, et al (2019) Exploring data value assessment: A survey method and investigation of the perceived
relative importance of data value dimensions. In: 21st International Conference on Enterprise Information Systems. SciTePress,
pp 188–195
13. Enders T (2018) Exploring the Value of Data - a Research Agenda. In: International Conference on Exploring Services Science
14. Brinch M, Gunasekaran A, Fosso Wamba S (2021) Firm-level capabilities towards big data value creation. J Bus Res 131:539–
548. https://doi.org/10.1016/J.JBUSRES.2020.07.036
Page 11/16
15. Fricker S, Maksimov Y (2017) Pricing of Data Products in Data Marketplaces. In: International Conference on Software Business
1. Gonzalez S, Hansen K, Hendricky H, et al (2018) TOGAF ® Series Guide Business Capabilities
17. Brits J-P, Botha G, Herselman M (2007) Conceptual Framework for Modeling Business Capabilities. In: Informing Science and IT
Education Joint Conference
1. Offerman T, Stettina CJ, Plaat A (2017) Business Capabilities: A Systematic Literature Review and a Research Agenda. In:
International Conference on Engineering, Technology and Innovation (ICE/ITMC). IEEE, Madeira
19. Nickerson RC, Varshney U, Muntermann J (2017) A method for taxonomy development and its application in information
systems. European Journal of Information Systems 336–359
20. Omair B, Alturki A (2020) An Improved Method for Taxonomy Development in Information Systems. International Journal of
Advanced Computer Science and Applications
21. Bui QN (2017) Evaluating Enterprise Architecture Frameworks Using Essential Elements. Communications of the Association for
Information Systems 41:121–149. https://doi.org/10.17705/1CAIS.04106
22. Al-Turkistani HF, Aldobaian S, Latif R (2021) Enterprise Architecture Frameworks Assessment: Capabilities, Cyber Security and
Resiliency Review. In: 1st International Conference on Articial Intelligence and Data Analytics (CAIDA). pp 79–84
23. Cameron BH, McMillan EN (2013) Analyzing the Current Trends in Enterprise Architecture Frameworks. In: Journal of Enterprise
Architecture. pp 60–71
24. Wissotzki M (2015) An Exploration of Capability Research. In: IEEE International Enterprise Distributed Object Computing
Workshop, EDOCW. Institute of Electrical and Electronics Engineers Inc., pp 179–184
25. Attard J, Brennan R (2018) A semantic data value vocabulary supporting data value assessment and measurement integration.
20th International Conference on Enterprise Information Systems 2:133–144. https://doi.org/10.5220/0006777701330144
2. Jürg M, Rodolfo B, Lukas S, et al (2022) Quantitative Modelling of the Value of Data for Manufacturing SMEs in Smart Service
Provision. In: ITM Web of Conferences. EDP Sciences, p 4001
27. Green A (2012) Understanding the value of customer data. Journal of Direct, Data & Digital Marketing Practice 13:221–233
2. Spiekermann S, Korunovska J (2017) Towards a value theory for personal data. Journal of Information Technology 62–84
29. Tian Y, Ding Y, Fu S, Liu D (2022) Data Boundary and Data Pricing Based on the Shapley Value. IEEE Access 10:14288–14300.
https://doi.org/10.1109/ACCESS.2022.3147799
30. Ghorbani A, Kim MP, Zou J (2020) A Distributional Framework For Data Valuation. In: 37th International Conference on Machine
Learning. PMLR, pp 3535–3544
31. Xu A, Zheng Z, Wu F, Chen G (2022) Online Data Valuation and Pricing for Machine Learning Tasks in Mobile Health. In:
Proceedings - IEEE INFOCOM. Institute of Electrical and Electronics Engineers Inc., pp 850–859
32. Koutris P, Upadhyaya P, Balazinska M, et al (2015) Query-Based Data Pricing. Journal of the ACM 62:.
https://doi.org/10.1145/2770870
33. Verma N, Singh V (2022) Query-Based Data Valuation Strategy: An Exploratory View. In: Saraswat M, Roy S, Chowdhury C,
Gandomi AH (eds) Lecture Notes in Networks and Systems. Springer Science and Business Media Deutschland GmbH, pp 687–
702
34. Stein H, Holst L, Stich V, Maass W (2021) From Qualitative to Quantitative Data Valuation in Manufacturing Companies. In: Dolgui
A, Bernard A, Lemoine D, et al (eds) Advances in Production Management Systems. Articial Intelligence for Sustainable and
Resilient Production Systems. Springer International Publishing, Cham, pp 172–180
35. Bendechache M, Limaye N, Brennan R (2020) Towards an Automatic Data Value Analysis Method for Relational Databases. In:
22nd International Conference on Enterprise Information systems (ICEIS-2020)
3. Seufert S, Wulfert T, Wernsdörfer JE (2021) Towards a Reference Value Catalogue for a Company-Specic Assessment of the IT
Business Value - Proposing a Taxonomy to Select IT Impacts From Existing Catalogues. In: 29th European Conference on
Information Systems. Duisburg-Essen
37. Engel C, Buschhoff J, Ebel P (2022) Structuring the Quest for Strategic Alignment of Articial Intelligence (AI): A Taxonomy of the
Organizational Business Value of AI Use Cases. In: Hawaii International Conference on System Sciences
3. Lega M, Colot C, Burnay C, Linden I (2022) Supporting Data Selection for Decision Support Systems: Towards a Decision-Making
Data Value Taxonomy. In: International Conference on Software Engineering and Knowledge Engineering, SEKE. Knowledge
Page 12/16
Systems Institute Graduate School, pp 487–492
39. Okoli C (2015) A guide to conducting a standalone systematic literature review. Communications of the Association for
Information Systems 37:879–910. https://doi.org/10.17705/1cais.03743
40. Webster J, Watson RT (2002) Analyzing the Past to Prepare for the Future: Writing a Literature Review. MIS Quarterly 26:xiii–xxiii
41. Miller GA (1994) The magical number seven, plus or minus two: Some limits on our capacity for processing information. Psychol
Rev 343–352
42. Gelhaar J, Groß T, Otto B (2021) A Taxonomy for Data Ecosystems. In: 54th Hawaii International Conference on System Sciences.
University of Hawai’i at Manoa
43. Gomes SB, Santoro FM, Mira Da Silva M (2021) A Taxonomy for Digital Technology. In: 27th Americas Conference on Information
Systems
44. Lis D, Otto B (2021) Towards a Taxonomy of Ecosystem Data Governance. In: 54th Hawaii International Conference on System
Sciences. University of Hawai’i at Manoa
45. The Open Group (2005) The TOGAF® Standard, Version 9.2
4. Scimago Lab, Scopus (2022) SJR: Scientic Journal Rankings. In: Scimago Journal & Country Rank.
https://www.scimagojr.com/journalrank.php. Accessed 16 Feb 2023
47. Amiri AK, Cavusoglu H, Benbasat I (2015) Enhancing strategic IT alignment through common language: Using the terminology of
the resource-based view or the capability-based view? In: The 2015 International Conference on Information Systems. Fort Worth
4. Bataineh AS, Mizouni R, el Barachi M, Bentahar J (2016) Monetizing Personal Data: A Two-Sided Market Approach. In:
International Conference on Ambient Systems, Networks and Technologies. Elsevier, pp 472–479
49. Wang Q, Zhao H, Wang Q, et al (2020) Enabling secure wireless multimedia resource pricing using consortium blockchains. Future
Generation Computer Systems 110:696–707. https://doi.org/10.1016/J.FUTURE.2019.09.026
50. Holst L, Stich V, Schuh G, Frank J (2020) Towards a Comparative Data Value Assessment Framework for Smart Product Service
Systems. In: IFIP Advances in Information and Communication Technology. Springer, pp 330–337
51. Li Z, Ni Y, Gao X, Cai G (2019) Value Evaluation of Data Assets: Progress and Enlightenment. In: 4th IEEE International Conference
on Big Data Analytics, ICBDA 2019. Institute of Electrical and Electronics Engineers Inc., pp 88–93
52. Brennan R, Attard J, Helfert M (2018) Management of data value chains, a value monitoring capability maturity model. In: 20th
International Conference on Enterprise Information Systems. SciTePress, pp 573–584
53. Barthel S, Schallehn E (2013) The monetary value of information: A leakage-resistant data valuation. In: Lecture Notes in
Informatics (LNI), Proceedings - Series of the Gesellschaft fur Informatik (GI). pp 131–138
54. Berger S, Denner M-S, Roeglinger M (2018) The Nature of Digital Technologies - Development of a Multi-layer Taxonomy. In: 26th
European Conference on Information Systems (ECIS)
55. Szopinski D, Kundisch D, Schoormann T (2020) Visualize Different: Towards Researching the Fit Between Taxonomy
Visualizations and Taxonomy Tasks. In: 15th International Conference on Wirtschaftsinformatik. GITO Verlag, pp 1255–1261
5. Ritchey T (2006) Problem structuring using computer-aided morphological analysis. Journal of the Operational Research Society
57. Liang J, Yuan C (2021) Data Price Determinants Based on a Hedonic Pricing Model. Big Data Research 25:100249.
https://doi.org/10.1016/J.BDR.2021.100249
5. Niyato D, Hoang DT, Luong NC, et al (2016) Smart data pricing models for the internet of things: A bundling strategy approach.
IEEE Netw 30:18–25. https://doi.org/10.1109/MNET.2016.7437020
59. Yu H, Zhang M (2017) Data pricing strategy based on data quality. Comput Ind Eng 112:1–10.
https://doi.org/10.1016/J.CIE.2017.08.008
0. Rix C, Frank J, Stich V, Urban D (2021) Pricing Models for Data Products in the Industrial Food Production. In: Dolgui A, Bernard A,
Lemoine D, et al (eds) Advances in Production Management Systems. Articial Intelligence for Sustainable and Resilient
Production Systems. Springer International Publishing, Cham, pp 553–563
1. Schneider J, Abraham R, Meske C, vom Brocke J (2022) Articial Intelligence Governance For Businesses. Information Systems
Management. https://doi.org/10.1080/10580530.2022.2085825
2. Li J, Wang D, Qi G, et al (2022) Alliance chain-based simulation on a new clinical research data pricing model. Ann Transl Med
10:836–836. https://doi.org/10.21037/atm-22-3671
Page 13/16
3. Song B, Song J, Ye J (2020) A Dynamic Pricing Mechanism in IoT for DaaS: A Reinforcement Learning Approach. In: Liu Y, Wang
L, Zhao L, Yu Z (eds) Advances in Intelligent Systems and Computing. Springer, pp 604–615
4. Shen Y, Guo B, Shen Y, et al (2022) Personal big data pricing method based on differential privacy. Comput Secur 113:102529.
https://doi.org/10.1016/J.COSE.2021.102529
5. Ghorbani A, Zou J (2019) Data Shapley: Equitable Valuation of Data for Machine Learning. In: 36th International Conference on
Machine Learning
. Robinson SC (2017) What’s your anonymity worth? Establishing a marketplace for the valuation and control of individuals’
anonymity and personal data. Digital Policy, Regulation and Governance 19:353–366. https://doi.org/10.1108/DPRG-05-2017-
0018
7. Lin Z, Wu Y (2016) Research on the Method of Evaluating the Value of Data Assets. In: International Conference on Education, E-
learning and Management Technology
. Otto B (2015) Quality and Value of the Data Resource in Large Enterprises. Information Systems Management 32:234–251.
https://doi.org/10.1080/10580530.2015.1044344
9. Lu J, Li S, Zhang X (2021) A Study on the Business Data Evaluation Method of the Power Grid Value-Added Service. In: 5th
International Conference on Power and Energy Engineering, ICPEE 2021. Institute of Electrical and Electronics Engineers Inc., pp
288–292
70. Huang Y, Milani M, Chiang F (2020) Privacy-aware data cleaning-as-a-service. Inf Syst 94:101608.
https://doi.org/10.1016/J.IS.2020.101608
71. Wagner A, Wessels N, Buxmann P, Krasnova H (2018) Putting a price tag on personal information – A literature review. In: Hawaii
International Conference on System Sciences. IEEE Computer Society, pp 3760–3769
72. Duan Y, Liu P, Feng Y (2022) Pricing strategies of two-sided platforms considering privacy concerns. Journal of Retailing and
Consumer Services 64:. https://doi.org/https://doi.org/10.1016/j.jretconser.2021.102781
73. Heckman J, Boehmer E, Peters E, et al (2015) A Pricing Model for Data Markets. In: iConference 2015 Proceedings
74. Fehrenbach D, Herrando C (2021) The effect of customer-perceived value when paying for a product with personal data: A real-life
experimental study. J Bus Res 137:222–232. https://doi.org/10.1016/J.JBUSRES.2021.08.029
75. Liang F, Yu W, An D, et al (2018) A Survey on Big Data Market: Pricing, Trading and Protection. IEEE Access 6:15132–15154.
https://doi.org/10.1109/ACCESS.2018.2806881
7. Yang J, Xing C (2019) Personal data market optimization pricing model based on privacy level. Information (Switzerland) 10:.
https://doi.org/10.3390/info10040123
77. Stahl F, Vossen G (2016) Fair knapsack pricing for data marketplaces. In: East European Conference on Advances in Databases
and Information Systems. Springer Verlag
7. Deep S, Koutris P (2017) QIRANA: A framework for scalable query pricing. In: ACM SIGMOD International Conference on
Management of Data. Association for Computing Machinery, pp 699–713
79. Wang X, Wei X, Liu Y, Gao S (2018) On pricing approximate queries. Inf Sci (N Y) 453:198–215.
https://doi.org/10.1016/J.INS.2018.04.036
0. Miao X, Gao Y, Chen L, et al (2022) Towards Query Pricing on Incomplete Data. IEEE Trans Knowl Data Eng 34:4024–4036.
https://doi.org/10.1109/TKDE.2020.3026031
1. Zhang M, Beltrán F, Liu J (2022) A Survey of Data Pricing for Data Marketplaces
2. Zhang Z, Song W, Shen Y (2021) A Reasonable Data Pricing Mechanism for Personal Data Transactions with Privacy Concern. In:
APWeb-WAIM Joint Conference on Web and Big Data 2017. Springer Science and Business Media Deutschland GmbH, pp 64–71
3. Debattista J, Attard J, Brennan R (2018) Semantic data ingestion for intelligent, value-driven big data analytics. In: International
Conference on Big Data Innovations and Applications. Institute of Electrical and Electronics Engineers Inc., pp 1–8
4. Deep S, Koutris P (2017) The design of arbitrage-free data pricing schemes. In: 20th International Conference on Database Theory
(ICDT 2017. Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing
5. Szopinski D, Schoormann T, Kundisch D (2020) Criteria as a Prelude for Guiding Taxonomy Evaluation. In: 53rd Hawaii
International Conference on System Sciences
. vom Brocke J, Hevner A, Maedche A (2020) Introduction to Design Science Research. In: Design Science Research Cases. pp 1–13
Page 14/16
7. Guarino N, Welty CA (2009) An Overview of OntoClean. In: Staab S, Studer R (eds) Handbook on Ontologies. Springer-Verlag,
Berlin Heidelberg, pp 201–220
Figures
Figure 1
Systematic literature review process based on [39, 40]
Figure 2
Taxonomy development method for information systems by [19]applied to this paper
Page 15/16
Figure 3
Degree of fulllment of taxonomy development ending conditions per iteration
Figure 4
Taxonomy for Data Valuation Business Capabilities