ArticlePublisher preview available

Standard International Trade Classification: From Spreadsheet to OWL-2 Ontology

Authors:
To read the full-text of this research, you can request a copy directly from the authors.

Abstract and Figures

Trade classifications are a necessary prerequisite for the compilation of trade statistics, and they should – beyond that – be regarded as a valuable base for the definition of shared controlled vocabularies for linked business data that deal with import, export etc. The Standard International Trade Classification (SITC) provided by the UN Statistics Division is a widely used classification mostly applied for scientific and analytical purposes. SITC – as most other trade classifications – is available today only in text or spreadsheet formats. These formats reveal the inner hierarchical structure of SITC to the human reader, because SITC trade codes are built according to the decimal classification scheme, but unfortunately, SITC’s inner structure is opaque to computer applications in text and spreadsheet formats. The paper discusses an approach to set up an OWL-2 ontology for SITC that states subsumption relations between classes of goods. This kind of semantic underpinning of SITC is suited to ease both checking and extending SITC and to derive from it a shared controlled vocabulary for business linked data. Some problems of today’s SITC (among them missing inner nodes of the trade code hierarchy) are carefully discussed, and the paper motivates several decisions that were taken for ontology design. Finally, the study introduces the semantic reasoner as a tool for the (at least partial) automatic derivation of structural information for SITC from the trade code building rule. The paper reports on reasoner runtimes observed for different versions of the SITC ontology and for different versions of the Pellet reasoner.
This content is subject to copyright. Terms and conditions apply.
RESEARCH PAPER
Standard International Trade Classification
From Spreadsheet to OWL-2 Ontology
Norbert Luttenberger Jesper Zedlitz
Received: 19 December 2016 / Accepted: 10 April 2017 / Published online: 1 September 2017
Springer Fachmedien Wiesbaden GmbH 2017
Abstract Trade classifications are a necessary prerequisite
for the compilation of trade statistics, and they should –
beyond that – be regarded as a valuable base for the defi-
nition of shared controlled vocabularies for linked business
data that deal with import, export etc. The Standard
International Trade Classification (SITC) provided by the
UN Statistics Division is a widely used classification
mostly applied for scientific and analytical purposes. SITC
– as most other trade classifications – is available today
only in text or spreadsheet formats. These formats reveal
the inner hierarchical structure of SITC to the human
reader, because SITC trade codes are built according to the
decimal classification scheme, but unfortunately, SITC’s
inner structure is opaque to computer applications in text
and spreadsheet formats. The paper discusses an approach
to set up an OWL-2 ontology for SITC that states sub-
sumption relations between classes of goods. This kind of
semantic underpinning of SITC is suited to ease both
checking and extending SITC and to derive from it a shared
controlled vocabulary for business linked data. Some
problems of today’s SITC (among them missing inner
nodes of the trade code hierarchy) are carefully discussed,
and the paper motivates several decisions that were taken
for ontology design. Finally, the study introduces the
semantic reasoner as a tool for the (at least partial)
automatic derivation of structural information for SITC
from the trade code building rule. The paper reports on
reasoner runtimes observed for different versions of the
SITC ontology and for different versions of the Pellet
reasoner.
Keywords Trade classification Semantic reasoning
1 Introduction
Trade classifications are a necessary prerequisite for the
preparation of trade statistics that are used to describe – for
administrative and/or scientific purposes – domestic and
international flows of goods. Without challenging this
intended use, we discuss trade classifications in this paper
with a different motivation: We argue that trade classifi-
cations – beyond their obvious purpose – can also be
considered as valuable sources for the definition of shared
controlled vocabularies. Shared controlled vocabularies are
at the foundation of Linked (Open) Data collections. With
this perspective in mind, we argue that the terminology
work that is required for the provision of meaningful linked
business data can profit from existing vocabularies, among
them those that give trade statistics their shape.
Unfortunately, most trade classifications today are
available in text or table (spreadsheet) formats. These
formats address the human reader; they are not well suited
to reveal the inner, mostly hierarchical structure of trade
classifications to computer processing and examination.
Experience has shown that with this kind of formats
structural problems may arise on several occasions, for
instance when adjusting a trade classification to new
demands. To avoid these problems, we therefore prefer a
logics-based format for the development of shared
Accepted after 2 revisions by Prof. Dr. Abramowicz.
Prof. Dr. N. Luttenberger (&)Dr. J. Zedlitz
Research Group for Communication Systems, Department of
Computer Science, Christian-Albrechts-University in Kiel, Kiel,
Germany
e-mail: n.luttenberger@email.uni-kiel.de
Dr. J. Zedlitz
e-mail: j.zedlitz@email.uni-kiel.de
123
Bus Inf Syst Eng 60(4):305–316 (2018)
https://doi.org/10.1007/s12599-017-0495-z
Content courtesy of Springer Nature, terms of use apply. Rights reserved.
... The first one is mainly used by countries to collect their trade statistics. The latter one, which is the one selected for this analysis, is maintained by the United Nations (UN) and recommended for analytical purposes [44,45]. Within the SITC nomenclature, there were four revisions available at the time when data was collected: revision 1 containing data from 1962, revision 2 containing data from 1976, revision 3 with data from 1986 and revision 4 with data from 2007. ...
Article
Full-text available
Aerospace manufacturing industry is predicted to continue growing. Rising demand is triggering the current global aerospace ecosystem to evolve and adapt to challenges never faced before. New players into the aerospace manufacturing industry and the development of new ecosystems are evidencing its evolution. Understanding how the aerospace ecosystem has evolved is thus essential to prepare optimal conditions to nurture its growth. Recent studies have successfully combined economics and network science methods to map, analyse and predict the evolution of industrial ecosystems. In comparison to previous studies which apply network science-based methodologies to macro-economic research, this paper uses these methods to analyse the evolution of a particular industrial ecosystem, namely the aerospace sector. In particular, we develop bipartite country-product networks based on trade data over 25 years, to identify patterns and similarities in the evolution of developed aerospace manufacturing countries ecosystems. The analysis is elaborated at a macroscopic (network) and microscopic (nodes) levels. Motivated by studies in ecological networks, we use nestedness analysis to find patterns depicting the distribution and evolution of exported products across ecosystems. Our analysis reveals that developed ecosystems tend to become more analogous, as countries lean towards having a revealed comparative advantage (RCA) in the same group of products. Countries also tend to become more nested in their aerospace product space as they start developing a higher RCA. It is revealed that although countries develop an advantage on unique products, they also tend to increase competition with each other. Further analysis shows that manufactured products have a stronger correlation to an aerospace ecosystem than primary products; and in particular, the automotive sector shows the highest correlation with positive aerospace sector evolution. Competition between countries with well-developed aerospace ecosystems tends to centre on automotive parts, general industrial machinery, power generating machinery and equipment, and chemical materials and products.
Chapter
For countries whose economies are largely based on oil revenues, the impacts of global oil price shocks play a significant role. This research aims at investigating the impacts of the post-2014 oil price shock on the post-soviet resources-rich countries of the Caspian Basin, namely Azerbaijan, Russia, and Kazakhstan. The drop in oil prices caused the economic slowdown in all three countries. In comparison with the other two countries, Azerbaijan was able to prevent the negative impact with a social package. The post-shock social package was implemented by the Azerbaijani government in 2019, which covered more than 3 million people. This package can be considered as an “Azerbaijani model” for the stability of macroeconomic indicators during the devaluation period.
Article
Full-text available
Traditionally connected to Philosophy, the term ontology is increasingly related to information systems areas. Some researchers consider the approaches of the two disciplinary contexts to be completely different. Others consider that, although different, they should talk to each other, as both seek to answer similar questions. With the extensive literature on this topic, we intend to contribute to the understanding of the use of the term ontology in current research and which references support this use. An exploratory study was developed with a mixed methodology and a sample collected from the Web of Science of articles published in 2018. The results show the current prevalence of Computer Science in studies related to ontology and also of Gruber's view suggesting ontology as kind of conceptualization, a dominant view in that field. Some researchers, particularly in the field of Biomedicine, do not adhere to this dominant view, but to another one that seems closer to ontological study in the philosophical context. The term ontology, in the context of information systems, appears to be consolidating with a different meaning from the original presenting traces of the process of metaphorization in the transfer of the term between the two fields of study.
Conference Paper
Full-text available
The classification of products and services enables reliable and efficient electronic exchanges of product data across organizations. Many companies classify products (a) according to generic or industry specific product classification standards, or (b) by using proprietary category systems. Such classification systems often contain thousands of product classes that are updated over time. This implies a large quantity of useful product category information for e-commerce applications on the Web of Data. Thus, instead of building up product ontologies from scratch, which is costly, tedious, error-prone, and high-maintenance, it is generally easier to derive them from existing classifications. In this paper, we (1) describe a generic, semi-automated method for deriving OWL ontologies from product classification standards and proprietary category systems. Moreover, we (2) show that our approach generates logically and semantically correct vocabularies, and (3) present the practical benefit of our approach. The resulting product ontologies are compatible with the GoodRelations vocabulary for e-commerce and with schema.org and can be used to enrich product and offer descriptions on the Semantic Web with granular product type information from existing data sources.
Article
Full-text available
Web technology is starting to penetrate many aspects of our daily life. Its importance as a medium for business transactions will grow significantly during the next few years. In terms of market volume B2B will be the most interesting area and it will also be the area where the new technology will lead to drastic changes in established customer relationships and business models. Simple established one2one trading relationships will be replaced by open and flexible n2m relationships between customers and vendors. However, this new flexibility in electronic trading also generates serious challenges for the parties who want to realize it. The main problem here is the heterogeneity of information descriptions used by vendors and customers. Product descriptions, catalog formats, and business documents are often unstructured and non-standardized. Intelligent solutions that help to mechanize the process of structuring, standardizing, aligning and personalizing are a key requisite to successfully overcome the current bottlenecks of eCommerce enabling its further growth. In this paper, we discuss the main problems in information integration in this area and describe how Ontology technology can help to solve many of them. 1
Conference Paper
Full-text available
Hierarchical classifications, thesauri, and informal taxonomies are likely the most valuable input for creating, at reasonable cost, non-toy ontologies in many domains. They contain, readily available, a wealth of category definitions plus a hierarchy, and they reflect some degree of community consensus. However, their transformation into useful ontologies is not as straightforward as it appears. In this paper, we show that (1) it often depends on the context of usage whether an informal hierarchical categorization schema is a classification, a thesaurus, or a taxonomy, and (2) present a novel methodology for automatically deriving consistent RDF-S and OWL ontologies from such schemas. Finally, we (3) demonstrate the usefulness of this approach by transforming the two e-business categorization standards eCl@ss and UNSPSC into ontologies that overcome the limitations of earlier prototypes. Our approach allows for the script-based creation of meaningful ontology classes for a particular context while preserving the original hierarchy, even if the latter is not a real subsumption hierarchy in this particular context. Human intervention in the transformation is limited to checking some conceptual properties and identifying frequent anomalies, and the only input required is an informal categorization plus a notion of the target context. In particular, the approach does not require instance data, as ontology learning approaches would usually do.
Conference Paper
Full-text available
The emerging Semantic Web technologies have the potential to deeply influence the further development of the Internet Economy. In this paper we pro- pose a scenario based research approach to predict the effects of these new tech- nologies on electronic markets and business models of traders and intermediaries. We develop a concrete scenario for the domain of business to consumer electronic commerce and explain its technical realization. Furthermore, we analyse the po- tential impacts on market participants and highlight a range of technical and or- ganizational obstacles which have to be solved before semantic technologies can be deployed in an e-business context.
Chapter
In this chapter, we illustrate the work conducted at the Food and Agriculture Organization of the United Nations (FAO) with the creation of a network of ontologies about fisheries, developed with NeOn technologies and methodologies. The network included the main thematic areas needed to talk about fish stocks (often referred to as aquatic resources) and included data sources of various types: reference data for time series, thesauri for document indexing, actual time series, and the reuse of an existing well-known ontology maintained by FAO (the geopolitical ontology). Such a network of ontologies was also used within a prototypical web-based application. After describing the methodologies used to create the network, and its contents and features, we draw some conclusions and highlight the lessons learned during the process.
Article
Representing time-dependent information has become increasingly important for reasoning and querying services defined on top of RDF and OWL. In particular, addressing this task properly is vital for practical applications such as modern biographical information systems, but also for the Semantic Web/Web 2.0/Social Web in general. Extending binary relation instances with temporal information often translates into a massive proliferation of useless container objects when trying to keep the underlying RDF model. In this paper, we argue for directly extending RDF triples with further arguments in order to easily represent time-dependent factual knowledge and to allow for practical forms of reasoning. We also report on a freely available lightweight OWL ontology for representing biographical knowledge that models entities of interest via a tri-partite structure of the pairwise disjoint classes Abstract, Object, and Happening. Even though the ontology was manually developed utilizing the Protégé ontology editor, and thus sticking to the triple model of RDF, the meta-modelling facilities allowed us to cross-classify all properties as being either synchronic or diachronic. When viewing the temporal arguments as "extra" arguments that only apply to relation instances, universal biographical knowledge from the ontology can still be described as if there is no time.
Article
In Semantic Web languages, such as RDF and OWL, a property is a binary relation: it is used to link two individuals or an individual and a value. However, in some cases, the natural and convenient way to represent certain concepts is to use relations to link an individual to more than just one individual or value. These relations are called n-ary relations. For example, we may want to represent properties of a relation, such as our certainty about it, severity or strength of a relation, relevance of a relation, and so on. Another example is representing relations among multiple individuals, such as a buyer, a seller, and an object that was bought when describing a purchase of a book. This document presents ontology patterns for representing n-ary relations in RDF and OWL and discusses what users must consider when choosing these patterns.
Article
This paper provides a self-contained first introduction to description logics (DLs). The main concepts and features are explained with examples before syntax and semantics of the DL SROIQ are defined in detail. Additional sections review light-weight DL languages, discuss the relationship to the Web Ontology Language OWL and give pointers to further reading.