
Thomas HartmannBosch · Center of Competence Big Data
Thomas Hartmann
Dr.-Ing.
About
39
Publications
8,022
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
204
Citations
Publications
Publications (39)
For research institutes, data libraries, and data archives, validating RDF data according to predefined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in two international working groups on RDF validation and jointly identified requirements to formulate constraints and valid...
The formulation of constraints and the validation of RDF data against these constraints is a common requirement and a much sought-after feature, particularly as this is taken for granted in the XML world. Recently, RDF validation as a research field gained speed due to shared needs of data practitioners from a variety of domains. For constraint for...
For research institutes, data libraries, and data archives, RDF data validation according to predefined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in the DCMI RDF Application Profiles Task Group and in cooperation with the W3C Data Shapes Working Group, we identified and...
Physical data description (PHDD) of existing or published data (tables) in a rectangular format. The data could be either represented in records with character-separated values (CSV) or in records with fixed length. PHDD could be used standalone or together with related vocabularies like Data Catalog Vocabulary (DCAT) or DDI-RDF Discovery. Descript...
This specification defines the DDI-RDF Discovery Vocabulary (Disco), an RDF Schema vocabulary that enables discovery of research and survey data on the Web. It is based on DDI (Data Documentation Initiative) XML formats.
This paper serves as appendix for the PhD thesis entitled 'Validation Framework for RDF-based Constraint Languages', submitted to the Department of Economics and Management at the Karlsruhe Institute of Technology (KIT).
The provided research data, research results, and publications form the basis for the PhD thesis entitled ’Validation Framework for RDF-based Constraint Languages’, submitted to the Department of Economics and Management at the Karlsruhe Institute of Technology (KIT).
Ontology engineers worked in close collaboration with experts from the statistical domain in order to develop an ontology of a subset of the Data Documentation Initiative. In this paper, we give a brief overview of the DDI ontology’s current status and discuss in detail the most significant use cases associated with the DDI data model’s ontology an...
Ontology engineers and experts from the social, behavioral, and economic sciences developed a data discovery ontology covering a subset of both the DDI Codebook and Lifecycle models, and implemented a rendering of DDI XML instances to RDF (Resource Description Framework). The main goals associated with the design process of the DDI ontology were to...
In recent years, Semantic Web technologies have matured and have made their way into various domains – for example, bioinformatics and eGovernment -- where they are used in different applications to provide value-added services for users. In this paper, we present an overview of several representative applications that use Semantic Web technologies...
The GESIS Data Catalogue contains the study descriptions for all archived studies at GESIS, currently more than 5000 datasets mainly from survey research in the social sciences. These descriptions include information about primary researchers, research topics and objects, used methods, and the resulting dataset, which is mainly used for archiving a...
For data practitioners embracing the world of RDF and Linked Data, the openness and flexibility is a mixed blessing. For them, data validation according to predefined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in the DCMI RDF Application Profiles Task Group and in cooper...
From 2012 to 2015 together with other Linked Data community members and experts from the social, behavioral, and economic sciences (SBE), we developed diverse vocabularies to represent SBE metadata and tabular data in RDF. The DDI-RDF Discovery Vocabulary (DDI-RDF) is designed to support the dissemination, management, and reuse of unit-record data,...
To ensure high quality of and trust in both metadata and data, their representation in RDF must satisfy certain criteria - specified in terms of RDF constraints. From 2012 to 2015 together with other Linked Data community members and experts from the social, behavioral, and economic sciences (SBE), we developed diverse vocabularies to represent SBE...
There are many case studies for which the formulation of RDF constraints and the validation of RDF data conforming to these constraint is very important. As a part of the collaboration with the W3C and the DCMI working groups on RDF validation, we identified major RDF validation requirements and initiated an RDF validation requirements database whi...
This document describes the RDF Application Profile case studies of the "DCMI RDF Application Profiles Task Force" (DCMI RDF-AP) in July 2015. It replaces the use case document from October 2014. The DCMI RDF-AP aims at defining best practices for documenting application profiles, requests for handling RDF application profiles and for RDF constrain...
This report supplements the Report on Use Cases these Use Cases. Requirements are derived from the use cases and specific case studies. See that report for the list of projects that submitted data for this study.
The full descriptions of case studies and use cases can be found in the task force wiki. Case studies and the corresponding use cases are...
In the context of the DCMI RDF Application Profile task group and the W3C Data Shapes Working Group solutions for the proper formulation of constraints and validation of RDF data on these constraints are being developed. Several approaches and constraint languages exist but there is no clear favorite and none of the languages is able to meet all re...
Statistical domain experts - core members of the DDI Alliance Technical Committee, representatives
of national statistical institutes and national data archives - and Linked Data community members
have spent four years to develop the DDI-RDF Discovery Vocabulary (Disco). In total, 26 persons from
23 organizations and 12 countries have contributed t...
An ontology of the DDI 3 data model will be designed by following the ontology engineering methodology to be evolved based on state-of-the-art methodologies. Hence DDI 3 data and metadata can be represented in form of a standard web interchange format RDF and processed by highly available RDF tools. As a consequence the DDI community has the possib...
For many RDF applications, the formulation of constraints and the automatic validation of data according to these constraints is a much sought-after feature. In 2013, the W3C invited experts from industry, government and academia to the RDF Validation Workshop, where first use cases have been presented and discussed. In collaboration with the W3C,...
Description Set Profiles (DSP) are used to formulate constraints on valid data within a Dublin Core Application Profile. For RDF, SPARQL is generally seen as the method of choice to validate data according to certain constraints, although it is not ideal for their formulation. In contrast, DSPs are comparatively easy to understand, but lack an impl...
Domain ontologies and XML Schemas serve to describe domain data models although they follow different modelling goals. By lifting the syntactic level of XML documents and validating XML Schemas to the semantic level of OWL ontologies and their RDF representations in an automatic way, all the information located in the XML Schemas of the domains can...
The Linked Data and the Social Science data communities developed the DDI-RDF Discovery Vocabulary, an ontology of the Data Documentation Initiative, in order to support the discovery of person-level data and its metadata. The Data Documentation Initiative (DDI) is an acknowledged international standard for the documentation and management of data...
The Data Documentation Initiative (DDI) is an acknowledged international standard for the documentation and management of data from the social, behavioral, and economic sciences. Statistical domain experts, i.e. representatives of national statistical institutes and national data archives, and Linked Open Data community members have developed the D...
The process designing domain ontologies from scratch is very time-consuming and is associated with a lot of effort. In the most cases, domain experts have defined XML Schemas, describing domain data models, before ontologies have been created. Our idea is to generate ontologies out of XML Schemas automatically using XSLT transformations in a first...
Designing domain ontologies from scratch is a time-consuming endeavor requiring a lot of close collaboration with domain experts. However, domain descriptions such as XML Schemas are often available in early stages of the ontology development process. For my dissertation, I propose a method to convert XML Schemas to OWL ontologies in an automatic w...
Experts from the statistical domain worked in close collaboration with ontology engineers to develop an ontology of a subset of the Data Documentation Initiative, an established international standard for the documentation and management of data from the social, behavioral, and economic sciences. Experts in the statistics domain formulated use case...
The DDI Alliance has initiated new work to build a model-based specification from which technical
bindings can be generated. This approach will carry many benefits in terms of communicating with other standards efforts and maintaining consistency. To launch this project, a group of experts met in Wadern, Germany, in October 2012, where different ap...
Designing domain ontologies from scratch is a time-consuming process. In many cases, both the terminologies and the syntactic structures of domain data models are already described in form of XML Schemas. XSLT transformations are used to lift the syntactic level of XML documents to the semantic level of OWL ontologies by mapping any XML Schemas to...
Designing an ontology for a specific domain is a time-consuming process. In many cases, information sources like XML Schemas serve as a basis for ontology engineers to conceptualize the intended ontologies. The ontology design process is sped up significantly when XML Schemas are transformed automatically into generated ontologies. An XML Schema Me...
An ontology of the DDI 3 data model will be designed by following the ontology engineering methodology to be evolved based on state-of-the-art methodologies. Hence DDI 3 data and metadata can be represented in form of a standard web interchange format RDF and processed by highly available RDF tools. As a consequence the DDI community has the possib...
Mit dieser Arbeit ist das Ziel verbunden, Anforderungen an Funktionalitäten und Tools zur
Förderung der Kreativität in Innovations-Communities zu identifizieren, ein Modell zur
Klassifizierung dieser Funktionalitäten und Softwarewerkzeuge zu entwickeln und auf diesem
Klassifikationsschema basierend Gestaltungsempfehlungen für Innovationsgemeinschaf...
The purpose of this research paper is to apply correspondence analysis to the underlying data set in order to examine how two different demographics sex and age relate to the color and the brand of cars.
The questions to be answered are the following:
1. Are there any interesting relationships?
2. How strong are these relationships?
3. How does th...
The task of the author of this research paper was to select a specific linear classification method to create a rule to classify new customers into two mutually exclusive clusters. With this model, you can predict, if a customer of a bank will pay back a loan or not. In order to create such a rule, the author chose the linear discriminant analysis....
The purpose of this research paper is to verify the hypothesis that the ANETT classification is based on logical criteria although these criteria are of subjective nature. To implement this, the author of this paper compared the outcomes of two different types of cluster analysis approaches using two variable sets with the result of the ANETT class...
The purpose of this research paper is to verify the hypothesis that the ANETT classification is based on logical criteria although these criteria are of subjective nature. To implement this, the author of this paper compared the outcomes of two different types of cluster analysis approaches using two variable sets with the result of the ANETT class...
Mit dieser Arbeit werden zwei komplementäre Zielsetzungen verfolgt. Das Primärziel soll
durch den pragmatischen, das sekundäre Ziel durch den wissenschaftlichen Teil der Arbeit
erreicht werden.
Der wissenschaftliche Abschnitt dient einerseits dazu, ein prinzipielles Verständnis über
SOAs zu vermitteln und andererseits, die Relevanz des Einsatzes e...