Thomas Hartmann

Thomas Hartmann
Bosch · Center of Competence Big Data

Dr.-Ing.

About

39
Publications
8,022
Reads
How we measure 'reads'
A 'read' is counted each time someone views a publication summary (such as the title, abstract, and list of authors), clicks on a figure, or views or downloads the full-text. Learn more
204
Citations

Publications

Publications (39)
Article
For research institutes, data libraries, and data archives, validating RDF data according to predefined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in two international working groups on RDF validation and jointly identified requirements to formulate constraints and valid...
Thesis
Full-text available
The formulation of constraints and the validation of RDF data against these constraints is a common requirement and a much sought-after feature, particularly as this is taken for granted in the XML world. Recently, RDF validation as a research field gained speed due to shared needs of data practitioners from a variety of domains. For constraint for...
Conference Paper
For research institutes, data libraries, and data archives, RDF data validation according to predefined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in the DCMI RDF Application Profiles Task Group and in cooperation with the W3C Data Shapes Working Group, we identified and...
Technical Report
Physical data description (PHDD) of existing or published data (tables) in a rectangular format. The data could be either represented in records with character-separated values (CSV) or in records with fixed length. PHDD could be used standalone or together with related vocabularies like Data Catalog Vocabulary (DCAT) or DDI-RDF Discovery. Descript...
Technical Report
This specification defines the DDI-RDF Discovery Vocabulary (Disco), an RDF Schema vocabulary that enables discovery of research and survey data on the Web. It is based on DDI (Data Documentation Initiative) XML formats.
Book
Full-text available
This paper serves as appendix for the PhD thesis entitled 'Validation Framework for RDF-based Constraint Languages', submitted to the Department of Economics and Management at the Karlsruhe Institute of Technology (KIT).
Raw Data
The provided research data, research results, and publications form the basis for the PhD thesis entitled ’Validation Framework for RDF-based Constraint Languages’, submitted to the Department of Economics and Management at the Karlsruhe Institute of Technology (KIT).
Article
Full-text available
Ontology engineers worked in close collaboration with experts from the statistical domain in order to develop an ontology of a subset of the Data Documentation Initiative. In this paper, we give a brief overview of the DDI ontology’s current status and discuss in detail the most significant use cases associated with the DDI data model’s ontology an...
Article
Full-text available
Ontology engineers and experts from the social, behavioral, and economic sciences developed a data discovery ontology covering a subset of both the DDI Codebook and Lifecycle models, and implemented a rendering of DDI XML instances to RDF (Resource Description Framework). The main goals associated with the design process of the DDI ontology were to...
Article
Full-text available
In recent years, Semantic Web technologies have matured and have made their way into various domains – for example, bioinformatics and eGovernment -- where they are used in different applications to provide value-added services for users. In this paper, we present an overview of several representative applications that use Semantic Web technologies...
Article
Full-text available
The GESIS Data Catalogue contains the study descriptions for all archived studies at GESIS, currently more than 5000 datasets mainly from survey research in the social sciences. These descriptions include information about primary researchers, research topics and objects, used methods, and the resulting dataset, which is mainly used for archiving a...
Conference Paper
Full-text available
For data practitioners embracing the world of RDF and Linked Data, the openness and flexibility is a mixed blessing. For them, data validation according to predefined constraints is a much sought-after feature, particularly as this is taken for granted in the XML world. Based on our work in the DCMI RDF Application Profiles Task Group and in cooper...
Technical Report
Full-text available
From 2012 to 2015 together with other Linked Data community members and experts from the social, behavioral, and economic sciences (SBE), we developed diverse vocabularies to represent SBE metadata and tabular data in RDF. The DDI-RDF Discovery Vocabulary (DDI-RDF) is designed to support the dissemination, management, and reuse of unit-record data,...
Technical Report
Full-text available
To ensure high quality of and trust in both metadata and data, their representation in RDF must satisfy certain criteria - specified in terms of RDF constraints. From 2012 to 2015 together with other Linked Data community members and experts from the social, behavioral, and economic sciences (SBE), we developed diverse vocabularies to represent SBE...
Technical Report
Full-text available
There are many case studies for which the formulation of RDF constraints and the validation of RDF data conforming to these constraint is very important. As a part of the collaboration with the W3C and the DCMI working groups on RDF validation, we identified major RDF validation requirements and initiated an RDF validation requirements database whi...
Technical Report
This document describes the RDF Application Profile case studies of the "DCMI RDF Application Profiles Task Force" (DCMI RDF-AP) in July 2015. It replaces the use case document from October 2014. The DCMI RDF-AP aims at defining best practices for documenting application profiles, requests for handling RDF application profiles and for RDF constrain...
Technical Report
This report supplements the Report on Use Cases these Use Cases. Requirements are derived from the use cases and specific case studies. See that report for the list of projects that submitted data for this study. The full descriptions of case studies and use cases can be found in the task force wiki. Case studies and the corresponding use cases are...
Conference Paper
Full-text available
In the context of the DCMI RDF Application Profile task group and the W3C Data Shapes Working Group solutions for the proper formulation of constraints and validation of RDF data on these constraints are being developed. Several approaches and constraint languages exist but there is no clear favorite and none of the languages is able to meet all re...
Technical Report
Full-text available
Statistical domain experts - core members of the DDI Alliance Technical Committee, representatives of national statistical institutes and national data archives - and Linked Data community members have spent four years to develop the DDI-RDF Discovery Vocabulary (Disco). In total, 26 persons from 23 organizations and 12 countries have contributed t...
Technical Report
Full-text available
An ontology of the DDI 3 data model will be designed by following the ontology engineering methodology to be evolved based on state-of-the-art methodologies. Hence DDI 3 data and metadata can be represented in form of a standard web interchange format RDF and processed by highly available RDF tools. As a consequence the DDI community has the possib...
Conference Paper
Full-text available
For many RDF applications, the formulation of constraints and the automatic validation of data according to these constraints is a much sought-after feature. In 2013, the W3C invited experts from industry, government and academia to the RDF Validation Workshop, where first use cases have been presented and discussed. In collaboration with the W3C,...
Conference Paper
Full-text available
Description Set Profiles (DSP) are used to formulate constraints on valid data within a Dublin Core Application Profile. For RDF, SPARQL is generally seen as the method of choice to validate data according to certain constraints, although it is not ideal for their formulation. In contrast, DSPs are comparatively easy to understand, but lack an impl...
Article
Full-text available
Domain ontologies and XML Schemas serve to describe domain data models although they follow different modelling goals. By lifting the syntactic level of XML documents and validating XML Schemas to the semantic level of OWL ontologies and their RDF representations in an automatic way, all the information located in the XML Schemas of the domains can...
Conference Paper
Full-text available
The Linked Data and the Social Science data communities developed the DDI-RDF Discovery Vocabulary, an ontology of the Data Documentation Initiative, in order to support the discovery of person-level data and its metadata. The Data Documentation Initiative (DDI) is an acknowledged international standard for the documentation and management of data...
Conference Paper
Full-text available
The Data Documentation Initiative (DDI) is an acknowledged international standard for the documentation and management of data from the social, behavioral, and economic sciences. Statistical domain experts, i.e. representatives of national statistical institutes and national data archives, and Linked Open Data community members have developed the D...
Technical Report
Full-text available
The process designing domain ontologies from scratch is very time-consuming and is associated with a lot of effort. In the most cases, domain experts have defined XML Schemas, describing domain data models, before ontologies have been created. Our idea is to generate ontologies out of XML Schemas automatically using XSLT transformations in a first...
Conference Paper
Full-text available
Designing domain ontologies from scratch is a time-consuming endeavor requiring a lot of close collaboration with domain experts. However, domain descriptions such as XML Schemas are often available in early stages of the ontology development process. For my dissertation, I propose a method to convert XML Schemas to OWL ontologies in an automatic w...
Conference Paper
Experts from the statistical domain worked in close collaboration with ontology engineers to develop an ontology of a subset of the Data Documentation Initiative, an established international standard for the documentation and management of data from the social, behavioral, and economic sciences. Experts in the statistics domain formulated use case...
Technical Report
Full-text available
The DDI Alliance has initiated new work to build a model-based specification from which technical bindings can be generated. This approach will carry many benefits in terms of communicating with other standards efforts and maintaining consistency. To launch this project, a group of experts met in Wadern, Germany, in October 2012, where different ap...
Conference Paper
Full-text available
Designing domain ontologies from scratch is a time-consuming process. In many cases, both the terminologies and the syntactic structures of domain data models are already described in form of XML Schemas. XSLT transformations are used to lift the syntactic level of XML documents to the semantic level of OWL ontologies by mapping any XML Schemas to...
Conference Paper
Full-text available
Designing an ontology for a specific domain is a time-consuming process. In many cases, information sources like XML Schemas serve as a basis for ontology engineers to conceptualize the intended ontologies. The ontology design process is sped up significantly when XML Schemas are transformed automatically into generated ontologies. An XML Schema Me...
Conference Paper
Full-text available
An ontology of the DDI 3 data model will be designed by following the ontology engineering methodology to be evolved based on state-of-the-art methodologies. Hence DDI 3 data and metadata can be represented in form of a standard web interchange format RDF and processed by highly available RDF tools. As a consequence the DDI community has the possib...
Thesis
Full-text available
Mit dieser Arbeit ist das Ziel verbunden, Anforderungen an Funktionalitäten und Tools zur Förderung der Kreativität in Innovations-Communities zu identifizieren, ein Modell zur Klassifizierung dieser Funktionalitäten und Softwarewerkzeuge zu entwickeln und auf diesem Klassifikationsschema basierend Gestaltungsempfehlungen für Innovationsgemeinschaf...
Technical Report
Full-text available
The purpose of this research paper is to apply correspondence analysis to the underlying data set in order to examine how two different demographics sex and age relate to the color and the brand of cars. The questions to be answered are the following: 1. Are there any interesting relationships? 2. How strong are these relationships? 3. How does th...
Technical Report
Full-text available
The task of the author of this research paper was to select a specific linear classification method to create a rule to classify new customers into two mutually exclusive clusters. With this model, you can predict, if a customer of a bank will pay back a loan or not. In order to create such a rule, the author chose the linear discriminant analysis....
Technical Report
Full-text available
The purpose of this research paper is to verify the hypothesis that the ANETT classification is based on logical criteria although these criteria are of subjective nature. To implement this, the author of this paper compared the outcomes of two different types of cluster analysis approaches using two variable sets with the result of the ANETT class...
Technical Report
Full-text available
The purpose of this research paper is to verify the hypothesis that the ANETT classification is based on logical criteria although these criteria are of subjective nature. To implement this, the author of this paper compared the outcomes of two different types of cluster analysis approaches using two variable sets with the result of the ANETT class...
Thesis
Full-text available
Mit dieser Arbeit werden zwei komplementäre Zielsetzungen verfolgt. Das Primärziel soll durch den pragmatischen, das sekundäre Ziel durch den wissenschaftlichen Teil der Arbeit erreicht werden. Der wissenschaftliche Abschnitt dient einerseits dazu, ein prinzipielles Verständnis über SOAs zu vermitteln und andererseits, die Relevanz des Einsatzes e...

Network

Cited By