Philipp Zumstein's research while affiliated with Universität Mannheim and other places

Publications (6)

Chapter
A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data suppli...
Preprint
Full-text available
A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data suppli...
Article
Full-text available
This short paper presents preliminary considerations regarding LexBib, a corpus, bibliography, and domain ontology of Lexicography and Dictionary Research, which is currently being developed at University of Hildesheim. The LexBib project is intended to provide a bibliographic metadata collection made available through an online reference platform....
Preprint
Full-text available
12 This short paper presents preliminary considerations regarding LexBib, a corpus, bibliography, and 13 domain ontology of Lexicography and Dictionary Research, which is currently being developed 14 at University of Hildesheim. The LexBib project is intended to provide a bibliographic metadata 15 collection made available through an online referen...
Conference Paper
Full-text available
Citations play a crucial role in the scientific discourse, in information retrieval, and in bibliometrics. Many initiatives are currently promoting the idea of having free and open citation data. Creation of citation data, however, is not part of the cataloging workflow in libraries nowadays. In this paper, we present our project Linked Open Citati...

Citations

... The data model is implemented by means of the OpenCitations Ontology (OCO), 13 which is not yet another bibliographic ontology, but rather simply a mechanism for grouping together existing complementary ontological entities from several other ontologies, for the purpose of providing descriptive metadata all in one place. As introduced in [52], the OCDM has already been adopted by several projects in the scholarly domain for organising bibliographic information such as the Venice Scholar Index 14 [32], the Linked Open Citations Database (LOC-DB) 15 [47] and the EXCITE Project 16 [53]. ...
... All tasks are organized as supervised procedures for ML, using CRF as their learning mechanism. ParsCit uses Conditional Random Fields to mark the reference string token sequences, and it is combined with heuristic processing to classify reference strings from an unstructured text file, and to retrieve the citation contexts [39]. ParsCit starts by locating the references in the text using a collection of heuristics that consists of searching by a designated reference segment (e.g., "References," "Bibliography,") To do so the text is iteratively separated into strings that are likely to be labels in the reference segment. ...