Philipp Zumstein's research while affiliated with Universität Mannheim and other places

Publications (8)

Chapter
A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data suppli...
Preprint
A variety of schemas and ontologies are currently used for the machine-readable description of bibliographic entities and citations. This diversity, and the reuse of the same ontology terms with different nuances, generates inconsistencies in data. Adoption of a single data model would facilitate data integration tasks regardless of the data suppli...
Article
This short paper presents preliminary considerations regarding LexBib, a corpus, bibliography, and domain ontology of Lexicography and Dictionary Research, which is currently being developed at University of Hildesheim. The LexBib project is intended to provide a bibliographic metadata collection made available through an online reference platform....
Preprint
This short paper presents preliminary considerations regarding LexBib, a corpus, bibliography, and domain ontology of Lexicography and Dictionary Research, which is currently being developed at University of Hildesheim. The LexBib project is intended to provide a bibliographic metadata collection made available through an online referen...
Conference Paper
Citations play a crucial role in the scientific discourse, in information retrieval, and in bibliometrics. Many initiatives are currently promoting the idea of having free and open citation data. Creation of citation data, however, is not part of the cataloging workflow in libraries nowadays. In this paper, we present our project Linked Open Citati...
Article
Approaches to improving automatic text recognition (OCR) in digital collections, particularly through computational linguistics methods, are described, and existing post-OCR methods are analyzed. In contrast to these approaches from research or from individual projects, the current application of OCR in the Bi...
Article
Reference management programs have developed into powerful, everyday companions for scholarly work and, more generally, for personal knowledge organization. A central function of these tools has always been the import of bibliographic data from databases, in particular from electronic Bi...

Citations

... • Domain ontologies: SWAN Ontology for Neuromedicine (Ciccarese et al., 2008), Provenir Ontology for eScience (Sahoo & Sheth, 2009), and PREMIS for archived digital objects (Caplan, 2017). • Provenance-related ontologies: Dublin Core Metadata Terms (Board, 2020), and the OpenCitations Data Model (Daquino, Peroni, et al., 2020). Table 1 compares the metadata representation models mentioned above. ...
... In order to accomplish this, a Fully Convolutional Neural Network (FCN) was used (Long et al. 2015) to segment the references and then post-processed in order to identify individual references. A layout-based citation detection method was incorporated into Lauscher et al. (2018) to build an open database of citations for libraries for indexing purposes. To detect bibliographic references in scientific publications, Rizvi et al. (2019) evaluated four state-of-the-art object detection models based on layout information. ...
... Notably, however, data papers are becoming more and more common and hereby blur our distinction. Literature management understood in this way encompasses the entire research process ranging from reading background information up to the automatic generation of the reference list while authoring publications, while always containing a significant part personal to the individual researcher, e.g., [53]. It should therefore be considered as an omnipresent, integral part of Research Data Management as we define it for the purpose of this paper. ...