Applying Eective Data Modelling Approaches for the
Creation of a Participatory Archive Platform
JULIEN ANTOINE RAEMY,University of Basel, Switzerland
The development of a participatory archive platform such as the one being carried out for the PIA research project requires a flexible infrastructure allowing genuine data curation and a robust underlying data model. A strong assumption to achieve this is to primarily leverage Linked Open Usable Data (LOUD) standards, such as IIIF, Linked Art or the Web Annotation Data Model, which help in the dissemination and reuse of cultural heritage resources as well as contributing to making digital humanities initiatives more sustainable.
CCS Concepts: • Theory of computation → Data modeling; • Information systems → Semantic web description languages; RESTful web services; Ontologies; Thesauri; Digital libraries and archives; • Software and its engineering → Interoperability.
Additional Key Words and Phrases: Citizen Science, Cultural Heritage, Digital Sustainability, International
Image Interoperability Framework (IIIF), Linked Art, Linked Open Usable Data (LOUD), Schema.org
1 INTRODUCTION
The digitisation of cultural heritage resources by memory organisations, such as libraries, archives and museums, has greatly benefited the research community, enabling researchers to answer previously elusive research questions, to raise entirely new ones, and to engage in critical reflection on the consequences of digital transformation.
While cultural heritage resources often have the advantage of being in the public domain or licensed for digital processing and publication, long-term access (which is ultimately the purpose of preservation) to and reuse of these outputs are generally not guaranteed because the infrastructures to ensure this type of process are lacking [10]. Although there are many institutional or field-specific repositories, generally only a part of the research data is preserved, usually bound to a Data Management Plan (DMP), if not frozen.
The FAIR1 Principles [23] can help digital humanities to achieve some measure of digital sustainability and to keep these data principles as a common thread. However, this is not enough to achieve the flexibility that an infrastructure and the underlying data model should have with regard to sustainability.
One of the strategies for ensuring the constant maintenance of research data in digital humanities is to leverage the architecture of the Web [9], specifically using Linked Open Data (LOD) approaches and ontologies such as CIDOC-CRM [4, 12].
This paper looks at how the human dimensions around the project titled Participatory Knowledge Practices in Analogue and Digital Image Archives (PIA) can impact data modelling and identifies the requirements for developing a participatory archive platform where web-based standards play a key role in the dissemination and re-usability of knowledge [20].
2 PARTICIPATORY KNOWLEDGE PRACTICES IN ANALOGUE AND DIGITAL IMAGE
ARCHIVES (PIA)
PIA is a four-year interdisciplinary research project, running from February 2021 to January 2025, funded by the Swiss National Science Foundation (SNSF)2. The project is conducted by the Institute for Cultural Anthropology and European Ethnology and the Digital Humanities Lab of the University of Basel as well as the Bern University of the Arts.

1 Findable, Accessible, Interoperable, Reusable (FAIR)
2 SNSF. Participatory Knowledge Practices in Analogue and Digital Image Archives. https://p3.snf.ch/Project-193788

Author's address: Julien Antoine Raemy, julien.raemy@unibas.ch, University of Basel, Digital Humanities Lab, Spalenberg 65, CH-4051, Basel, Switzerland.
The purpose of PIA is to design a visual interface with machine learning-based tools to make it easy to annotate, contextualise, organise, and link both images and their metadata, in order to deliberately encourage the participatory use of archives. It focuses on three very diverse photographic collections of the Swiss Society for Folklore Studies (SSFS)3, namely:
- SGV_05: The Atlas of Swiss Folklore consists of 256 maps and 1000 pages of commentary. This collection, which has not yet been digitised, was commissioned by the SSFS to carry out an extensive survey of the Swiss population in the 1930s and 1940s on many issues pertaining, for instance, to everyday life, local laws, superstitions, celebrations or labour;
- SGV_10: Kreis Family comprises approximately 20,000 loose photographic objects, mostly kept in photo albums, from a wealthy Basel-based family and spanning from the 1850s to the 1970s. The pictures were taken by studio photographers as well as by family members themselves;
- SGV_12: Ernst Brunner is a donation of about 48,000 negatives and 20,000 prints to the SSFS from a professional photographer who lived from 1901 to 1979 and who documented, mainly in the 1930s and 1940s, a wide range of folkloristic themes.

3 The photo archive of the SSFS can be consulted online at https://archiv.sgv-sstp.ch/
The fact that part of these collections still has to be digitised alongside the project provides an opportunity to investigate the digital transformation processes, the generation of digital surrogates and their associated metadata, as well as how the underlying materiality aspects may or may not be preserved.
More specically, there are two main goals within PIA: the rst having a theoretical focus on the
systematic analysis and description of the analogue and digital photo archives and the second an
implementation focus related to the design of a participatory image archive.
PIA is as much about traditional crowdsourcing as it is about creating a modern Citizen Science
platform that enables new uses and helps to streamline the research process of scholars. For this
purpose, a user interface and various application programming interfaces (APIs) will be deployed
to accommodate various forms of re-use by third parties [17].
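As an illustration of such API-based re-use, a third party could request image derivatives through the IIIF Image API 3.0 once the collections are exposed that way (see Section 3). The Python sketch below is only indicative: the base URL is a placeholder and not an actual PIA endpoint, and the identifier is borrowed from Figure 1 purely for illustration.

```python
# Minimal sketch (not the actual PIA endpoint) of building IIIF Image API 3.0
# request URLs, whose pattern is:
# {scheme}://{server}{/prefix}/{identifier}/{region}/{size}/{rotation}/{quality}.{format}
IIIF_IMAGE_BASE = "https://iiif.example.org/iiif/3"  # placeholder host and prefix


def iiif_image_url(identifier: str, region: str = "full", size: str = "max",
                   rotation: str = "0", quality: str = "default",
                   fmt: str = "jpg") -> str:
    """Assemble a IIIF Image API 3.0 URL for a given image identifier."""
    return f"{IIIF_IMAGE_BASE}/{identifier}/{region}/{size}/{rotation}/{quality}.{fmt}"


# A 600-pixel-wide JPEG of the whole image, e.g. for a thumbnail gallery.
print(iiif_image_url("SGV_12N_19553", size="600,"))
```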
3 IDENTIFIED REQUIREMENTS FOR DEPLOYING A CITIZEN SCIENCE
INFRASTRUCTURE
Substantial preliminary work, particularly with regard to data cleaning and reconciliation, needs to be carried out prior to truly deploying an infrastructure that ensures content re-usability, whether by the scientific community or for the wider public [16]. Notably, one of the intentions of PIA is to adopt metadata standards and web-based specifications that are well established within the cultural heritage field [13, 15] in the interests of providing a high level of interoperability. To this end, the following list of requirements influencing the data modelling and the overarching architecture of the infrastructure has been identified:
- Management of existing (non-controlled) keywords and the forthcoming folksonomy;
- Creation of controlled vocabularies using the Simple Knowledge Organization System (SKOS) for all PIA-related collections where dedicated terminology can complement existing LOD taxonomies and thesauri, such as the Getty Vocabularies;
- Metadata correction and enrichment through crowdsourcing;
- Validation of metadata created by Machine Learning methods (notably Object Detection and Visual Text Co-Embedding) through the content-based multimedia retrieval system vitrivr [8];
- Annotation and transcription of digital surrogates and born-digital content that are compatible with the W3C Web Annotation Data Model (see the sketch after this list);
- Reconciliation of entities (e.g. agent, concept, place) on the basis of the existing data model and the user-generated metadata;
- Making digitised collections of the SSFS as well as resources uploaded by end users compliant with the Image and Presentation APIs 3.0 of the International Image Interoperability Framework (IIIF) [1, 2, 21];
- Leveraging the IIIF Change Discovery API 1.0, which is an Activity Streams endpoint, to notify amendments to IIIF resources and make them more easily discoverable in a machine-readable manner [3, 7] (a consumption sketch follows at the end of this section).
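As a non-authoritative sketch of the annotation requirement above, the following Python snippet assembles a W3C Web Annotation that comments on a region of a IIIF Presentation 3.0 Canvas. All identifiers are hypothetical placeholders; PIA's actual annotation and Canvas IRIs will differ.

```python
import json


def build_annotation(anno_id: str, canvas_id: str, comment: str, xywh: str) -> dict:
    """Return a W3C Web Annotation commenting on a rectangular region of a IIIF Canvas."""
    return {
        "@context": "http://www.w3.org/ns/anno.jsonld",
        "id": anno_id,
        "type": "Annotation",
        "motivation": "commenting",
        "body": {
            "type": "TextualBody",
            "value": comment,
            "format": "text/plain",
            "language": "en",
        },
        "target": {
            "source": canvas_id,  # a Canvas within a IIIF Presentation 3.0 Manifest
            "selector": {
                "type": "FragmentSelector",
                "conformsTo": "http://www.w3.org/TR/media-frags/",
                "value": f"xywh={xywh}",
            },
        },
    }


# Placeholder identifiers, for illustration only.
annotation = build_annotation(
    anno_id="https://example.org/annotations/1",
    canvas_id="https://example.org/iiif/SGV_12N_19553/canvas/p1",
    comment="A cat sitting on a wall",
    xywh="120,80,640,480",
)
print(json.dumps(annotation, indent=2))
```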
The identication of these requirements provides the framework for establishing an infrastructure
that could be in line with the FAIR principles. Furthermore, one of the cornerstones of the project
is also the synchronisation of the enriched data between the PIA infrastructure and the Data &
Service Center for the Humanities (DaSCH), which is in charge of the long-term preservation of
the SSFS (digitised or digital-born) photographs and their associated metadata.
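To illustrate how such change notifications could be consumed, whether by aggregators or by a synchronisation process such as the one envisaged with DaSCH, the sketch below polls a IIIF Change Discovery 1.0 stream (an Activity Streams OrderedCollection) and collects the IRIs of recently changed resources. The endpoint URL is an assumption, not an actual PIA or DaSCH service, and error handling is omitted.

```python
import requests


def latest_changes(collection_uri: str, limit: int = 20) -> list:
    """Read the most recent activities from a IIIF Change Discovery 1.0 stream
    and return the IRIs of the affected resources (newest first)."""
    collection = requests.get(collection_uri, timeout=30).json()
    # The top-level OrderedCollection points to its most recent page via "last";
    # a full client would also follow the page's "prev" links further back.
    page = requests.get(collection["last"]["id"], timeout=30).json()
    changed = []
    # Activities on a page are ordered oldest-first, so iterate in reverse.
    for activity in reversed(page.get("orderedItems", [])):
        if activity.get("type") in ("Create", "Update", "Delete"):
            changed.append(activity["object"]["id"])
        if len(changed) >= limit:
            break
    return changed


# Hypothetical endpoint, for illustration only.
# print(latest_changes("https://iiif.example.org/activity-streams/all-changes"))
```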
4 THE PIA DATA MODEL
In the context of the database migration from Salsah to the DaSCH Service Platform (DSP) and to anticipate the needs of the PIA project, it was decided to add new classes and properties within the RDF-based SSFS ontology, to rename some of them following the CamelCase practice, to delete some unused or rarely used properties, as well as to identify a common denominator with the PIA data model4, which is still a work in progress [6]. We believe that the best candidate for this mapping, which will be done using the rdfs:subClassOf and rdfs:subPropertyOf predicates, is Schema.org, a vocabulary providing a very extensive number of classes and properties as well as having the added benefit of being easily indexed by search engines like Google, Bing or Yandex [22].

4 The PIA Data Model GitHub repository is available at: https://github.com/Participatory-Image-Archives/pia-data-model/
Fig. 1. Partial Representation in Linked Art of [Katze auf einer Mauer], SGV_12N_19553, https://archiv.sgv-sstp.ch/resource/441788 - object that will have a IIIF Image API 3.0 digital service on the PIA platform
While Schema.org will be used within the PIA project, notably by embedding a JSON-LD serialisation within the resources' web pages, we also want to provide a representation of the activities surrounding the collections with a focus on their provenance. Thus, Linked Art, which is both a community-based effort and an application profile based on CIDOC-CRM that also provides an API in JSON-LD [5], will be deployed.
Another advantage of Linked Art is that we can integrate other digital services, such as IIIF (see Figure 1, which is based on the Linked Art Digital Integration component). As announced in the previous section, we also want to make the annotations compatible with the Web Annotation Data Model, which can be easily used in conjunction with the IIIF APIs. The three specifications (Linked Art, IIIF, Web Annotation Data Model) can be leveraged seamlessly in unison and all enable published data to be easily consumed by both machines and humans. Moreover, these technologies adhere to the design principles of Linked Open Usable Data (LOUD), a term coined by Robert Sanderson, who has been working on these specifications and who seeks to extend the principles of LOD [19].
Last but not least, another aspect of the PIA Data Model will be to address specific needs of the SGV_05, SGV_10 or SGV_12 collections in terms of controlled vocabularies. The SGV_12 Ernst Brunner collection is of particular interest since the photographer had decided to develop his own terminology to structure and classify his photographs. These terms have so far not been included in the metadata. A thesaurus in SKOS has therefore been created and published via SkoHub-Vocabs [14]. It will be refined over the next few months and the items will be connected to this classification for research purposes.
5 CONCLUSION
A exible data model in digital humanities projects requires seeing the Web as a technology on
which other building blocks can be developed (e.g. RESTful APIs) and preferably on LOD for
enabling inferences, and even better moving towards specications that conform to the LOUD
design principles for publishing data that is truly usable and which will consequently improve the
accessibility and sustainability of digital cultural heritage resources.
The main limitation of deploying and maintaining an infrastructure with LOUD standards remains that organisations need to define access persistence policies, notably by assigning cool URIs or persistent identifiers (PIDs) that could at least resolve to alternative web pages containing the associated metadata should the resources disappear [11, 18].
ACKNOWLEDGMENTS
PIA is made possible thanks to an SNSF Sinergia grant and I would like to thank all my colleagues working on this research project, especially my supervisors (Peter Fornaro and Walter Leimgruber), the five other fellow PhD candidates (Murielle Cornut, Birgit Huber, Fabienne Lüthi, Florian Spiess and Max Frischknecht) and our software developer (Adrian Demleitner).
Many thanks as well to Yumeng Hou, Doctoral Assistant at the Laboratory for Experimental
Museology of the Ecole Polytechnique Fédérale de Lausanne (EPFL), for organising the PhD seminar
on Human Factors in Digital Humanities both on site and online on December 2-3, 2021.
REFERENCES
[1] Michael Appleby, Tom Crane, Robert Sanderson, Jon Stroop, and Simeon Warner. 2020. IIIF Image API 3.0. https://iiif.io/api/image/3.0/
[2] Michael Appleby, Tom Crane, Robert Sanderson, Jon Stroop, and Simeon Warner. 2020. IIIF Presentation API 3.0. https://iiif.io/api/presentation/3.0/
[3] Michael Appleby, Tom Crane, Robert Sanderson, Jon Stroop, and Simeon Warner. 2021. IIIF Change Discovery API 1.0.0. https://iiif.io/api/discovery/1.0/
[4] George Bruseker, Nicola Carboni, and Anaïs Guillem. 2017. Cultural Heritage Data Management: The Role of Formal Ontology and CIDOC CRM. In Heritage and Archaeology in the Digital Age: Acquisition, Curation, and Dissemination of Spatial Cultural Heritage Data, Matthew L. Vincent, Víctor Manuel López-Menchero Bendicho, Marinos Ioannides, and Thomas E. Levy (Eds.). Springer International Publishing, Cham, 93–131. https://doi.org/10.1007/978-3-319-65370-9_6
[5] Emmanuelle Delmas-Glass and Robert Sanderson. 2020. Fostering a community of PHAROS scholars through the adoption of open standards. Art Libraries Journal 45, 1 (Jan. 2020), 19–23. https://doi.org/10.1017/alj.2019.32 Publisher: Cambridge University Press.
[6] Adrian Demleitner and Julien Antoine Raemy. 2021. PIA Data Model. https://doi.org/10.5281/zenodo.5142605
[7] Nuno Freire, Enno Meijers, Sjors de Valk, Julien A. Raemy, and Antoine Isaac. 2021. Metadata Aggregation via Linked Data: Results of the Europeana Common Culture Project. In Metadata and Semantic Research (Communications in Computer and Information Science), Emmanouel Garoufallou and María-Antonia Ovalle-Perandones (Eds.). Springer International Publishing, Cham, 383–394. https://doi.org/10.1007/978-3-030-71903-6_35
[8] Ralph Gasser, Luca Rossetto, and Heiko Schuldt. 2019. Multimodal Multimedia Retrieval with vitrivr. In Proceedings of the 2019 on International Conference on Multimedia Retrieval (ICMR '19). Association for Computing Machinery, New York, NY, USA, 391–394. https://doi.org/10.1145/3323873.3326921
[9] Ian Jacobs and Norman Walsh. 2004. Architecture of the World Wide Web, Volume One. https://www.w3.org/TR/webarch/
[10] André Kilchenmann, Flavie Laurens, and Lukas Rosenthaler. 2019. Digitizing, archiving... and then? Ideas about the usability of a digital archive. Archiving Conference 2019, 1 (May 2019), 146–150. https://doi.org/10.2352/issn.2168-3204.2019.1.0.34
[11] Lukas Koster. 2020. Persistent identifiers for heritage objects. The Code4Lib Journal 47 (Feb. 2020). https://journal.code4lib.org/articles/14978
[12] Florian Kräutli, Esther Chen, and Matteo Valleriani. 2021. Linked data strategies for conserving digital research outputs: The shelf life of digital humanities. In Information and Knowledge Organisation in Digital Humanities. Routledge, London. https://www.taylorfrancis.com/chapters/oa-edit/10.4324/9781003131816-10/
[13] Marian Clemens Manz, Julien Antoine Raemy, Béatrice Gauvain, and Vera Chiquet. 2021. Let's talk about standards – A write-up of a discussion on metadata standardization in the Digital Humanities. https://dh-ch.ch/blog/posts/let's-talk-about-standards-a-write-up-of-a-discussion-on-metadata-standardization-in-the-digital-humanities.html
[14] Adrian Pohl and Adrian Ostrowski. 2019. Presenting the SkoHub Vocabs Prototype. https://blog.skohub.io/2019-09-27-skohub-vocabs/
[15] Julien Antoine Raemy. 2020. Enabling better aggregation and discovery of cultural heritage content for Europeana and its partner institutions. Master's thesis. HES-SO University of Applied Sciences and Arts, Haute école de gestion de Genève, Geneva, Switzerland. https://doc.rero.ch/record/329698
[16] Julien Antoine Raemy. 2021. Data modelling and Citizen Science: impact of user-generated content within the PIA research project. https://doi.org/10.5281/zenodo.5763545
[17] Julien Antoine Raemy. 2021. The International Image Interoperability Framework (IIIF) APIs as the backbone of scientific and participatory research. https://doi.org/10.5281/zenodo.4890821
[18] Julien A. Raemy and René Schneider. 2019. Suggested measures for deploying IIIF in Swiss cultural heritage institutions. White paper. HES-SO University of Applied Sciences and Arts, Haute école de gestion de Genève, Geneva, Switzerland. https://doi.org/10.5281/zenodo.2640416
[19] Robert Sanderson. 2020. The Importance of being LOUD. https://www.slideshare.net/azaroth42/the-importance-of-being-loud
[20] Tobias Schweizer, Lukas Rosenthaler, and Peter Fornaro. 2017. Content-based Interoperability: Beyond Technical Specifications of Interfaces. Archiving Conference 2017, 1 (May 2017), 34–38. https://doi.org/10.2352/issn.2168-3204.2017.1.0.34
[21] Stuart Snydman, Robert Sanderson, and Tom Cramer. 2015. The International Image Interoperability Framework (IIIF): A community & technology approach for web-based images. In Archiving Conference, Vol. 2015. IS&T, Los Angeles, CA, 16–21. https://purl.stanford.edu/df650pk4327
[22] Richard Wallis, Antoine Isaac, Valentine Charles, and Hugo Manguinhas. 2017. Recommendations for the application of Schema.org to aggregated Cultural Heritage metadata to increase relevance and visibility to search engines: the case of Europeana. The Code4Lib Journal 36 (April 2017). https://journal.code4lib.org/articles/12330
[23] Mark D. Wilkinson et al. 2016. The FAIR Guiding Principles for scientific data management and stewardship. Scientific Data 3 (March 2016), 160018. https://doi.org/10.1038/sdata.2016.18