ArticlePDF Available
Application of linked data technologies in digital
libraries: a review of literature
Zahid Raza, Khalid Mahmood and Nosheen Fatima Warraich
Introduction
Digital libraries are on the way to
link their data on the Web to serve their
users. Unfortunately, digital libraries
are lagging behind to publish their
quality resources on the Web for wider
community usage. Many libraries are
succeeding in efforts to introduce
modern Web standards in the form of
Semantic Web using linked data
technologies. Web 1.0 had information
characteristics, Web 2.0 was interactive
and now Web 3.0 provides a machine
readable Web, or the Web of Data, or a
Semantic Web. It allows machines to
better understand, interpret and reuse
the meaning of the content just like
humans. Semantic Web has been
created using structured data that allow
computers to understand and interpret
the relationships between entities of
data. Semantic evolution of Web was
also due to the extensive usage of linked
data technologies (Byrne and Goddard,
2010).
Haslhofer and Isaac (2011) described
that linked data was a data publishing
technique used on the World Wide Web
to connect related data for the purpose
of accessibility on the Web. Linked data
are based on hypertext transfer protocol
(HTTP), uniform resource identifiers
(URI) and using Semantic Web
standards such as the resource
description framework (RDF) which
allows related data to be connected with
each other and with other related
resources as well. Guerrini (2013)
described that “linked data is a set of
best practices required for publishing
and connecting structured data on the
Web for use by a machine”. Machine
will be capable just like humans to read,
interpret and reuse information using
URIs on the Web to facilitate access.
Linked Open Data (LOD) integrates
open data sets on the Web for wider
usage of the community.
The concept of linked data is closely
relatedtotheSemanticWeb,andlinked
data are considered as a brain child of the
Sematic Web. Berners-Lee et al. (2001)
described the Semantic Web as: “an
extension of the current Web in which
information is given well-defined
meaning, better enabling computers and
people to work in cooperation”. The basic
principle of the Semantic Web is to enable
machines to understand and interpret data
on the Web which can be shared and
processed by automatic tools as well as
humans. Machines will not only be able to
display data but also be able to use data for
integration and reuse it for various
applications (Sure and Studer, 2005).
Berners-Lee et al. (2001) described
four rules for the creation of linked data
on the Web:
1. use URIs as names for things;
2. use HTTP URIs so that people
can look up those names;
3. when someone looks up a URI,
provide useful information using
the standards (RDF, SPARQL);
and
4. include links to other URIs so that
they can discover more things.
Guerrini (2013) described that five star
principles of Tim Berners-Lee should
be opted to expose the data on the Web.
These five stars principles describe the
ascending order of publishing data on
the Web, and the last star was the
ultimate aim of open linked data. These
principles are:
1. one’s own data being available
on the Web (in whatever format);
2. the material put on the Web is
available as structured data (e.g.
in excel instead of as a scanned
image of a table);
3. having chosen nonproprietary
formats (e.g., in csw\instead of
excel);
4. having used URL to identify the
objects, so that users can point to
these objects; and
5. one’s own data are linked to data
produced by others so as to define
a context.
This review article presents several key
issues for policy makers, software
developers, decision makers and library
administrators about linked data
technologies and its implementations in
digital libraries. Search engines will
also achieve their longstanding goal to
exploit the quality resources of the
libraries for users to make their Web
appearance more credible and
trustworthy.
Change in the digital library
collections in linked data
environment
Traditionally libraries were working
as storehouses of books, journals,
manuscripts, audiovisual material and
other sources of information. On the
emergence of digital and Information
communication technologies (ICT),
libraries also became information
providers without the requirement of
buildings. Digital library and virtual
libraries were the advanced forms of
library services for remote users
(Encyclopedia Britannica Online
Britannica, 2007). In a traditional
library, the catalog is used to locate
conventional library material. With the
existence of electronic information on
the Web, librarians are bound now to
follow the relationship between
libraries and computers. The ICT
revolution is curtailing the traditional
roles of the librarians and libraries
(Ogunsola, 2011).
Sure and Studer (2005) described
that traditional libraries are
transforming into digital libraries.
Digital libraries are developed with the
LIBRARY HITECH NEWS Number 3 2019, pp. 9-12, V
CEmerald Publishing Limited, 0741-9058, DOI 10.1108/LHTN-10-2018-0067 9
preservation of digital born content and
by following the process of digitization
as well. Digital libraries have made the
accessibility of the information more
productive and effective. The growth of
Web 2.0 and Web 3.0 has provided new
opportunities for users to access their
required information at anywhere and at
any time on the Web. Similarly, Cole
et al. (2013) expressed that the
phenomenon of transformation of
library catalog records into LOD is
evolving. Library metadata is
converting into XML-based metadata
standards for wider community usage
on the Web.
Benefits of linked data applications in
digital libraries
Application of linked data
technologies in digital libraries has
enabled Web users to access quality
information of libraries round-the-clock
on the Web. Linked data technologies
support that information resources
should be available on the Web without
any retrieval restriction. Linked data
technologies use faceted browsing
facility using different linked data
systems to provide required information
of the users (Alemu et al., 2012). Maria
et al. (2016) described in their study that
application of linked data technologies
in libraries has made capable the digital
collection to be viewed by machines on
the Web just like humans. Machines can
better read, interpret and reuse
information using linked data
technologies as to response the human
queries in Semantic Web environment.
Williams (2010) explained in his study
that linked data technologies facilitate
the decentralization of information on
the Web to develop links and
relationships of information on net for
wider access of information. Exposing
library data on the Web will increase the
interest for other professionals of the
world as well (Malmsten, 2009).
Gonzales (2014) pointed out in his
study that if libraries wanted to keep
alive in this modern information
technology world they will have to
adopt LOD initiatives. The remarkable
initiative to motivate libraries to publish
their resources on the Web is the
development of Bibliographic
Framework (BIBFRAME). This XML-
based cataloging standards of metadata
were developed by the Library of
Congress. According to Mitchell
(2013), the emergence of BIBFRAME
is the greatest revolution for the
libraries to publish their metadata and
information resources on the Web.
BIBFRAME facilitates interoperability
of data and provide relationships
between elements of metadata on the
Semantic Web using linked data
technologies.
Moran (2012) emphasized that the
greatest achievement of the Sematic
Web, and lined open data were that it
allowed the interoperability of the data
between organizations and information
systems. Another big achievement of
the Semantic Web is that its users can
create information about digital objects
using any language, metadata scheme or
classification scheme. Once this
information is linked to that digital
object’s DOI, machines will
automatically integrate that information
scattered over the Web to create a more
comprehensive record of that object.
Alemu et al. (2012) and Borst et al.
(2010) described that linked data
technologies have eliminated the
chances of duplicity of information on
the Web. Tillett (2013) also noted that
space problems of libraries will be
solved by putting information resources
on the Web, and Web will not allow the
chances of duplication. Another
important benefit for the Semantic Web
is that it will have richness of
information using quality libraries data.
Initiatives taken by the libraries
Hallo et al. (2014) point out that
libraries are developing metadata
models based on liked data and are now
involved in new roles such as
publishing data in the forms of authority
files, digital projects, bibliographic
catalogs and information from
Wikipedia. Wilson (2010) describes
that National libraries of the world are
taking a leading role to publish their
bibliographic data across the Web for
free and open access.
Mitchell (2013) and Maria et al.
(2016) describe that linked data
applications are used by Europeana, a
digital collections from museums,
archives and libraries of Europe
including audio visual collection. Its
users can search and share the material
on social media sites. It has semantic
searching URL http://eculture.cs.vu.nl/
europeana/session/search. XML-based
model EDM (Europeana Data Model)
are used, based on the principles and
practice of the Semantic Web and
Linked Data.
The Digital Public Library of
Americana (DPLA) is a comprehensive
initiative in the USA. This project was
formed in 2010 and launched in 2013
with the aim to integrate digital
collections of America’s libraries,
museums and archives. In 2013, the
DPLA started its linked data services
with the collaboration of its 18 partners
for online open access to its over 3,200
collections, and over two million
records (Mitchell, 2013). Maria et al.
(2016) described that the National
Library of France (data.bnf.fr) has
created the (International MARC)
INTERMARC Bibliographic format to
create semantic technologies.
Library of Congress started linked
data service in 2009. It included
approximately 260,000 records. It
provides access to vocabularies and
standards published by Library of
Congress. Since 2011, Library of
congress started Bibliographic
Framework (BIBFRAME) model which
was an XML-based linked data model
used for interconnection of network
resources.
Challenges faced by the digital
libraries in linked data environment
By becoming part of the semantic
cloud or linked data Web, libraries can
better meet user expectations
(Hannemann and Kett, 2010). Similarly
the W3C Library Linked Data Incubator
Group (W3C, 2011) noted that libraries
originally were adopting linked data
technologies at a very slow speed. This
paper also reported that MARC had
been a great obstacle to publish library
metadata and information resources on
the Web for wider community usage.
MARC has rigidity and inflexibility in
its structure and it is also unable to
connect the relationships of its entities
using the Web. This is now possible
with the XML standard that is missing
in MARC (Alemu et al., 2012).
Yadagiri and Ramesh (2013) noted
that privacy concerns of organizations
were a big hurdle to adopt linked data
10 LIBRARY HITECH NEWS Number 3 2019
technologies in digital libraries.
Gonzales (2014) discussed that
copyright laws and legalities were the
main obstacles to publish data on the
Web. Libraries were endeavoring their
best to gain licenses of various journals,
database and other online resources so
that they could serve their users without
any restriction. While commercial and
noncommercial use of data should be
decided in linked data environments.
Charles et al. (2014) pointed
multilingual and heterogeneous data is
also a big challenge for LOD on the
Web.
Hannemann and Kett (2010)
described huge technical infrastructural
development was required which
includes a Web server, digital
preservation system and a resolver that
point out the queries and respond it
quickly. Similarly, Alemu et al. (2012)
described that multifarious technologies
of linked data such as OWL, SPARQL
RDFS and RDF/XML were main
obstacle to implement linked data
technologies in digital libraries.
Berners-Lee et al. (2001) note that
linked data technologies should be
made easy to learn similar to the
experience of HTML during the early
days of Web. The integration of
educational resources across the Web is
facing challenges of equal shared
principles of linked data and schemas
(Dietze et al., 2013).
Concluding remarks
Traditional roles of libraries are
enhanced with open access Web-based
role round-the-clock. Linked data
technologies support that information
resources should be available on the
Web without any retrieval restriction.
Linked data technologies use faceted
browsing facility using different linked
data systems to provide required
information of the users (Alemu et al.,
2012). As already noted, machines can
better read, interpret and reuse
information using linked data
technologies as to response the human
queries in Semantic Web environment.
Williams (2010) explained in his study
that linked data technologies facilitate
the decentralization of information
on the Web and the important specialty
is this that this information is machine
readable just like human readability to
develop links and relationships of
information on net for wider access of
information. Exposing library data on
the Web will increase the interest for
other professionals of the world
(Malmsten, 2009).
The review notes several papers in
which the Web visibility of data has
improved. It has become possible to
link data with other online resources.
Open data recovery has been
introduced, and it has become possible
to access data from multiple institutes
through linked data service.
Interoperability has become possible
due to linked data technologies, and it
has made links of related data such as
author, publisher, place, events and
themes. The credibility of data has been
invigorated by introducing the links of
annotations by users. As others note, it
is recommended that libraries should
use linked data technologies so that
wider community could be facilitated
by the authentic and quality resources of
the libraries on the Web round-the-
clock.
The review summarizes the
challenges for some libraries to adopt
this new framework. The literature
reviewed for the study described that
libraries face some challenges as well.
A major area is cataloging as new
standards are difficult for many libraries
to adopt, such as BIBFAME and
Europeana Data Model (EDM). There
are also too many vocabularies and
ontologies to implement linked data
technologies such as Dublin Core,
FOAF, FRBR, FRAD, INTERMARC,
MADS/RDF, XML-EAD, OAI_ORE,
RDF, RDF Schema ORG, OWL, RDA,
SKOS and WGS84. As of this writing,
there are still no agreements to access
data online and migrating data to new
models, unavailability of linked data
tools to transform the data and lack of
expertise to deal with linked data tools
and applications, linking or mapping
problems such as MARC records to
FRBR. There are still ownership
challenges and controls, and
multilingual and heterogeneous forms
of data to deal with.
We end with some recommendations
from this reviews:
XML-based metadata standards
based on linked data technologies are
required to be developed that can easily
replace the MARC-based records.
The complex technologies of linked
data need experts who can apply these
technologies in these libraries. For the
purpose, librarians may also be trained
to implement linked data technologies
in their respective digital libraries.
Linked data courses should be
introduced by library and information
science schools.
Conferences, seminars and
workshops should be organized to
enhance the skills of the library
professionals so that they could be
capable to implement linked data
technologies in the libraries.
Professional bodies such as ALA,
Library of Congress, IFLA, OCLC and
W3C should play their role and should
address the challenges to implement
linked data technologies in libraries.
Vendors should coordinate with
librarians to deal with the copyrights
and legal issues. Open license is the
basic requirement for LOD on the
Semantic Web. Library and Information
Science associations such as IFLA,
OCLC, ALA and other leading
associations of library and information
science should play their role for the
availability of data in Open license on
the Web.
Libraries and information centers
along with their concerned professional
associations should play their role to
motivate the concerned higher
authorities about the benefits of linked
data technologies so that it could be
implemented in their respective
organizations.
REFERENCES
Alemu, G., Stevens, B., Ross, P. and
Chandler, J. (2012), “Linked data for
libraries: benefits of a conceptual shift from
library-specific record structures to RDF-
based data models”, New Library World,
Vol. 113 Nos 11/12, pp. 549-570.
Berners-Lee, T., Hendler, J. and Lassila, O.
(2001), “The semantic web”, Scientific
American, Vol. 284 No. 5, pp. 34-43.
Borst, T., Fingerle, B., Neubert, J. and Seiler,
A. (2010), “How do libraries find their way
onto the semantic web?”, Liber Quarterly,
Vol. 19 Nos 3/4.
Britannica, E. (2007), “Encyclopedia
britannica online”, 15 May 2010.
LIBRARY HITECH NEWS Number 3 2019 11
Byrne, G. and Goddard, L. (2010), “The
strongest link: libraries and linked data”,
D-Lib Magazine, Vol. 16 Nos 11/12.
Charles, V., Freire, N. and Isaac, A. (2014),
“Links, languages and semantics: linked data
approaches”, in The European Library and
Europeana,IFLA,Lyon.
Cole, T.W., Han, M.J., Weathers, W.F. and
Joyner, E. (2013), “Library marc records into
linked open data: Challenges and
opportunities”, Journal of Library Metadata,
Vol. 13 Nos 2/3, pp. 163-196.
Dietze, S., Sanchez-Alonso, S., Ebner, H.,
Qing Yu, H., Giordano, D., Marenzi, I. and
Pereira Nunes, B. (2013), “Interlinking
educational resources and the web of data: a
survey of challenges and approaches”,
Program, Vol. 47 No. 1, pp. 60-91.
Gonzales, B.M. (2014), “Linking libraries to
the web: linked data and the future of the
bibliographic record”, Information Technology
and Libraries (Online), Vol. 33 No. 4, p. 10.
Guerrini, M. (2013), “Introduction to the
seminar global interoperability and linked
data in libraries”, Italian Journal of Library
and Information Science, Vol. 4 No. 1, p. 21.
Hallo, M., Luja
´n Mora, S. and Trujillo
Monde
´jar, J.C. (2014), “Transforming
library catalogs into linked data”, This Paper
was Published at: ICERI2014 Proceedings,
2014, pp. 1845-1853, available at: http://
library.iated.org/view/HALLO2014TRA
Hannemann, J. and Kett, J. (2010), “Linked
data for libraries”, Proc of the World Library
Index, Vol. 174, pp. 13-18.
Haslhofer, B. and Isaac, A. (2011), “Data.
europeana. eu: the europeana linked open
data pilot”, in International Conference on
Dublin Core and Metadata Applications,
pp. 94-104.
Malmsten, M. (2009), “Exposing library data
as linked data”, IFLA Satellite Preconference
Sponsored by the Information Technology
Section Emerging Trends in.
Maria, H., Luja
´n-Mora, S., Mate
´,A.and
Trujillo, J. (2016), “Current state of linked
data in digital libraries”, Journal of
Information Science, Vol. 42 No. 2,
pp. 117-127.
Mitchell, E.T. (2013), “Metadata
developments in libraries and other cultural
heritage institutions”, Library Technology
Reports, Vol. 49 No.5, pp. 5-10.
Moran, S. (2012), “Using linked data to
create a typological knowledge base”, in
Linked Data in Linguistics, Springer, Berlin,
Heidelberg, pp. 129-138.
Ogunsola, L.A. (2011), “The next step in
librarianship: is the traditional library
dead?”, Library Philosophy and practice,
available at: https://digitalcommons.unl.edu/
cgi/viewcontent.cgi?article=1638&context=
libphilprac
Sure, Y. and Studer, R. (2005), “Semantic
web technologies for digital libraries”,
Library Management, Vol. 26 Nos 4/5,
pp. 190-195.
Tillett, B. (2013), “RDA and the semantic
web, linked data environment”, Italian
Journal of Library and Information Science,
Vol. 4 No. 1, p. 139.
W3C (2011), Library linked data incubator
group final report, W3C, Cambridge, MA.
Williams, H.K. (2010), “Linked data and
libraries”, Catalogue and Index, Vol. 160,
pp. 2-5.
Wilson, N. (2010), “Linked data prototyping
at the british library”, paper presented at the
Talis Linked Data and Libraries event,
available at: http://blogs.talis.com/nodalities/
neil-wilsonlightning-talk-%E2%80%93-
linked-data-and-libraries-2010
Yadagiri, N. and Ramesh, P. (2013),
“Semantic web and the libraries: an
overview”, International Journal of Library
ScienceTM, Vol. 7 No.1, pp. 80-94.
Zahid Raza (zahidraza78697@yahoo.
com) is based at the Department of
Information Management, University of
the Punjab, Lahore, Pakistan.
Khalid Mahmood (khalid.im@pu.
edu.pk) is Charmian and Nosheen
Fatima Warraich (Nosheen.im@pu.
edu.pk) is Associate Professor both
at the Department of Information
Management, University of the Punjab,
Lahore, Pakistan.
12 LIBRARY HITECH NEWS Number 3 2019
Article
Full-text available
This paper discusses about the concept of the semantic web, technology, web content writing, and the necessity for the development of web 3.0. The various components of semantic web technology such as HTTP, URI, RDF, XML, Ontology, W3C, and other components specified as W3C standards are touched upon briefly. The benefits of implementing the semantic web in the Library functions to provide effective information services and for optimum use of the Library, collection is illustrated here.
Article
Full-text available
Connecting to friends, colleagues, customers, and others on the internet is an everyday experience for most people these days. We use email, Twitter, Facebook, and other social networking systems quickly and easily when there is wifi or an internet service provider that reaches our geographic location – even as we move around. This change in our communication systems even extends sometimes to replacing phone calls with communications like Skype or Facetime. A former phone communication can now be a multi-media experience where you not only talk but also see each other (or groups of people), share pictures or videos or documents quickly and easily all at the same time. Where are libraries in this world?
Article
Full-text available
Why a Linked data Seminar? Linked data provides a new language in the world of global communication: from the public administration, banks, insurance companies, to archives, libraries and museums. Global communication via Linked Data is about using the Web to connect related data that wasn't previously linked, or using the Web to lower the barriers to linking data currently linked using other methods.
Article
Full-text available
The Semantic Web encourages institutions, including libraries, to collect, link and share their data across the Web in order to ease its processing by machines to get better queries and results. Linked Data technologies enable us to connect related data on the Web using the principles outlined by Tim Berners-Lee in 2006. Digital libraries have great potential to exchange and disseminate data linked to external resources using Linked Data. In this paper, a study about the current uses of Linked Data in digital libraries, including the most important implementations around the world, is presented. The study focuses on selected vocabularies and ontologies, benefits and problems encountered in implementing Linked Data in digital libraries. In addition, it also identifies and discusses specific challenges that digital libraries face, offering suggestions for ways in which libraries can contribute to the Semantic Web. The study uses an adapted methodology for literature review, to find data available to answer research questions. It is based on the information found in the library websites recommended by W3C Library Linked Data Incubator Group in 2011, and scientific publications from Google Scholar, Scopus, ACM and Springer from the last 5 years. The selected libraries for the study are the National Library of France, the Europeana Library, the Library of Congress of the USA, the British Library and the National Library of Spain. In this paper, we outline the best practices found in each experience and identify gaps and future trends.
Article
Full-text available
The ideas behind Linked Data and the Semantic Web have recently gained ground and shown the potential to redefine the world of the web. Linked Data could conceivably create a huge database out of the Internet linked by relationships understandable by both humans and machines. The benefits of Linked Data to libraries and their users are potentially great, but so are the many challenges to its implementation. The BIBFRAME Initiative provides the possible framework that will link library resources with the web, bringing them out of their information silos and making them accessible to all users.
Conference Paper
Full-text available
The European Library and Europeana have both an extensive experience in aggregating metadata for bibliographical records or digital resources from the cultural heritage institutions of Europe. For both of them bypassing the challenges offered by multilingual and heterogeneous data is an ongoing effort. The growth of the Semantic Web and the more generalised publication of knowledge organisation systems as linked open data offer the possibility to make these services truly multilingual. This paper shows how The European Library and Europeana exploit the semantic relations and translations offered by knowledge organisations systems in order to solve the problem of data integration at a European scale. It also demonstrates the potential of Linked Open vocabularies for enabling multilingual search and retrieval services.
Conference Paper
Full-text available
Traditionally, in most digital library environments, the discovery of resources takes place mostly through the harvesting and indexing of the metadata content. Such search and retrieval services provide very effective ways for persons to find items of interest but lacks the ability to lead users looking for potential related resources or to make more complex queries. In contrast, modern web information management techniques related to Semantic Web, a new form of the Web, encourages institutions, including libraries, to collect, link and share their data across the web in order to ease its processing by machines and humans offering better queries and results increasing the visibility and interoperability of the data. Linked Data technologies enable connecting related data across the Web using the principles and recommendations set out by Tim Berners-Lee in 2006, resulting on the use of URIs (Uniform Resource Identifier) as identifiers for objects, and the use of RDF (Resource Description Framework) for links representation. Today, libraries are giving increasing importance to the Semantic Web in a variety of ways like creating metadata models and publishing Linked Data from authority files, bibliographic catalogs, digital projects information or crowdsourced information from another projects like Wikipedia. This paper reports a process for publishing library metadata on the Web using Linked Data technologies. The proposed process was applied for extracting metadata from a university library, representing them in RDF format and publishing them using a Sparql endpoint (an interface to a knowledge database). The library metadata from a subject were linked to external sources such us another libraries and then related to the bibliography from syllabus of the courses in order to discover missing subjects and new or out of date bibliography. In this process, the use of open standards facilitates the exploitation of knowledge from libraries.
Article
Today researchers search for books in various ways. Once discovered, a variety of Web technologies can be used to link to related resources and/or associate context with a book. This environment creates an opportunity for libraries. The linked open data (LOD) model of the Web offers a potential foundation for innovative user services and the wider dissemination of bibliographic metadata. However, best practices for transforming library catalog records into LOD are still evolving. The practical utility on the Semantic Web of library metadata transformed from MARC remains unclear. Using a test set of MARC21 records describing 30,000 retrospectively digitized books, the University of Illinois at Urbana-Champaign (UIUC) Library explored options for adding links, transforming into non-library specific LOD-friendly semantics, and deploying as RDF to maximize the utility of these records. This paper highlights lessons learned during this process, discusses findings to date, and suggests possible avenues for further work and experimentation.
Article
Since 1999 the W3C has been working on a set of Semantic Web standards that have the potential to revolutionize web search. Also known as Linked Data, the Machine-Readable Web, the Web of Data, or Web 3.0, the Semantic Web relies on highly structured metadata that allow computers to understand the relationships between objects. Semantic web standards are complex, and difficult to conceptualize, but they offer solutions to many of the issues that plague libraries, including precise web search, authority control, classification, data portability, and disambiguation. This article will outline some of the benefits that linked data could have for libraries, will discuss some of the non-technical obstacles that we face in moving forward, and will finally offer suggestions for practical ways in which libraries can participate in the development of the semantic web.
Article
This article provides a history of libraries from their founding in the ancient world through the later half of the 20th century, when both technological and political forces radically reshaped library development. According to Encyclopedia Britannica Online 2007, the paper identifies the series of technological developments with which both librarians and traditional libraries have had to contend. The impression that the global technological revolution now taking place has "murdered" traditional libraries as well as the librarians working in them was critically assessed. It was later concluded that both digital and traditional libraries will co-exist side by side and they will complement each other in the educational, economical and social development of the country concerned.