ArticlePDF Available

An analysis of academic librarians competencies and skills for implementation of Big Data analytics in libraries: A correlational study

Authors:

Abstract

Purpose The purpose of this paper is to analyze the views and capabilities of librarians for the implementation of Big Data analytics in academic libraries of Pakistan. The study also sets out to check the relationship between the required skills of librarians and the application of Big Data analytics. Design/methodology/approach A survey was conducted to gather the required data from the targeted audience. The targeted population of the study was Head/In charge library managers of Pakistani university libraries, which were 173 in total. All the respondents (academic librarians) were invited through an e-mail to respond to the survey voluntarily. Out of 173 respondents from higher education commission of Pakistan chartered university libraries, 118 librarians (68.2 percent) completed the survey that was finally considered, and after checking data, recommendation for analysis was made. To analyze the collected data, statistical technique Pearson correlation was applied using statistical package for social science version 25 to know the strength of the mutual correlation of variables. Findings The findings of the study show a strong correlation between the required competencies and skills of librarians for the implementation of Big Data analytics in academic libraries. In all variables of the study, the correlation was highly significant, except two of the variables, including “concept of Big Data” and “different forms of data.” The study also reveals that most of the respondents were well aware of the concept of Big Data analytics. Moreover, they were using a large amount of data to carry out various library operations, including the acquisition, preservation, curation and analysis of data. Originality/value This study is significant in the sense that it fills a substantial gap in the literature regarding the perspective of librarians on Big Data analytics.
An analysis of academic
librarians competencies and
skills for implementation of
Big Data analytics in libraries
A correlational study
Khurshid Ahmad, Zheng JianMing and Muhammad Rafi
Department of Information Management, Nanjing University, Nanjing, China
Abstract
Purpose The purpose of this paper is to analyze the views and capabilities of librarians for the
implementation of Big Data analytics in academic libraries of Pakistan. The study also sets out to check the
relationship between the required skills of librarians and the application of Big Data analytics.
Design/methodology/approach A survey was conducted to gather the required data from the targeted
audience. The targeted population of the study was Head/In charge library managers of Pakistani university
libraries, which were 173 in total. All the respondents (academic librarians) were invited through an e-mail to
respond to the survey voluntarily. Out of 173 respondents from higher education commission of Pakistan
chartered university libraries, 118 librarians (68.2 percent) completed the survey that was finally considered,
and after checking data, recommendation for analysis was made. To analyze the collected data, statistical
technique Pearson correlation was applied using statistical package for social science version 25 to know the
strength of the mutual correlation of variables.
Findings The findings of the study show a strong correlation between the required competencies and skills
of librarians for the implementation of Big Data analytics in academic libraries. In all variables of the study,
the correlation was highly significant, except two of the variables, including concept of Big Dataand
different forms of data.The study also reveals that most of the respondents were well aware of the concept
of Big Data analytics. Moreover, they were using a large amount of data to carry out various library
operations, including the acquisition, preservation, curation and analysis of data.
Originality/value This study is significant in the sense that it fills a substantial gap in the literature
regarding the perspective of librarians on Big Data analytics.
Keywords Competencies, Academic libraries, Librarians, Skills, Analytics, Big Data
Paper type Research paper
1. Introduction
The development in this digital era, the use of information communication, Internet of Things
and cloud system technology caused an extensive data growth in almost every
area of life in this digital environment (Liu et al., 2018). In this perspective, due to the
information explosion and the extensive use and growth of data, the problem is being
faced to organize this massive unstructured growth of data. This situation provides genuine
reasons to evaluate these factors, as it is described that the growth of data on such a large
scaleisreferredtoasBig Data(Djafri et al., 2018). Nowadays, the Big Data is a hot topic of
discussion in the business, industry, education and government agencies globally;
the research and developmental practices are being conducted to overcome the challenges
and the opportunities of analytic use in Big Data (Cuzzocrea, 2014). It can say that Big Data is
leading pathway toward the emerging digital native economy of the world. However, in the
current literature, there is no unified definition of Big Data. Different circles have
defined Big Data in different ways. Some of the definitions mentioned as follows: Big Data
refers to such datasets whose volume is outside the capacity of traditional softwares
(Manyika et al., 2011). Big Data comprises of those data sets which are so large and complex
that commonly used softwares are incapable of dealing with them(Garcia and Wang 2013).
Data Technologies and
Applications
Vol. 53 No. 2, 2019
pp. 201-216
© Emerald Publishing Limited
2514-9288
DOI 10.1108/DTA-09-2018-0085
Received 30 September 2018
Revised 10 November 2018
Accepted 15 January 2019
The current issue and full text archive of this journal is available on Emerald Insight at:
www.emeraldinsight.com/2514-9288.htm
201
Implementation
of Big Data
analytics in
libraries
Similarly, Laney (2001) first presented the concept of Big Datathat it has some special
characteristics that differentiate it from ordinary data. These characteristics are categorized
and explained into 5Vs, that is, huge volume, high velocity, high variety, low veracity, and
high value(Jin et al., 2015). The size of Big Data is enormous in comparison with
regular ordinary data. There is no specified limitation for data volume and growth. The
velocity of Big Data refers to its dynamic and fast creation. The feature and variety of data
sets create the difficulties to the utilization process and organization of data. The data
that are collected systematically in a proper type by the data scientists or business
organization can be structured. However, some types of data are found in an unstructured
form that are gathered from different resources as e-mails and online collected data (Wang
et al., 2016). The development of Big Data creation, acquisition, storage and flow has come up
with some challenges and opportunities for modern libraries. The advancement in the
communication and information technology has changed the structures of organizations that
are being modified to compete the environmental and social changes in the society.
In this context, the importance of Big Data in librarianship is also being recognized in the
circles of library professionals (Zhan and Widén, 2017). Another research by Ilesanmi (2013)
described that libraries are the centers of knowledge organization, retrieval and dissemination
of information and maintaining information systems in the society to meet the requirements of
community. However, now the emergence of Big Data is forcing libraries to redesign the
patterns of their services that they usually had for carrying out their operations (Affelt, 2015).
To respond the change in this digital ere, Noh (2015) argued that the present form of libraries
can be converted into library 4.0. The library 4.0 can be defined as an intelligentlibrary, which
can analyze the massive data utilization and present the findings to their users. It means that
the unique feature of library 4.0 will be the handling of the substantial form of data. It also
shows that the emerging trend of Big Data is a helpful toward the improvement of libraries
and infrastructure development to provide the better services to the users of libraries and
community. In present situation, libraries are facing the challenge of data handling and the
lack of skills of library professionals. There is also a need for these skills to be improved to
handle the opportunities and issues that are created in this Big Data era (Gordon-Murnane,
2012). This shifting paradigm from traditional to contemporary library infrastructure and
services creates an unusual situation that is to be analyzed. This study has some important
implications of Big Data analytics for the academic libraries. The literature review of
conducted studies and the current paradigm shift helps us to know the competencies and
skills of librarians reflecting the librariescapacity that is also necessary to address the
potential utilization of Big Data analytics in academic libraries.
1.1 Objectives
The objectives of this study are as follows:
(1) to explore the extent to which the Big Data analytics is being used in Pakistani
university libraries;
(2) to analyze the perceptions of LIS professionals toward the use of Big Data analytics; and
(3) to check the relationship between the competencies and skills of librarians for the
implementation of Big Data analytics.
2. Literature review
In previous studies, researchers have concentrated on the opportunities, merits and
demerits, required qualifications and skills, organization, and some other aspects of Big
Data analytics. In this section, the researchers review some selected studies conducted on
Big Data-related aspects. In this digital environment, the production, storage, organization
202
DTA
53,2
and analysis of data have much increased the volume of data. In this scenario, the study of
Gorman (2015) focused on the new challenges and the opportunities for the libraries; he
proposed a new model for the implementation of Big Data analytics. It is also essential to
evaluate the strength and weakness of the organization to improve the services and
products. By knowing the challenges and opportunities of the organization, the
implementation of analytics helps to overcome these challenges. With this, Gunasekaran
et al. (2018) emphasized that the analytical data of large companies play an important role in
organizational flexibility by capturing and analyzing product requirements. It also
contributes to the planning, design and rapid development of supply chain networks, as well
as the production of new products. Yaqoob et al. (2016) divided the Big Data paradigm into
two phases, i.e. structuralism and functionalism. They further discussed that these are also
helpful to know the current trends in the rapid growth of Big Data. Zhuge (2015) argued that
in this era of Big Data, there is the emergence of new developing opportunities in the every
field of studies. Hence, these opportunities are also available for library and information
science (LIS) professionals to improve the services and collection of libraries.
The study results of Bedeley et al. (2018) show that the organizations are using the
analytics that focused on the application analysis because the results of the application
areeasytomeasure.Inadditiontoleveraging analytics that is used to improve the value
chain activities, many companies use different technologies to improve their
infrastructure or conduct research and development to support advanced analytics
practices. Lu et al. (2017) claimed that academic librarians are well aware of the concept of
Big Data analytics and they are forming the activities that related to data. However, there
is also a need for collaboration among the different sections of libraries for the
implementation of Big Data analytics. As the research of Triperina et al. (2018) reveals that
the used ontology in the academia provides the obviousness of classification, feminizings
to new solicitations, semantic web relationships and deepens of data mining. This process
leads to the implementation process of Big Data analytics in organizations. The tasks and
abilities of librarians can be the identification of data formats, acquisition, data
management and organization. In this context, librarians have been performing activities
such as data curation, visibility, interoperability of data, analysis and visualization of data
(Wang et al., 2016).
Similarly, Taylor-Sakyi (2016) discussed the Big Data phenomena by stating that
companies and organizations are thinking to reorganize the organizational process on the
growth of data for services and to enhance the product quality. For this approach, rational
database management systems are replacing the traditional systems of data management in
organizations. In 2005, the population of the internet was 10,24m but now reached 4,000m.
These numbers are showing that the opportunities for online researchers are increasing and
creating a more significant challenge for the organization of Big Data storage and its
privacy in the organizations (Schaich, 2018). The review of the previous literature reveals
that there are some studies conducted toward the implementation of Big Data analytics
in libraries. However, in the context of Pakistan as a country, there is a need to know
the understanding and to explore the competencies/skills of librarians about Big Data
analytics. The current research is an insight to add up the literature in the respective
field of studies.
2.1 Challenges of Big Data
In the process of data analytics, dealing with large data volume and growth is not a big
problem. The main issues are associated with the types of data, the velocity of data and
veracity of the data. Due to the different forms of the data, the choice and use of analytical
tools are difficult. It is a time taking task to structure the data format in the implementation
process of Big Data analytics in libraries, but sometimes the form of traditional
203
Implementation
of Big Data
analytics in
libraries
unstructured data and online format are found in the random text such as voice data, videos
and images. Organizing such type of open data format is difficult. From the perspective of
the information industry, Big Data is a powerful driver of the next generation of information
technology, based primarily on a third platform, focusing on data, cloud computing, internet
the use of mobile and online social networking. The most complicated parts of Big Data
analysis are context analysis, data organization and evaluation. Therefore, in business
organizations, it is essential to define the potential of research economics and global
industrial scale at the organizational level that is also related with the limitations of data
protection and digital access models ( Jin et al., 2015). The challenges faced by libraries in
Big Data analytics focus on the lack of support from parent organizations. Concerns in this
regard include the lack of infrastructure to meet conservation needs, support for the
provision of knowledge or literacy of data and the lack of conservation initiatives in libraries
and information centers (Thomas and Urban, 2018).
2.2 Big Data analytics in libraries
According to the research of Read et al. (2015), there are always challenges associated
when starting a new library service. Tenopir et al. (2014) found that many LIS
professionals provide the data services by using the extension of traditional library
services, but some of them are more involved in helping the library users by developing
the plan of data management and organization. Read et al. (2015) also discussed that the
primary objective is to improve the efficiency and effectiveness of the institutional
approach by using data analysis to help universities address emerging issues related to
low retention rates and more extended periods. Analyzing the challenges of Big Data
analytics, Katal et al. (2013) claimed that volume, velocity, variety and veracity are the
four dimensions of Big Data. Besides this, DeVan (2016) study revealed the three more Vs
of Big Data with variability,the blends of constant data changing could have an
immense impact on the homogenization of data. The visualization of data in charts,
graphs and the value of organizational data are integrated challenges of analytics.
Themergingofthedataisdifficultafterfetching from the different sources. In library
(Goldberg et al., 2014), the types of data change dramatically and various volumes of
the data must be organized and supported to enable the services of the library. Due to the
digital environment, the needs of the library users will continuously grow in the future
(Showers, 2014). Librarians should have the ability to relate to the creation, management
and preservation of data (Semeler et al., 2017). The role of librarians is essential for Big
Data analytics, so there is a need to improve the skills and knowledge of librarians for the
implementations of Big Data analytics. Xie and Fox (2017) argued that for the application
of Big Data analytics, the library professionals do not have as such expertise to provide
new value-added services. In this context, the research of Atkinson (2018) reveals that
there is a need of client-centric approach for LIS professionals to be more integrated into
theacademicprocessandtounderstandandsupport different phases of research-based
needs of library users.
2.3 Competencies and skills of librarians for handling Big Data
The contemporary era of digital environment provides opportunities for librarians to
involve with data-related activities. LIS professionals should learn the skills and knowledge
about the provision of potential data services. The study of Fink et al. (2017) describes that
managers should seek to deploy strategic intelligence systems, starting with the investment
in the formation of a highly qualified and competent intelligence team, including expertise in
integration, analysis and data presentation. In this case, it is rare to find that some people do
not have simultaneous business experience in the organization. In the duties of a data
librarian, the indispensable core of his mission is to transform the different forms of data
204
DTA
53,2
that are generated or can be helpful for the researchers and library users (Ahmad, 2017).
De Mauro et al. (2015) described that since companies are considering Big Data as
information assets, librarians too should bring in the practices of Big Data analytics.
Similarly, another study illustrated that the knowledge and skills of the new technology are
essential to the librarians, which can help in the provision of information service to the users
in time and high quality (Ullah and Anwar, 2013). Therefore, it is indeed to acquire the
landscape of libraries environment according to the context of Big Data analytics.
According to Oakleaf (2016) to achieve the goals and mitigate the challenges of learning
analytics, librarians should anticipate the use and learning of analysis in their institutions.
The librarians can prepare and develop their skills through collaboration with their
colleagues. Hoy (2014) also made the same point, with technological advancements,
librarians need to be familiar with the capabilities and problems inherent in large data and
use knowledge to help their customers choose the right tools. There is also the culture of
paranoid in librarianship to learn the new technological skills and emerging trends
(Braganza et al., 2017). In this perspective, there is a need to improve the skills and
competencies of LIS professionals in this competitive era of Big Data analytics.
3. Method of the study
In this study, a survey was carried out to achieve the targeted objectives. Accordingly, after a
thorough literature review, a research tool was developed to collect data from the respondents.
We developed a structured research questionnaire based on a five-point Likert scale and some
close-ended questions. The targeted population of the study was Head/In charge library
managers of Pakistani university libraries, which were 173 in total. All the respondents
(academic librarians) were invited through an e-mail to respond to the survey voluntarily.
We targeted the administrative librarians for data collection considering that they are the
best-informed persons about the status of technology and services in their libraries. These
professionals (academic librarians), in top management position, are involved in making
decision and implementation of new technology in central as well as in seminar libraries
of the universities. Overall, the administrative librarians are actively involved in designing
policies to all types of libraries in each university. All these universities are recognized
by higher education commission of Pakistan. These universities are functioning under the
umbrella of the higher education commission totaling 173. During the survey data were
received from 118 universities at the ratio of (68.2 percent) that were considered ultimate for
analysis. To analyze the collected data, statistical technique Pearson correlation was applied
using statistical package for social science version 25 to know the strength of the mutual
association of variables. This technique was implemented by many researchers including Fan
et al. (2014) to analyze data for determining the complex relationship between values.
Following the same procedure, we applied it for explaining the quantitative variables
related to Big Data challenges in this study. In our research to test the statistical nature
and effectiveness of these measures, a pilot test was also carried out on
20 respondents. The pilot test allows the researchers to determine whether the respondents
understand the contents of the research tool entirely or not. The respondents of the pilot study
highlighted some weak points, and variables in the research tool were corrected for
respondents facilitation. The findings in our research showed a strong mutual correlation
po0.005 between the tested values.
4. Results of the study
The population of this study consisted of LIS professionals who varied in categories
of age, qualification and working experience in the public and private sector universities and
degree awarding institutes of Pakistan. Table I shows the demographic characteristics of
the respondents of this study. Among the 118 respondents, the number of males was
205
Implementation
of Big Data
analytics in
libraries
90 (76.3 percent), and the number of female respondents was only 28 (23.7 percent).
The proportion of public sector organizations was 70 (59.3 percent), and the percentage of
private sector organizations was 48 (40.7 percent). The qualification of the respondents was as
follows: MLIS 80 (67.8 percent) and MS/MPhil was 27 (22.9 percent), whereas only 11
(9.3 percent) librarians had the PhD degree in LIS. Respondentsprofessional experiences were
divided into five categories. The number of LIS professionals having the experience of 1115
years was 33 (28.0 percent), followed by 30 respondents (25.4 percent) with 510 years of
experience. The 21 respondents (17.7 percent) had more than 20 years of work experience in
LIS . In total, 18 (15.3 percent) of the LIS professionals working in academic institutions had
five years of work experience. The last ranking category of respondentsexperience was 1620
years, which consisted of 16 (13.6 percent) respondents. These results show that LIS
professionals working in academic institutions were having extensive work experience in the
field of LIS. The results also demonstrate that the respondents also improved their
qualifications by pursuing higher degrees in LIS. About 40 percent of the population was
having MS/MPhil and PhDs degrees in the field of LIS. It shows that LIS professionals are
concerned about their professional education.
4.1 Understanding of LIS professionals about Big Data analytics
The literature review reveals that Big Data analytics is a real source of competitive
advantage in an organization. It helps the working environment of libraries and information
centers to develop the better understand according to their working environment and
improve the collection and library services. This study evaluates the understanding level of
academic librarians about Big Data analytics. In Table II, the correlation between the
concept of Big Data analytics (1) has a significant association with Big Data activities(2),
with data volume (3), data forms (4) and with an increase of data (5). This analytics is
related to the practices of libraries toward the implementation process of Big Data analytics.
The results also show that librarians were familiar with all the variables of Big
Data analytics accept the different forms (4) of Big Data analytics, po0.0.005 of
librariansviews and understanding in Big Data analytics. It is also shown in Table II
that Pearson correlation was at the highest level in public and private sector academic
libraries of Pakistan.
4.2 Level of significance ( p o0.005), **represents significance, Variables 6
The results of Table II reveal that there is a significant level of understanding of the
concept, activities, data volume and data increase in the practices of academic libraries. The
librarians are performing these activities to some context in their libraries. However, due to
Measure Items Frequency Percentage
Gender Male 90 76.3
Female 28 23.7
Type of institution Public sector 70 59.3
Private sector 48 40.7
Academic qualification MLIS 80 67.8
MS/MPhil 27 22.9
PhD 11 9.3
Professional experience W5 years 18 15.3
510 years 30 25.4
1115 years 33 28.0
1620 years 16 13.6
o20 years 21 17.7
Table I.
Profile of the
respondents
206
DTA
53,2
the technological barriers and Big Data Vs. (volume, velocity, variety, variability, veracity,
visualization and value), they are not able to recognize the different forms of Big Data.
In response to the first question of this study, this table validates that librarians of the
Pakistani academic libraries are well aware of the emerging trends and they are practicing
the Big Data analytics in their respective libraries. Therefore, Big Data reveals not only the
tendency of library digitization but also bring opportunity, development and challenge.
It also encourages the creation of technical solutions. The values also generated by the
proper manipulation and use of large amounts of data found helpful for libraries and
information organizations (Zhan and Widén, 2017).
The scatter plot matrix of correlation between the variables shown in Figure 1 reveals
that there is a significant correlation between all the variables of required skills of librarians
for the implementation of Big Data analytics in libraries. The Pearson correlation also
indicates the degree to which these factors have a substantial positive impact on each other.
It demonstrates that librarians have a strong understanding of the importance and
contribution toward the improvement of libraries. It also indicates that the libraries will be
able to improve their data-related services if they have these skills.
4.3 Required competencies and skills of LIS professionals for Big Data analytics
The present research identifies and addresses many critical concerns, involving Big Data
and educational analytics that are being practiced in different academic libraries and their
parent organizations. Librarianscompetencies are considered an integral requirement to
participate in activities related to Big Data analytics. As the research of Federer (2018)
focused on the skills of data librarians and explained that LIS professionals as data scientist
are assorted experts in the information society with different academic and career
backgrounds to deal with different forms of work. As the emerging major trends expected
among data science librarians, there are differences in the type of work data that librarians
can have concerning specific kinds of valuable expertise and the type of jobs that have to
employ the different data librarians. In this context, two professional groups, experts and
data generalists, described here suggest that the science of databases may not play a unique
role, but rather an area that allows professionals to respond to their interests or to meet of
the need of community users. With this, the study has also examined the competencies,
practices and procedures to extract the required information and data in different formats in
a diverse and varied information system. It involves in the various forms of data,
Correlation
1* Concept 2. Activities 3. Volume 4. Forms 5. Increase
0.490
2 0.000**
0.330 0.552
3 0.000** 0.000**
0.253 0.416 0.715
4 0.006** 0.000** 0.000**
0.297 0.374 0.532 0.583
5 0.001** 0.000** 0.000** 0.000**
0.075 0.120 0.321 0.331 0.409
6 0.422 0.197 0.000** 0.000** 0.000**
Notes: The activities (2) of big data are associated with the volume (3), forms (4) of data and data increase (4)
for the practices of Big Data analytics in libraries. The variable 6 is also correlated with the volume (3), forms
(4) Increase (5) of data activities in libraries. 1* is the variable Conceptand **show the significance of the
correlation between the variables
Table II.
Correlation between
understanding
and practices of
Big Data analytics
(journal format)
207
Implementation
of Big Data
analytics in
libraries
acquisition, management and organization, interoperability, data quality, metadata skills,
data curation, culture data, data preservation, data analysis, data visualization and the
policies/ethics for the practices of Big Data analytics in academic libraries. It also gives the
insight to know the role of librarians in the perspective and implementation of these skills in
their working environment and practices in libraries.
4.4 Level of significance ( p o0.005), **represents significance, Variables 12
Table III demonstrates that a significant correlation exists among all the 12 variables of
required competencies and skills. It is based on the variables that relate to the required
competencies and skills of librarians for the implementation of Big Data analytics in
libraries. In this context, the research of Lee and Kim (2018) reveals that in the field of
information and communication technologies, the visualization and analysis of the data in
an organizations database are called ecosystem links. Leveraging and providing data to
develop the infrastructure and to improve services it is a powerful tool in the organization.
There is no doubt that without these required competencies of Big Data analytics the
strategy of implementing Big Data analytics in academic libraries cannot be applied.
The study of Johnson (2017) discussed that the collaboration among the different sections
enables librarians to demonstrate the developed value of additional professionalsexpertise.
The visual analytics also lets us discover the likely or unlikely results mainly in Big Data
formats. The results of Table III also demonstrate that librarians have sufficient
competencies and skills to tackle Big Data analytics in their respective libraries and
organizations. The following details of the correlation among 12 variables of librarians
required competencies and skills are:
The different formats (1) of Big Data analytics skills are significantly correlated with
(2) acquisition of data, (3) management and organization of data, (4) interoperability,
(5) quality, (6) metadata skills, (7) curation, (8) culture of organization, (9) preservation, (10)
Concept Activities Volume Forms Increase Issue
Concept
Activities
Volume
Forms
Increase
Issue
Figure 1.
Scatter plot matrix
for correlation
between variables
208
DTA
53,2
Correlation
1* Formats 2. Acquisition 3. Management 4. Interoperability 5. Quality 6. Metadata 7. Curation 8. Culture 9. Preservation 10. Analysis 11. Visualization
2 0.553
0.000**
3 0.497 0.630
0.000** 0.000**
4 0.517 0.608 0.555
0.000** 0.000** 0.000**
5 0.492 0.559 0.595 0.562
0.000** 0.000** 0.000** 0.000**
6 0.446 0.457 0.378 0.460 0.503
0.000** 0.000** 0.000** 0.000 0.000**
7 0.460 0.474 0.469 0.481 0.455 0.623
0.000** 0.000** 0.000** 0.000** 0.000** 0.000**
8 0.400 0.544 0.445 0.553 0.473 0.482 0.704
0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000**
9 0.281 0.330 0.321 0.460 0.442 0.484 0.493 0.461
0.002** 0.000** 0.000 0.000** 0.000** 0.000 0.000** 0.000**
10 0.383 0.334 0.417 0.579 0.507 0.410 0.461 0.544 0.578
0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000**
11 0.358 0.425 0.415 0.618 0.478 0.327 0.515 0.669 0.481
0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000**
12 0.466 0.348 0.429 0.505 0.454 0.369 0.388 0.438 0.442 0.544 0.582
0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000** 0.000**
Notes: The significance of variable 2 and 12 described in the results. *Described the correlation between the variables. **Significant at the 0.01 level (two-tailed)
Table III.
Correlation between
librariansrequired
competencies and
skills for the
implementation of Big
Data analytics
209
Implementation
of Big Data
analytics in
libraries
analysis, (11) visualization of data and (12) ethics involved for the implementation of Big
Data analytics in libraries. Thus, the required skills of librarians are significant for Big
Data analytics in libraries. The variable (2) acquisition of data also significantly correlates
with (1) different formats, (3) management and organization of data, (4) interoperability,
(5) quality, (6) metadata skills, (7) curation (8) culture of organization, (9) preservation (10)
analysis, (11) visualization of data and (12) ethics. The acquisition of data in library
practices leads to the better skill of Big Data analytics in the libraries.
Management and organization of data (3) is strongly correlates with the (1) different
formats, (2) acquisition of data, (4) interoperability (5) quality (6) metadata skills (7)
curation (8) culture of organization, (9) preservation (10) analysis, (11) visualization of
data and (12) ethics. Thus, the organization and management of data are having a
significant impact on other competencies of Big Data analytics.
The variable (4) Interoperability robustly correlates with other skills of Big Data
analytics in libraries, such as association with (1) different formats, (2) acquisition
of data, (3) management and organization of data, (4) interoperability, (5) quality,
(6) metadata skills, (7) curation, (8) culture of organization, (9) preservation (10)
analysis, (11) visualization of data and (12) ethic.
Awareness about the quality of data is a very essential skill for Big Data analytics;
librarians have the competencies to know that the data quality is also significantly
correlated with (1) different formats of data, (2) acquisition of data, (3) management
and organization of data (4) interoperability of data, (6) metadata skills (7) curation,
(8) culture of organization, (9) preservation (10) analysis, (11) visualization of data
and (12) ethics.
In libraries, LIS professional has to perform the different activities of variable
metadata (6). It includes cataloging, reference services, technical section processing,
creating library reports, digital resources utilization, etc. Thus, a significant
correlation is also found with described variables of the study, and it is helpful for the
clustering of data.
Data curation (7) is a vital element in the digital environment. It also involves the
different scholarships of libraries and information centers. A significant correlation is
also found in this variable with all other variables of the study.
Organizational culture (8) also has a significant correlation with other required skills
of librarians because it affects all activities of the organization.
The role of libraries and librarians involved in the preservation of data is essential. In
this perspective, the skill of data preservation (9) also has a significant correlation
with other skills of library professionals.
Data analysis (10) is very demanding and technical skill for the implementation of
Big Data analytics. A significant correlation is also found in data analysis with other
variables of Big Data analytics.
As described data analysis (10) variable in our study, data visualization (11) also has
a significant correlation with all variables of the study.
As all the described skills of Big Data analytics are necessarily important, ethics (12)
is also essential to perform any data-related activity within the organization.
On the bases of these results of the study, the technical skills toward the implementation of
BDA are essential for university librarians. Similarly, Semeler et al. (2017) narrated that in
the age of digital environment, this is a significant factor for data formats, acquisition,
210
DTA
53,2
management and organization of data, interoperability, quality, metadata, data curation,
organizational culture, preservation, data analysis and visualization. In this perspective,
ethicsfound a significant correlation with the skills of Big Data analytics in libraries.
Our results associated with the study of Burton et al. (2018) highlighted that the research
librarians are a group of actors in the networks on which researchers rely in the course of
university research. The value of privacy, ethics and fair access to the information is critical
to library science professionals, making librarians a unique partner for researchers and
others. It is part of the universitiesresearch culture support network. At the same time,
there is an international dialogue on the ethical use of data in this digital environment.
Librarians have the opportunity to find their role in supporting researchers by addressing
new ethical issues in research, in particular by providing services to the functional units that
are responsible for ethics and coordinating in universities. Libraries have to coordinate
with the parent organization to ensure the data literacy instructions and the literacy of
data ethics. There is also the need for university librarians to improve their skills and
practices by participating in different training programs and conferences, due to the
advancement of technology the services and the paradigm of library infrastructure being
changed with the passage of time. The improvement of the library structure and functions is
associated with the skills of librarians and library works. The significance of these Big data
analytical skills plays a role of interplay with the services and practices of libraries. On the
bases of these skills, the library community will serve in a better way. And the research
culture of the university can be improved by the alignment of these technologies and the
practices of data analytics in the libraries and universities.
The scatter plot matrix of correlation between the variables on the bases of Table III
shown in Figure 2 also reveals a significant relationship among all variables of research.
The results indicate that there is a strong correlation in the required skills of librarians
toward the implementation of Big Data analytics in academic libraries. These skills are
helpful to improve the competencies of librarians in the educational environment and also to
meet the research needs of library users to provide the best research-oriented services. TH
competencies on these skills help to choose the right tool for the analytics application
process. With the study of Netto et al. (2014), the university librarians can get involved in
decision making and can take the initiatives toward the implementation of new technology
in their libraries. By providing information services to the university community, they also
get involved in the process of introducing new research trends and paradigm to improve the
research culture in academia.
5. Discussion
The concept and the implementation of Big Data analytics are growing day by day.
The environment of Big Data has provided a vast opportunity for competition within the
organizations. In this scenario, the role of librarians is also changing, and now they are
working as a data scientist, data curators and digital services managers. In this perspective,
librarians are involved in data acquisition, curation, interoperability, data organization,
metadata skills, data preservation, analysis and visualization. As the traditional practices of
libraries, librarians have to follow the policies and ethics for the methods of Big Data
analytics in libraries. The data privacy and acquisition are found critical issues nowadays.
The competencies of handling Big Data vary from organization to organization. It also
depends on the format, value, volume and organizational culture. The competencies and
skills lead toward the proper implementation of Big Data analytics in libraries.
In this research, the researchers evaluated the correlation of required competencies
and skills of librarians for the implementation of Big Data in academic libraries.
As discussed in the literature review of the study, Big Data has emerged as a new,
developing and multidisciplinary field. In other words, the researchers used the
211
Implementation
of Big Data
analytics in
libraries
results to say that academic libraries should be able to conduct Big Data analysis. The
variables of the study are developed by the in-depth study of the literature review. First,
Table II presents the six variables related to the understanding and basic knowledge of
Big Data analytics. Three variables are compared to explore the extent to which the Big
Data analytics are used in academic libraries. The researchers found a strong correlation
among all the variables. Only two variables had an insignificant relationship. It shows
that librarians have the concept and understandings of Big Data analytics, but due
to the rapid growth and volume of data, they are not familiar with the different formats of
Big Data. The findings also indicate that librarians are well aware of the analytics
of Big Data that are used in academic libraries. The level of analytics is practiced
to some context in their libraries. In this perspective, to know the analytics level
of Big Data in librarianship, the study of Zhan and Widén (2017) referred to the
different competencies and skills of librarians for the practices and utilization of Big
Data analytics. It also highlighted the working environment of libraries that emphases on
the relevant practices of Big Data Vs.
This research also emphasizes on the analysis of correlation in the required skills and
competencies of librarians. In this context, the researchers analyzed the association between
data formats, acquisition, data management and organization, quality of data, metadata skills
and data culture within the organization to evaluate the last objective of our study to
determine the level of skills and competencies of LIS professionals toward the Big Data
Frm Acqstn Mangnt Intpro Quality Meta Curtn Cultr Presrvn Analysis Vislsn Ethics
Frm Acqstn Mangnt Intpro Quality Meta Curtn Cultr Presrvn Analysis Vislsn Ethics
Figure 2.
Scatter plot matrix for
correlation of
variables
212
DTA
53,2
analytics, see Table III. The researchers found a strong correlation among all the described
activities. The findings of Semeler et al. (2017) study indicated that in this digital era and Big
Data analytics, there should be provisions for librarians that they should perform the practices
in their libraries as data administrator, to preserve, to organize and to evaluate the data.
This study reveals that LIS professionals are working in academic libraries of Pakistan having
a strong correlation between their skills of data visualization depending on organizational
culture and keeping in view the quality of data. Because of the corporate environment, it is an
essential factor with the required competencies and skills and for the implementations of Big
Data analytics. The librarians are also well aware of the data policy and ethics for the
preservation, data analysis and visualization. With this, there is a need that librarians have to
recognize the Big Data technologies and develop their skills through participation in
accelerated courses and workshops. Extensive data analysis in the library also requires
programming language and coding. In this way the analytics of Big Data in the library can
significantly improve. These existing library services are developed by the needs of library
users to adjust the library need-based services.
Librarians were traditionally involved in the practices of acquiring, organizing,
retrieving, collecting, disseminating and hoarding of information in their libraries. Their
methods of this information organization have historically been in the form of scientific
exchanges of ideas, such as books and serials publications. In this digital environment, they
have now shifted their practices to more inclusive of all types of information, irrespective of
their form. This whole paradigm shift that has broadened an area not only focused on
primarily textual scientific publications, but also on the review of the digital footprints of
data provided by researchers and library users. However, there is a need to develop new
tools for libraries and process solutions for Big Data analysis. These developments in the
library services can adapt to the needs of library users and improve and collection
development policy. This study shows that Big Data flow is being dynamically changed
based on expectation or expected external and internal influences. The study lays the
foundation for the further development of dynamic skills of Big Data work, and based on the
strategic insights gained from the Big Data initiative, a method for the organization to
adjust and transform its capabilities is developed. Researchers of LIS professionals can
explore the causal relationship between the discovery of Big Data initiatives and changes in
the organizational requirements of libraries.
6. Conclusions
In this age of information society, academic institutions, public and private business
organizations and companies are generating a large amount of data. The capabilities of LIS
professionals based on Big Data analytics facilitate to ascertain emphases of the analytics in
academic libraries. Managing the vast and complex data is a significant challenge for
business and educational organizations. This study analyzed the awareness and
engagement of Pakistani librarians toward Big Data. The relationship between the
competencies and skills of academic librarians for the implementation of Big Data analytics
is also analyzed. The study concludes that librarians have the understanding about the
concept of Big Data analytics.
Moreover, they are primarily engaged in Big Data-related activities in their respective
libraries. The study also concludes that there exists a strong correlation between the
competencies and skills of librarians for the implementation of Big Data analytics in
libraries. The skills and competencies of librarians are fundamentally essential to offer
quality services in libraries. Emerging trends such as Big Data analytics are need of
the hour to survive in the information society. Therefore, librarians must understand
the importance and nature of Big Data technologies and develop their skills to cope
with it accordingly. They should engage themselves in Big Data-related activities.
213
Implementation
of Big Data
analytics in
libraries
Librarians also need to learn and share Big Data analytics-related expertise, techniques
with their peers. Awareness about the design and implementation of Big Data analytics
should be created among LIS professionals. In order to proper implement and practice the
Big Data analytics in libraries at large, librarians of Pakistani universities need to
conduct trainings and conferences on Big Data. The computer programming languages
and coding are also helpful for the implementation of Big Data analytics in libraries.
This would result in a lot of improvement for the practices of Big Data analytics in
libraries. Studies regarding challenges associated with handling of Big Data are need to be
conducted by prospective researchers.
References
Affelt, A. (2015), The Accidental Data Scientist: Big Data Applications and Opportunities for Librarians
and Information Professionals, Information Today, Medford, NJ.
Ahmad, K. (2017), The perspective of library and information science (LIS) professionals toward
knowledge management in university libraries,Journal of Information & Knowledge
Management, Vol. 16 No. 2, pp 1-11, available at: https://doi.org/10.1142/S0219649217500150
Atkinson, J. (2018), Reflections on Collaboration and Academic Libraries. Collaboration and the
Academic Library: Internal and External, Local and Regional, National and International,Elsevier,
available at: https://doi.org/10.1016/B978-0-08-102084-5.00020-1
Bedeley, R.T., Ghoshal, T., Iyer, L.S. and Bhadury, J. (2018), Business analytics and organizational
value chains: a relational mapping,Journal of Computer Information Systems, Vol. 58 No. 2,
pp. 151-161.
Braganza, A., Brooks, L., Nepelski, D., Ali, M. and Moro, R. (2017), Resource management in
Big Data initiatives: processes and dynamic capabilities,Journal of Business Research, Vol. 70,
pp. 328-337, available at: https://doi.org/10.1016/j.jbusres.2016.08.006
Burton, M., Lyon, L., Erdmann, C. and Tijerina, B. (2018), Shifting to data savvy: the future
of data science in libraries, available at: http://d-scholarship.pitt.edu/id/eprint/33891
(accessed March 11, 2019).
Cuzzocrea, A. (2014), Privacy and security of Big Data: current challenges and future research
perspectives,Proceedings of the First International Workshop on Privacy and Security
of Big Data, pp. 45-47.
De Mauro, A., Greco, M. and Grimaldi, M. (2015), What is Big Data? A consensual definition and a
review of key research topics,AIP Conference Proceedings, Vol. 1644, pp. 97-104.
DeVan, A. (2016), The 7 Vs of Big Data, available at: www.impactradius.com/blog/7-vs-big-data/
Djafri, L., Ammar Bensabeur, D. and Adjoudj, R. (2018), Big Data analytics for prediction:
parallel processing of the big learning base with the possibility of improving the final result of
the prediction,Information Discovery and Delivery, available at: https://doi.org/10.1108/
IDD-02-2018-0002
Fan, J., Han, F. and Liu, H. (2014), Challenges of Big Data analysis,National Science Review, Vol. 1
No. 2, pp. 293-314.
Federer, L. (2018), Defining data librarianship: a survey of competencies, skills, and training,Journal
of the Medical Library Association, Vol. 106 No. 3, pp. 294-303.
Fink, L., Yogev, N. and Even, A. (2017), Business intelligence and organizational learning: an
empirical investigation of value creation processes,Information and Management, Vol. 54
No. 1, pp. 38-56.
Garcia, T. and Wang, T. (2013), Analysis of Big Data technologies and method query large web
public RDF datasets on amazon cloud using hadoop and open source parsers,2013 IEEE
Seventh International Conference on Semantic Computing, pp. 244-251.
Goldberg, D., Olivares, M., Li, Z. and Klein, A.G. (2014), Maps & GIS data libraries in the era of Big
Data and cloud computing,Journal of Map & Geography Libraries, Vol. 10 No. 1, pp. 100-122.
214
DTA
53,2
Gordon-Murnane, L. (2012), Big Data: a big opportunity for librarians, available at: https://doi.org/
1039559884
Gorman, M. (2015), Our enduring values revisited: librarianship in an ever-changing world,
available at: http://books.google.com/books
Gunasekaran, A., Yusuf, Y.Y. and Adeleye, E.O. (2018), Agile manufacturing practices: the role
of Big Data and business analytics with multiple case studies,International Journal of
Production Research, Vol. 7543, pp. 1-13.
Hoy, M.B. (2014), Big Data: an introduction for librarians,Medical Reference Services Quarterly,
Vol. 33 No. 3, pp. 320-326.
Ilesanmi, T.C. (2013), Roles of the librarian in a research library in the digital era: challenges and the
way forward,New Review of Academic Librarianship, Vol. 19 No. 1, pp. 5-14, available at:
https://doi.org/10.1080/13614533.2012.740437
Jin, X., Wah, B.W., Cheng, X. and Wang, Y. (2015), Significance and challenges of Big Data
research,Big Data Research, Vol. 2 No. 2, pp. 59-64, available at: https://doi.org/10.1016/
j.bdr.2015.01.006
Johnson, V. (2017), Leveraging technical library expertise for Big Data management,Journal of
the Australian Library and Information Association, 0158, pp. 1-16, available at:
https://doi.org/10.1080/24750158.2017.1356982
Katal, A., Wazid, M. and Goudar, R.H. (2013), Big Data: issues, challenges, tools and Good practices,
2013 6th International Conference on Contemporary Computing, pp. 404-409.
Laney, D. (2001), Meta delta,Application Delivery Strategies, Vol. 949, February, p. 4, available at:
https://doi.org/10.1016/j.infsof.2008.09.005
Lee, C. and Kim, H. (2018), The evolutionary trajectory of an ICT ecosystem: a network analysis based
on media usersdata,Information and Management, Vol. 55 No. 6, pp. 795-805.
Liu, Y., Yang, L., Sun, J., Jiang, Y. and Wang, J. (2018), Collaborative matrix factorization mechanism
for group recommendation in Big Data-based library systems,Library Hi Tech, Vol. 36
No. 3, pp. 458-481.
Lu, N., Song, R., Heng, D., Gottipati, S., Tay, C.H.A., Zheng, Z. and Tay, A. (2017), Using data analytics
for discovering library resource insights case from Singapore management university,
Research Collection School of Information Systems, available at: http://ink.library.smu.edu.sg/
sis_research/3835
Manyika, J., Chui, M., Brad, B., Bughin, J., Dobbs, R., Roxburgh, C. and Hung Byers, A. (2011), Big data: the
next frontier for innovation, competition and productivity,McKinsey Global Institute,May,available
at: https://bigdatawg.nist.gov/pdf/MGI_big_data_full_report.pdf (accessed March 11, 2019).
Netto, M.A.S., Buyya, R., Bianchi, S., Assunção, M.D. and Calheiros, R.N. (2014), Big Data computing
and clouds: trends and future directions,Journal of Parallel and Distributed Computing,
Vols 79-80, pp. 3-15, available at: https://doi.org/10.1016/j.jpdc.2014.08.003
Noh, Y. (2015), Imagining library 4.0: creating a model for future libraries,Journal of Academic
Librarianship, Vol. 41 No. 6, pp. 786-797.
Oakleaf, M. (2016), Getting ready & getting started: academic librarian involvement in institutional
learning analytics initiatives,Journal of Academic Librarianship, Vol. 42 No. 4, pp. 472-475.
Read, K.B., Surkis, A., Larson, C., McCrillis, A., Graff, A., Nicholson, J. and Xu, J. (2015), Starting the
data conversation: informing data services at an academic health sciences library,Journal of the
Medical Library Association, Vol. 103 No. 3, pp. 131-1355, available at: https://doi.org/10.3163/
1536-5050.103.3.005
Schaich, M. (2018), Information professionals,Huguenot Networks, 15601780, pp. 75-91,
available at: https://doi.org/10.4324/9781315188959-6
Semeler, A.R., Pinto, A.L. and Rozados, H.B.F. (2017), Data science in data librarianship: core
competencies of a data librarian,Journal of Librarianship and Information Science,
096100061774246, available at: https://doi.org/10.1177/0961000617742465
215
Implementation
of Big Data
analytics in
libraries
Showers, B. (2014), Developing a shared analytics service for academic libraries,Insights: The UKSG
Journal, Vol. 27 No. 2, pp. 139-146.
Taylor-Sakyi, K. (2016), Big Data: understanding Big Data, available at: http://pdf/abs/1601.04602%
5Cnhttp://pdf/pdf/1601.04602
Tenopir, C., Sandusky, R.J., Allard, S. and Birch, B. (2014), Research data management services in
academic research libraries and perceptions of librarians,Library and Information Science
Research, Vol. 36 No. 2, pp. 84-90.
Thomas, C. and Urban, R. (2018), What do data librarians think of the MLIS? Professionals
perceptions of knowledge transfer, trends, and challenges,College & Research Libraries, Vol. 79
No. 3, p. 401, available at: https://doi.org/10.5860/crl.79.3.401
Triperina, E., Bardis, G., Sgouropoulou, C., Xydas, I., Terraz, O. and Miaoulis, G. (2018), Visual-aided
ontology-based ranking on multidimensional data: a case study in academia,Data Technologies
and Applications, Vol. 52 No. 3, pp. 366-383.
Ullah, M. and Anwar, M.A. (2013), Developing competencies for medical librarians in Pakistan,
Health Information and Libraries Journal, Vol. 30 No. 1, pp. 59-71.
Wang, C., Xu, S., Chen, L. and Chen, X. (2016), Exposing library data with Big Data technology:
a review,2016 IEEE/ACIS 15th International Conference on Computer and Information
Science, pp. 1-6.
Xie, Z. and Fox, E.A. (2017), Advancing library cyber infrastructure for Big Data sharing and reuse,
Information Services and Use, Vol. 37 No. 3, pp. 319-323.
Yaqoob, I., Hashem, I.A.T., Gani, A., Mokhtar, S., Ahmed, E., Anuar, N.B. and Vasilakos, A.V. (2016),
Big Data: from beginning to future,International Journal of Information Management, Vol. 36
No. 6, pp. 1231-1247.
Zhan, M. and Widén, G. (2017), Understanding Big Data in librarianship,Journal of Librarianship and
Information Science, 096100061774245, available at: https://doi.org/10.1177/0961000617742451
Zhuge, H. (2015), Mapping Big Data into knowledge space with cognitive cyber-infrastructure,
pp. 1-59, available at: http://pdf/ftp/arxiv/papers/1507/1507.06500.pdf
Corresponding author
Khurshid Ahmad can be contacted at: khurshid.abaloch@gmail.com
For instructions on how to order reprints of this article, please visit our website:
www.emeraldgrouppublishing.com/licensing/reprints.htm
Or contact us for further details: permissions@emeraldinsight.com
216
DTA
53,2
... They advocate for the use of open data standards and licenses, ensuring that datasets are easily discoverable, accessible, and usable by researchers. Moreover, libraries support researchers in managing and preserving their own data through data curation services, which include data cleaning, organization, and long-term preservation, ensuring the reproducibility and integrity of data analysis (Ahmad, JianMing, & Rafi, 2019a). In summary, libraries play an integral role in the world of Big Data Analytics by 1. ...
... These results highlight the interconnectedness of these variables and emphasize the need for comprehensive approaches to address challenges related to copyright, data privacy, security, fair use, software and hardware infrastructure, storage devices, data recovery, and data types. Ahmad et al. (2019a), Carried out a study on the competencies and skills of academic librarians in relation to the implementation of Big Data Analytics (BDA). The researchers employed Pearson Correlation to investigate the relationship between different aspects of academic librarians' skills and competencies. ...
... The positive correlation between Knowledge and Skills and Organizational challenges indicates that the presence of these challenges hampers the successful implementation of BDA in medical libraries (Ahmad et al., 2019a). The P-values of the correlation coefficients confirm the statistical significance of these relationships, further strengthening the validity of the findings (O'Kelly, Jeffryes, Hobscheid, & Passarelli, 2023). ...
Article
Full-text available
s Purpose-The primary aim of this study is to discern the challenges confronted by library professionals operating within medical institutions and their associated teaching hospitals in Pakistan during the implementation of Big Data Analytics (BDA). The study aims to conduct correlation analysis to establish the relationship between these challenges and various factors, including legal and ethical, technological challenges, knowledge and skills, as well as organizational challenges. Design/methodology-A Quantitative Survey design was utilized to gather data for this study. The study participants were library practitioners affiliated with medical institutes and associated teaching hospitals in Pakistan. The questionnaires were disseminated to respondents through various channels, 1 including personal visits, social media platforms, emails, and Google docs. A total of 256 responses were received from library professionals out of the 369 institutions targeted, which were subsequently utilized for analysis. The collected data underwent a process of data screening and filling in missing values before being subjected to correlation analysis using the 26th version of the Statistical Package for the Social Sciences (SPSS). Findings-The findings of the study indicated the correlations between 1) Technological, legal and ethical challenges with copyright law, personal data privacy, data security, fair use, software/ hardware inadequacy, storage limitation and data recovery; 2) Knowledge, skills and organizational challenges with data management, data retrieval, data cleaning, learning and utilization, funding, training and policy in medical libraries of Pakistan. Originality/value-The present research holds substantial importance as it addresses a significant gap in the existing literature concerning the utilization of BDA in these libraries and the accompanying challenges experienced by librarians of these institutes. By focusing on this topic, the study contributes to the existing knowledge and understanding of the subject matter.
... Sehar and Shahid (2020) revealed that the digital services rendered during the pandemic are online database journal, Telegram and WhatsApp to answer users query by the reference librarians. Ahmad et al. (2019) found that most the academic librarians in Pakistan were aware of big data analytics. In Zimbabwe, Chigwada and Kasiroori (2021) showed that librarians are aware of the big data concept but they are not utilizing the tools and techniques in data mining and analysis. ...
... More, it helps in organizing content, recommending systems, and personalizing experiences. Ahmad et al. (2019) highlight the usage of big data application in areas like organization, literacy, availability, and data privacy. Furthermore, the study discovered a strong correlation between big data applications abilities, metadata expertise, data ethics, data collection, organization, cleaning, and analysis, as well as digital curation, data clustering, data protection guidelines, and digital visualization. ...
Article
Full-text available
Purpose: Large volumes of complex and diverse data sources present a tremendous barrier for big data applications in digital library in terms of processing and extracting relevant insights. This study examined librarians’ perceptions of big data applications and management for digital library services in selected academic libraries in Kwara State. Methodology: The research design that was adopted in this study is the cross-sectional research design. The total population for this study is fifty-eight (58) library professionals. Owing to the small population of this study, total enumeration sampling technique was adopted for this study. Hence, the sample size for this study is fifty-eight (58). Questionnaire was used for data collection. Collected data were analysed using descriptive statistics. Findings: The study demonstrated that digital library services provided include virtual reference services, institutional repositories, and personalized content recommendation. Also, the librarians were aware of big data applications (BDAs) like Apache Hadoop and Python. It was showed that BDAs are used for resource transfer, borrowing management, user needs, usage tracking, and personalizing content recommendations. Results showed that open-source software and hardware, and training on handling diverse datasets are essential for the management of big data. Challenges associated with the use of big data applications include scalability limitations, complex data structures, lack of big data processing, power supply issues, and data privacy. Originality: This study provides unaddressed questions and gaps in the area of using big data applications for the provision of digital library services, especially in developing countries like Nigeria.
... Implementing these technologies can lead to better decision-making for DS planning and delivery, enabling personalized DSs. For several years, social scientists have debated the vast implications of BDA innovations on resources, services and research (Ahmad et al., 2019;Kitchin, 2014;Lazer and Radford, 2017). ...
... Our findings support the findings of previous studies that awareness has a statistically significant impact on the provision of DSs in academic libraries (Ahmad et al., 2019;Islam and Roknuzzaman, 2021). In our cohort, the majority of the librarians were fully aware of the concept of BD and its relevance in the context of libraries and information services. ...
Article
Purpose The purpose of this study is to determine the level of awareness among library and information science (LIS) professionals regarding the perceived utility of big data (BD) and data analytics (DA) in academic libraries, as well as their influence on the provision of data services (DSs). Design/methodology/approach A cross-sectional survey was carried out to collect the data for this study. The population of this study comprised LIS professionals working in public sector university libraries. A four-factor measurement model estimating the influence of BD and DA on the provision of DSs in academic libraries was tested using the structural equation modelling. Findings The findings revealed that awareness (AW) ( β = 0.141, CR = 2.534, p = 0.011) demonstrated a significant positive influence on the provision of DSs. The perceived utility of BD ( β = 0.058, CR = 0.582, p = 0.561), and perceived utility of DA ( β = 0.141, CR = 2.534, p = 0.905) exhibits a positive but statistically non-significant impact on the provision of DSs ( β = 0.010, CR = 0.120, p = 0.905). The goodness of fit indices suggest a favourable fit for the model, as evidenced by the following values: χ ² = 1.400, DF = 164; p = 0.001; IFI = 0.954; TLI = 0.946; CFI = 0.953; GFI = 0.906; and RMSEA = 0.043. Originality/value A new perspective on the use of BD and DA in academic libraries is presented in this study. It presents a four-factor measurement model on the influence of BD and DA on the provision of DSs in university libraries.
... Specifically, it recognises that such technologies may challenge traditional decisionmaking practices and values in librarianship, underscoring the need for further research and careful consideration of their broader implications. Further, research by Ahmad et al. (2019) demonstrates a strong connection between the competencies required for librarians to effectively use big data analytics in university libraries. Most of their participants demonstrated a clear understanding of big data analytics. ...
Article
Full-text available
This study explores the transformative impact of big data analytics on university libraries, examining its potential to disrupt traditional systems and services through the lens of Clayton Christensen's four-step framework. A qualitative case study approach was employed involving semi-structured interviews with 25 participants from five groups of Australian university libraries (independent universities and the members of Innovative Research Universities, the Regional Universities Network, the Group of Eight and the Australian Technology Network). The data collection occurred between 2018 and 2022. The findings confirm the disruptive potential of big data analytics in modern librarianship, enhancing decision-making and demonstrating libraries’ value. Its adoption is revolutionising librarians’ decision-making practices, signifying a major shift towards data-driven approaches. This study contributes to the field of library and information science, applying the theory of disruptive technologies to analyse the importance of big data analytics in librarianship decision-making. The study's recommendations lay a foundation for future research and provide actionable strategies for implementing big data analytics in university libraries to enable informed decision-making and fully harness its potential. Additionally, this study makes a significant contribution to the existing literature on analytics adoption, particularly in the context of emerging and disruptive innovations. The findings have far-reaching practical implications for Australian libraries, emphasising the need to adopt big data analytics tools to elevate decision-making processes and enhance user experiences.
... The literature suggested that the concept of BD has been studied in different aspects of the library information science (LIS) domain. For instance, some researchers studied BD in public libraries (Bertot et al., 2014;Kim and Cooke, 2017;Zhan and Widen, 2018), whereas others focussed on BD and academic libraries (Tella and Kadri, 2021;Ahmad et al., 2019;Al-Barashdi and Al-Karousi, 2018;Deng, 2017;Luo et al., 2013;Lu et al., 2016;Yi et al., 2014;Su et al., 2014). At the same time, multiple studies have highlighted the challenges associated with BD in libraries (Khan et al., 2014;Bhat, 2018;Hussain and Shahid, 2022;Sun and Ma, 2014). ...
Article
Purpose This study aims to measure scientific literature on the emerging research area of “big data” in the field of “library and information science” (LIS). Design/methodology/approach This study used the “bibliometric method” for data curation. Web of Science and altmetric.com were used. Data analysis and visualisation were done using three widely used powerful data analytics software, R-bibliometrix, VOSviewer and Statistical Package for Social Sciences. Findings This study revealed the most preferred venues for publication. Furthermore, this study highlighted an association between the Mendeley readers of publications and citations. Furthermore, it was evident that the overall altimetric attention score (AAS) does not influence the citation score of publications. Other fascinating findings were moderate collaboration patterns overall. Furthermore, the study highlighted that big data (BD) research output and scientific influence in the LIS sector are continually increasing. Practical implications Findings related to BD analytics in LIS techniques can serve as helpful information for researchers, practitioners and policymakers. Originality/value This study contributes to the current knowledge accumulation by its unique manner of blending the two approaches, bibliometrics and altmetrics.
Article
Full-text available
This study aims to assess the favourable ecosystem for supporting OS in Tanzania and the factors hindering its adoption and implementation. Employing a cross-sectional research design, the study evaluates the enabling environment for Open Science adoption and practices within Tanzania. The study population comprises academicians, researchers, students, librarians, and science communicators. A total of 144 participants took part in the survey, predominantly early career researchers affiliated with both public and private institutions. The findings underscore that to some extent several institutions within Tanzania provides some kind of support for open science practices. This support manifests through the provision of Institutional Repositories, online journal publishing systems, financial coverage for publishing processing charges, and the establishment of improved mechanisms for researchers to share their publications and data. Nonetheless, the results also highlight several challenges. These include a lack of awareness regarding OS practices, absence of institutional policies, inadequate budget allocation to cover article processing charges (APCs), and the high costs associated with publishing in prestigious Open Access (OA) journals.
Article
Purpose This study aims to evaluate how well a University of Technology Library’s infrastructure aligns with the requirements of the Fourth Industrial Revolution (4IR). By examining the current state of technological integration and identifying potential gaps, this research seeks to provide insights into the necessary developments and adaptations needed. Additionally, the study explores the implications of these changes for librarian roles, highlighting the evolving skill sets and competencies required in this new paradigm. Design/methodology/approach The study used a mixed methods research approach, integrating an online questionnaire to gather quantitative data and conducting interviews to obtain qualitative insights. The subsequent interviews offered a thorough understanding of the library’s alignment with the 4IR and its implications for the librarians at the library. Findings The findings of this research will offer valuable perspectives for university faculties, librarians and policymakers in shaping future-ready library environments that support innovative teaching, learning and research practices. Practical implications The findings highlight the need for academic libraries to continuously evolve with enhancements to existing infrastructure to incorporate advancing technologies and encompass modern digital platforms, tools and resources to support innovative teaching, learning, and research works. Originality/value This study provides insight into the results of the importance of evolving with the advances of technologies in academic institutions within a developing country in Africa.
Article
This study investigates the impact of big data on university libraries in Bangladesh, focusing on areas such as resource accessibility, decision-making, user experiences, academic research and information management. By conducting interviews with 21 library professionals and employing thematic analysis, the study reveals critical insights. Big data significantly enhances resource accessibility by enabling customized collections and supports user-centric decision-making in library operations. It also improves user experiences through personalized recommendations and seamless navigation. Moreover, big data's analysis of citation patterns and collaborative networks bolsters academic research while addressing information overload and optimizing information retrieval processes. This research provides a thorough examination of big data's practical implications in university libraries, offering valuable insights for professionals, researchers and policymakers. It emphasizes the necessity of a proactive approach to leveraging big data's transformative potential in knowledge management, thereby shaping the future of academic libraries.
Article
Full-text available
Purpose The rapid evolution of technological infrastructure and analytical capabilities has facilitated the integration of big data analytics (BDA) across various sectors. This study aims to investigate the suitability of implementing BDA within academic libraries, addressing the demanding need for effective data utilization in contemporary educational environments. Design/methodology/approach The research is grounded in five critical components: data-driven culture, organizational infrastructure, employee responsibilities, management capabilities and the successful deployment of technology for BDA. An extensive literature review led to the development of a Likert scale-based questionnaire distributed on social media to collect data from university librarians in Pakistan. The authors were able to collect the data from 211 librarians. Descriptive statistics were employed to analyze the variables, while confirmatory factor analysis (CFA) was conducted using the AMOS to validate the research model. Findings The findings from the measurement model reveal significant positive correlations among all five components, underscoring their collective importance in facilitating the implementation of BDA. This formation is essential for addressing the evolving needs and academic requirements of users in the context of big data in a digital environment. Research limitations/implications The study acknowledges limitations about its focus on a single country’s perspective, which may affect the generalizability of the findings regarding the implementation process of BDA in academic libraries. Originality/value This research contributes to the existing body of knowledge by highlighting the practices and capabilities of librarians in the era of big data as well as the requisite organizational infrastructure for the effective implementation of analytics in academic libraries.
Chapter
In this chapter explores how libraries leverage cutting-edge technologies to enhance their services, accessibility, and user engagement. It begins by highlighting technologies like Artificial Intelligence (AI), Virtual Reality (VR), and big data analytics that are redefining traditional library functions. AI improves collection management and user services, VR creates immersive learning environments, and big data analytics allows for data-driven personalization. The chapter underscores the benefits of emerging technologies, including improved efficiency and resource management, but emphasizes the importance of balancing innovation with core library values like privacy, intellectual freedom, and equitable access. It addresses challenges like data privacy concerns, algorithmic bias, the digital divide, and technical infrastructure requirements, advocating for strategic approaches to mitigate these issues. Further sections dive into the specific applications of AI and Machine Learning in enhancing cataloging, metadata management, and User Experience (UX). At the same time, the Internet of Things (IoT) is shown to streamline inventory management and resource tracking. Blockchain technology is also explored for its potential to revolutionize digital rights management, data privacy, and collaboration. Big data analytics is discussed as a tool for understanding user behavior, optimizing services, and driving data-driven decision-making. Additionally, immersive technologies such as AR, VR, and Mixed Reality (MR) are described as innovative tools that can transform user experiences and learning opportunities. Ultimately, the chapter provides a comprehensive look at how emerging technologies are transforming libraries into dynamic hubs for learning, innovation, and community engagement while urging a balance between innovation and core library values.
Article
Full-text available
Objectives: Many librarians are taking on new roles in research data services. However, the emerging field of data librarianship, including specific roles and competencies, has not been clearly established. This study aims to better define data librarianship by exploring the skills and knowledge that data librarians utilize and the training that they need to succeed. Methods: Librarians who do data-related work were surveyed about their work and educational backgrounds and asked to rate the relevance of a set of data-related skills and knowledge to their work. Results: Respondents considered a broad range of skills and knowledge important to their work, especially "soft skills" and personal characteristics, like communication skills and the ability to develop relationships with researchers. Traditional library skills like cataloging and collection development were considered less important. A cluster analysis of the responses revealed two types of data librarians: data generalists, who tend to provide data services across a variety of fields, and subject specialists, who tend to provide more specialized services to a distinct discipline. Discussion: The findings of this study suggest that data librarians provide a broad range of services to their users and, therefore, need a variety of skills and expertise. Libraries hiring a data librarian may wish to consider whether their communities will be best served by a data generalist or a subject specialist and write their job postings accordingly. These findings also have implications for library schools, which could consider adjusting their curricula to better prepare their students for data librarian roles.
Article
Full-text available
Big data has been widely discussed. The diverse impacts and potential of big data have been pinpointed and empirically proven.Nevertheless, there is no consensus on the understanding of big data. Big data has been used to refer to different things and its characteristics are not universally accepted either. Therefore, this study aims to generate an overall understanding of big data. The domain of the study is limited to librarianship, because of its unique position in managing and utilising big data. Thus, the aim of this study is to understand big data in librarianship according to how it is defined in that profession. Articles containing definitions of big data were reviewed and 35 definitions were collected. Since the number of analysed definitions is small, a combined method was employed. Both a content analysis and a statistical description of these definitions were conducted. Five aspects are summarised based on the analysis of the collected definitions. These aspects help explicate libraries’ current understanding of big data and librarians’ big data skills.
Article
Purpose This paper aims to solve the problems of big data analytics for prediction including volume, veracity and velocity by improving the prediction result to an acceptable level and in the shortest possible time. Design/methodology/approach This paper is divided into two parts. The first one is to improve the result of the prediction. In this part, two ideas are proposed: the double pruning enhanced random forest algorithm and extracting a shared learning base from the stratified random sampling method to obtain a representative learning base of all original data. The second part proposes to design a distributed architecture supported by new technologies solutions, which in turn works in a coherent and efficient way with the sampling strategy under the supervision of the Map-Reduce algorithm. Findings The representative learning base obtained by the integration of two learning bases, the partial base and the shared base, presents an excellent representation of the original data set and gives very good results of the Big Data predictive analytics. Furthermore, these results were supported by the improved random forests supervised learning method, which played a key role in this context. Originality/value All companies are concerned, especially those with large amounts of information and want to screen them to improve their knowledge for the customer and optimize their campaigns.
Article
Purpose The purpose of this paper is to introduce a novel framework for visual-aided ontology-based multidimensional ranking and to demonstrate a case study in the academic domain. Design/methodology/approach The paper presents a method for adapting semantic web technologies on multiple criteria decision-making algorithms to endow to them dynamic characteristics. It also showcases the enhancement of the decision-making process by visual analytics. Findings The semantic enhanced ranking method enables the reproducibility and transparency of ranking results, while the visual representation of this information further benefits decision makers into making well-informed and insightful deductions about the problem. Research limitations/implications This approach is suitable for application domains that are ranked on the basis of multiple criteria. Originality/value The discussed approach provides a dynamic ranking methodology, instead of focusing only on one application field, or one multiple criteria decision-making method. It proposes a framework that allows integration of multidimensional, domain-specific information and produces complex ranking results in both textual and visual form.
Article
There are existing studies on data curation programs in library science education and studies on data services in libraries. However, there is not much insight into how educational programs have prepared data professionals for practice. This study asked 105 practicing professionals how well they thought their education prepared them for professional experience. It also asked supervisors about their perceptions of how well employees performed. After analyzing the results, the investigators of this study found that changing the educational model may lead to improvements in future library data services.
Article
We integrate users into the visualization and analysis of an information and communication technology (ICT) ecosystem by using demand-side data. We also broaden the ecosystem layer model by using a media repertoire concept and propose a clear method of showing evolutionary trends. Consequently, we discover the nature of an evolutionary path in an ICT ecosystem. This path becomes more rigid and centralized as it matures, a finding that agrees with prior studies’ results that used supply-side data. Further, we analyze the trends of firms changing layer positions and suggest multiple approaches of visualizing and analyzing interfirm relationships for practitioners.
Chapter
The book includes an overview and literature review of collaboration and academic libraries and a number of case studies on collaborative initiatives involving academic libraries, both internally within the university and externally. This concluding chapter presents an analysis of the various chapters and case studies which make up the book in which the key themes, observations, and lessons learnt are identified and presented under a number of headings: context and drivers; benefits; constraints and barriers; collaborative activities; participant attributes. This provides a useful checklist for those planning or implementing collaborative initiatives. © 2018 Jeremy Atkinson Published by Elsevier Ltd. All rights reserved.
Article
Purpose Academic groups are designed specifically for researchers. A group recommendation procedure is essential to support scholars’ research-based social activities. However, group recommendation methods are rarely applied in online libraries and they often suffer from scalability problem in big data context. The purpose of this paper is to facilitate academic group activities in big data-based library systems by recommending satisfying articles for academic groups. Design/methodology/approach The authors propose a collaborative matrix factorization (CoMF) mechanism and implement paralleled CoMF under Hadoop framework. Its rationale is collaboratively decomposing researcher-article interaction matrix and group-article interaction matrix. Furthermore, three extended models of CoMF are proposed. Findings Empirical studies on CiteULike data set demonstrate that CoMF and three variants outperform baseline algorithms in terms of accuracy and robustness. The scalability evaluation of paralleled CoMF shows its potential value in scholarly big data environment. Research limitations/implications The proposed methods fill the gap of group-article recommendation in online libraries domain. The proposed methods have enriched the group recommendation methods by considering the interaction effects between groups and members. The proposed methods are the first attempt to implement group recommendation methods in big data contexts. Practical implications The proposed methods can improve group activity effectiveness and information shareability in academic groups, which are beneficial to membership retention and enhance the service quality of online library systems. Furthermore, the proposed methods are applicable to big data contexts and make library system services more efficient. Social implications The proposed methods have potential value to improve scientific collaboration and research innovation. Originality/value The proposed CoMF method is a novel group recommendation method based on the collaboratively decomposition of researcher-article matrix and group-article matrix. The process indirectly reflects the interaction between groups and members, which accords with actual library environments and provides an interpretable recommendation result.
Article
Currently, data are stored in an always-on condition, and can be globally accessed at any point, by any user. Data librarianship has its origins in the social sciences. In particular, the creation of data services and data archives, in the United Kingdom (Data Archives Services) and in the United States and Canada (Data Library Services), is a key factor for the emergence of data librarianship. The focus of data librarianship nowadays is on the creation of new library services. Data librarians are concerned with the proposition of services for data management and curation in academic libraries and other research organizations. The purpose of this paper is to understand how the complexity of the data can serve as the basis for identifying the technical skills required by data librarians. This essay is systematically divided, first introducing the concepts of data and research data in data librarianship, followed by an overview of data science as a theory, method, and technology to assess data. Next, the identification of the competencies and skills required by data scientists and data librarians are discussed. Our final remarks highlight that data librarians should understand that the complexity and novelty associated with data science praxis. Data science provides new methods and practices for data librarianship. A data librarian need not become a programmer, statistician, or database manager, but should be interested in learning about the languages and programming logic of computers, databases, and information retrieval tools. We believe that numerous kinds of scientific data research provide opportunities for a data librarian to engage with data science.