Conference PaperPDF Available

Exploring study profiles of Computer Science students with Social Network Analysis

Authors:

Abstract and Figures

Information technology is widely adapted in all levels of education. The extensive information resources facilitate enhanced human capacity and the social environment to support learning. In particular, Social Network Analysis (SNA) has been broadly used in teaching and learning practices. In this paper, we perform community detection analysis to identify the learning behavior profiles of undergraduate computer science students in a Nordic university. The social network was created using 273 responses to an online survey. The students themselves provided their social connections at the university, and node attributes were created based on responses to questions regarding Educational Values, Goals Orientation, Self-efficacy, and the university teaching methods. We analyze the biggest communities to identify the factors that characterize the learning strategy and preferences of undergraduate computer science students.
Content may be subject to copyright.
Exploring study profiles of Computer Science students with Social Network
Analysis
Nidia Guadalupe L´
opez Flores
Department of Computer Science,
Reykjav´
ık University, Iceland
nidia20@ru.is
Anna Sigridur Islind
Department of Computer Science,
Reykjav´
ık University, Iceland
islind@ru.is
Mar´
ıa ´
Oskarsd´
ottir
Department of Computer Science,
Reykjav´
ık University, Iceland
mariaoskars@ru.is
Abstract
Information technology is widely adapted in all
levels of education. The extensive information resources
facilitate enhanced human capacity and the social
environment to support learning. In particular, Social
Network Analysis (SNA) has been broadly used in
teaching and learning practices. In this paper, we
perform community detection analysis to identify the
learning behavior profiles of undergraduate computer
science students in a Nordic university. The social
network was created using 273 responses to an online
survey. The students themselves provided their social
connections at the university, and node attributes were
created based on responses to questions regarding
Educational Values, Goals Orientation, Self-efficacy,
and the university teaching methods. We analyze
the biggest communities to identify the factors that
characterize the learning strategy and preferences of
undergraduate computer science students.
1. Introduction
Information technology has become an essential tool
of education. It is rich in information resources and can
extend human capacity and the social environment to
support learning. As part of its rapid growth, Social
Network Analysis (SNA) has been broadly used in
teaching and learning practices [1]. Recent literature
suggests that the potential of Learning Analytics and
Educational Data Mining offer benefits through the
use of educational data both for teachers and students
to further understand the way students approach their
learning [2]. With the increase in online learning
brought on by the COVID-19 pandemic, there is
a greater need for understanding students’ social
structures in relation to study preferences and motivation
so that universities can better accommodate the needs of
more students, especially of underrepresented students
[3].
This paper analyzes and describes the study profile
of undergraduate students of four Computer Science
related programs through Social Network Analysis and
community detection. In particular, we are interested
in knowing their learning preferences regarding group
working, physical attendance of lectures, self-efficacy
perception and goal orientation. We aim to answer
the research question: Can Community Detection
algorithms applied to the social network of students
identify undergraduate study profiles at a Computer
Science Department? To answer the question, we
analyzed the social connections of undergraduate
students and their study preferences to outline their
study profiles. The social network is built based on
an online survey created and distributed in 2019 to 717
undergraduate computer science students to investigate
their learning patterns and behaviors. The students’
social network is assortative and has a high clustering,
common features in social networks. We discover
five communities of students, where each of them is
characterized by a different study profile.
The rest of this paper is organized as follows. In
the next section, we discuss related research on learning
analytics, educational data mining, social network
analysis and study profiles. In Section 3 we present
the methodology used in this research followed by
the results in Section 4. The paper concludes with a
discussion on the implications and limitations of our
work and directions for future work.
2. Related work
2.1. Learning Analytics and Educational Data
Mining
Learning Analytics (LA) and Educational Data
Mining (EDM) have emerged as impactful research
fields that draw on educational data in the last decades
[4]. LA is defined as the “measurement, collection,
analysis and reporting of data about learners and
their contexts, for purposes of understanding and
optimizing learning and the environments in which it
occurs” [4]. The implementation of LA strategies
has been highlighted among the priorities of higher
education institutions [5]. However, EDM is defined
as “an emerging discipline, concerned with developing
methods for exploring the unique types of data that come
from educational settings, and using those methods
to better understand students, and the settings which
they learn in” [6], it employs data mining theories
and techniques to analyze educational data. Both
LA and EDM aim to improve and create methods
that enhance education at all levels. They revolve
around personalization, adaptive learning, predictive
analysis and user behavior profiling [7, 8]. Furthermore,
LA and EDM methods had been widely applied
to address a large set of concerns, e.g., predicting
students’ performance, retention analytics, intelligent
feedback provision and course recommendation [9].
Notwithstanding, limited research on learning styles
like the personalization of learning, learning style
identification and its application in teaching, learner
motivation, and student profiling has been carried out
[6].
Among the data mining techniques used in EDM,
prediction methods, like classification and regression,
and structure discovery methods, like clustering and
factor analysis, are the most commonly used. Most
recent research in EDM had been focused on the use
of two or more methods [10]. Usual clustering and
classification problems in EDM and LA can be extended
to Social Network Analysis [11, 12]. However, Social
Network Analysis has been used less frequently to
examine educational data [10].
2.2. Network Science in educational context
Among SNA applications in the educational context,
recent research has been focused on a wide range of
areas of interest. Homophily is a fundamental property
of social networks; it establishes that people with similar
properties are more likely to connect [13]. Nguyen
et al. [14] analyzed homophily regarding gender,
ethnic minority identity, family income, and academic
performance using WIFI log data. Their studies
confirmed homophily concerning demographics and
academic performance and showed that gender-based
homophily increases over time. Shirvani et al. [15]
applied SNA modeling techniques to analyze the social
dimension and learners’ roles on MOOCs discussion
forums and their changes over time. Their research
found that activity level can be predicted one week in
advance based on the course structure, forum activity
and properties of the communication network.
Community detection is one of the most significant
problems in Social Network Analysis; the analysis
of closely linked social groups is one of the
comprehensible methods of describing social structures
[11]. In higher education, community detection
algorithms have been successfully implemented
to address varied topics about learning processes.
Sturlud´
ottir et al. [16] identified fields of interest
in the courses offered in undergraduate education
programs. Xu et al. [17] analyzed discussion forum
data of MOOCs courses to gain insights on the creation
of social structures and how they change over time.
Finally, Yassine et al. [18] used community detection
algorithms to study users’ engagement patterns on
online learning networks.
2.3. Learning style, study patterns, and study
profile
Learning style theories are used in an educational
context to improve learners’ learning strengths and
instructors’ teaching abilities. EDM methods had been
used to investigate learning styles [6]. Ahmad and Tasir
[19] used log files of online learning activities to analyze
the behavior patterns of engineering students; they
concluded that the course structure, students’ previous
experience, and subject influence the thresholds defined
for learning style identification. Costaguta and Menini
[20] studied the relationship between learning style and
performance to improve group creation. More recently,
Shobbrook et al. [21] implemented elements of EDM to
validate the Fedler and Silverman’s Index of Learning
Styles (ILS) developed for engineering education. In
their research, no correlation supporting the validity of
the ILS was found, except for Lecture attendance. The
research about learning styles had been controversial
due to the limitations in measuring and determining the
learners’ learning styles individually [22].
EDM methods are also applied to investigate study
patterns in varied contexts. Shirvani et al. [23]
research presented a data-driven approach to identify
and trace study patterns in an unsupervised manner
and a hypothesis-driven approach to extract predefined
patterns from learners’ interactions. Casey and Azcona
[24] used the student activity pattern for early detection
of poor performers and to identify topics that the
students found less interesting or more difficult to
understand. Regarding using SNA to investigate study
patterns, Leeet al. [25] analyzed click-stream data using
hierarchical clustering analysis to identify behavior
patterns concerning the use of a video discussion
platform. They analyzed the transition pattern between
consecutive activities in a video discussion platform.
Considering the difficulties related to the individual
identification of learning styles, this paper applies
Social Network Analysis and community detection
methods to analyze the undergraduate students’ study
profiles. For our purposes, we define the study profiles
through a set of attributes related to educational values,
goals, self-efficacy perception, and teaching methods
preferences. Our approach relies on homophily to
explore the characterization of the study profiles by
analyzing the structure of the student community instead
of focusing on the individual characteristics of each
student.
3. Methods
3.1. Dataset
The data in this study were collected from
an online survey distributed to 717 undergraduate
students enrolled in the four computer science bachelor
programs at Reykjav´
ık University; BSc Computer
Science, BSc Computer Science research-based, BSc
Software Engineering, and BSc Discrete Mathematics
and Computer Science. The survey was created
and distributed in 2019, before the pandemic. It
included 42 questions related to Institutional Support,
Educational Values, Goals, Self-efficacy and Academic
apathy, based on a students’ readiness survey, the
Academic Readiness Questionnaire [26]. The survey
was initially distributed to study and understand
undergraduate students’ learning patterns and behaviors.
The Cronbach’s alpha coefficient of this questionnaire in
this sample is 0.70, indicating good internal consistency
reliability [27].
Among the 42 questions included in the survey, the
first two questions asked for age and gender. Questions
three to five were related to Institutional support; the
students were asked about the amount of information
they had regarding the university, their degree program
and their career possibilities. In questions six to eight,
the students’ Educational values were evaluated; to
measure the priority degree assigned to the university
studies and the grades obtained. The following ten
questions were Goal-related; in this section, the students
were asked about their drivers for goal definition,
organization, learning behavior, methodological
preferences, and long-term expectations. The following
twelve questions addressed the student’s Self-efficacy
perception; expected performance, skills and abilities,
self-motivation, confidence and capacity of adaptation
were addressed in those questions. In the next section,
Academic apathy was measured in four questions
asking for the student’s effort, work avoidance, and
scheduling level when planning their study sessions.
The following seven questions were related to the
university teaching methods, their preferences about
attendance to lectures and practical sessions, as well
as honors achievements. The last three questions were
about the students’ willingness to work in groups and
their social network size. Except for age, gender and
honors achievement, the questions’ answers were on a 5
point Likert scale. Additionally, the students were asked
to provide the list of students they most communicate
with at the university; the maximum length of the
list was 10. The survey’s response rate was 38%,
with 273 students answering it. Nonetheless, among
those responses, only 218 students provided a list of
connections at the university.
In the light of the COVID-19 pandemic and the
sudden change to Emergency Remote Teaching (ERT)
[28], and in line with recent research developed to
analyze and understand its impact on teaching and
learning processes, we decided to use the data obtained
from the survey previously implemented to analyze
the preferred study style of Computer Science students
before the pandemic, to understand the impact of ERT
in the undergraduate student community. Questions
in Table 1 were selected to perform pre-pandemic
study profile identification; these questions relate to
the self-perception of motivation, adaptation to different
teaching styles, preference to attend or not to lectures,
and their willingness to work in groups instead of
working alone. The list of students with whom the
respondent communicated at the university was used
to build a social network of students by creating a
link between the respondent and everyone that they
named. The data pre-processing was performed in R
and R-studio, while the Social Network Analysis and
community detection were performed in Python with
NetworkX.
3.2. The Girvan-Newman Algorithm
The Girvan-Newman algorithm was used to identify
the communities in the student network. This algorithm
successively removes the edges with the highest
betweenness as those edges tend to connect different
clusters [11]. Betweenness is a centrality measure
helpful to identify the most influential people in a social
network. To calculate it, the times a node (edge) is
crossed by the shortest path between any other pair
of nodes in the network are quantified. The higher
the betweenness coefficient, the more essential the
node (edge) is to connect with the rest of the network
[29]. The Girvan-Newman algorithm returns a set of
partitions where each of them represents the clusters
identified from the connected components after each
edge is removed. As there is no natural benchmark
Table 1. Questions selected to perform study profile identification
Topic Question/Statement
Individual Background Age in years
Individual Background Gender
Educational Values Getting good grades is important to me
Goals I’m a very methodical person
Self-Efficacy I can easily adapt to different styles of teaching
Self-Efficacy I can motivate myself to study when I need to
University teaching methods I like the way of teaching (the methods) used at the university
University teaching methods I do not usually attend lectures at the university
University teaching methods I watch the lectures online, on Echo360 in Canvas, rather than attend class
University teaching methods I always attend problem solving classes
Social Networks I prefer to work in groups (arranged by the teacher), rather than work on my own
Social Networks I prefer to work in groups (chosen by students), rather than work on my own
for the identified clusters, each partition returned by the
algorithm was evaluated in its modularity to select the
partition that maximizes it. The modularity coefficient
compares the edges among nodes in the same cluster and
the edges among nodes belonging to different clusters
[13].
4. Results
4.1. Network Description
Nodes in the network represent a student who
either answered the survey or was mentioned by
someone who did. Directed edges were created from
the student (source) who mentions another student
(target). Questions’ answer values were included as
node attributes in the network. The final network
displayed in Figure 1 includes 615 nodes with 806
edges. About 22% of the students who answered the
survey did not provide a list of friends at the university.
Those students, 59 in total, are included in the network
as singletons representing 9.5% of the total nodes in
the network. There are several reasons explaining the
singletons: (1) the students do not have connections in
the university, (2) the students do not feel comfortable
sharing information about their connections, or (3) as
the survey was not mandatory, the student skipped the
last part of the questionnaire.
The density and the clustering coefficient are
measures commonly used to describe the structure of
a network. The density is defined as the fraction
of connected nodes among all the possible pairs in
the network. With a maximum value of 1, the
higher the value, the more connected the network is
[13]. The average clustering coefficient measures, on
average, the extent to which the neighbors of each
node in the network link to each other [30]. Both
measures are helpful to outline characteristics of the
Figure 1. Friendship Network of undergraduate
students.
network, such as its completeness and connectedness.
The friendship network constructed has a density of
0.00213, and its average clustering coefficient is 0.1213.
Networks with low density are told to be sparse;
real-world networks are commonly characterized by
sparsity [13]. Real-world networks with comparable
densities coefficients were found in yeast protein
interactions (2,277 links and a density of 0.001)
and US air transportation data (18, 617 links and a
density of 0.004) [13]. An assortative network is
defined by [31] as networks with ”a preference for
high-degree vertices to attach to other high-degree
vertices”. The assortativity coefficient of a network
is calculated as the correlation among the degrees
of each pair of nodes in the network. Networks
with positive coefficients are known as assortative,
whereas negative values lead to disassortative networks
[13]. With an assortativity coefficient of 0.24, we can
say this network is assortative; students with many
connections tend to frequent other students with a high
number of connections. The network has 94 connected
components, the biggest with 358 nodes, while the
smaller ones are singletons.
Figure 2. In-degree distribution.
Figure 3. Out-degree distribution.
Figures 2 and 3 display the in-degree and out-degree
distribution of the nodes in the network. The in-degree
value represents the number of times the student appears
in others’ list of connections, whereas the out-degree
is the number of friends or connections declared by
each student. The in-degree distribution is right-tailed;
most students have a low in-degree (are mentioned
by fewer people) than the maximum in-degree of the
network who has an in-degree of 8. For the out-degree
distribution, most of the nodes have an out-degree of
zero. The reason for that is, among the 615 nodes
in the network: (1) only 273 answered the survey,
those nodes that were mentioned by someone but did
not answer the survey will have out-degree zero; (2)
among those who answered, 59 were singletons with
no connections. Figure 4 displays the distribution when
the zero out-degree nodes are not considered. It is
remarkable the number of nodes with a high out-degree.
Figure 4. Out-degree distribution for degrees greater
than zero.
4.2. Community Detection
The directed network was transformed into an
undirected network before applying the community
detection algorithms. With this change, the final
amount of edges decreased to 739, and the average
degree of the nodes is 2.40. Figure 5 displays the
modularity coefficient for each of the partitions returned
by the Girvan-Newman algorithm, the partition with
the highest modularity, 0.89, has 110 communities.
The number of communities is high due to the
59 singletons in the network; each singleton is
a single community. We analyze the attributes
of the five largest communities and present their
preferred study profile. The communities were named
‘Star pupils’, ‘Independent students’, ‘Team players’,
‘Female power’, and ‘Versatile students’. In those
communities, 162 students are included. Figure 6
displays them colored by the community they belong
to. ‘Star pupils’ is the only community disconnected
from the others. In the following subsections, the
study profile that characterizes the communities will be
outlined based on the distribution of the answers to the
attributes in the Academic Readiness Questionnaire [26]
in Table 1. Figures 7 and 8 display the average and
median of the responses to each question by converting
the Likert scale into: Strongly Agree=5, Agree=4,
Neutral=3, Disagree=2,and Strongly Disagree=1.
4.2.1. Community No. 1: ‘Star pupils’. The first
community identified includes 35 students. Among
them, 69% are males, 20% female and 11% unknown.
The response rate of the students in this community is
37%.The study profile is characterized by:
Its members are mostly younger than 22 years.
Figure 5. Modularity coefficient in each step of the
Girvan-Newman algorithm.
Figure 6. The five biggest communities identified
using the Girvan-Newman algorithm.
They declare that getting good grades is important
to them.
They are highly methodical, and they can highly
adapt to different styles of teaching.
They also declare being always able to motivate
themselves when needed.
These students also say they do like the teaching
methods at the university.
They always attend lectures, and they prefer to
attend rather than to watch recordings. They
always attend practical classes.
These students prefer to work alone rather than
working in groups arranged by the teacher.
4.2.2. Community No. 2: ‘Independent Students’.
This community includes 32 students. The gender
distribution is quite different from the first community
analyzed; 56% males, 38% females, and 6% unknown.
In this community, the response rate was 38%. The
study profile of this community is featured by:
Most of the students are around 25 years old, but
the ages range from 23 to 38.
Getting good grades is important, but they are not
very methodical.
They declare they could adapt to different
teaching styles, but they mostly like the teaching
methods at the university.
What makes this community special is that they
do prefer to watch the lecture recordings.
Finally, this community prefers to work in groups
chosen by themselves rather than working alone.
4.2.3. Community No. 3: ‘Team players’. This
community consists of 38 students. A 63% of them are
males, 20% females, and 8% unknown. Similar to the
previous communities, its response rate is 40%. Among
its features is possible to identify:
The students are around 23 years old, with ages
from 20 to 26 years.
Getting good grades is important, and they
declared themselves to some extent methodical.
They agree they can adapt to different teaching
methods, and they mostly like those used at the
university.
The students usually attend lectures at the
university and also problem-solving classes.
In contrast to the other communities, these
students do prefer to work in groups chosen by
themselves rather than working alone.
4.2.4. Community No. 4: ‘Female Power’. This
community has 29 students. It is the only community
with a higher percentage of females, 55%, whereas
34% are males and 10% unknown. Additionally,
this community has the lowest response rate among
the five communities analyzed, 24%. The features
characterizing this community are:
• Their age goes from 20 up to 30 years with a
uniform distribution.
In this community, getting good grades is
essential.
They declare they could adapt to different
teaching methods.
In addition, they could prefer to work alone rather
than in groups chosen by the teacher, but also
prefer to work in groups rather than alone.
Figure 7. Average responses of the communities based on the Likert scale: Strongly Agree (5), Agree (4),
Neutral (3), Disagree (2), Strongly Disagree (1).
4.2.5. Community No. 5: ‘Versatile Students’.
Twenty-eight students are allocated to this community.
It is the community with the highest percentage of
males, 82%, while 7% are females, and 11%unknown.
The response rate was 42%. This community shares
most of its features with the previous communities
presented:
Its members are around 27 years old.
Grades are important, and they declare to be able
to adapt to different styles of teaching.
They declare themselves to be able to motivate
themselves when needed and mostly like the
teaching methods at the university.
These students attend lectures almost always.
• Regarding group work, they prefer to work in
groups chosen by themselves, or alone if the
groups are chosen by the teacher.
4.2.6. Singletons. There are 59 singletons in the
network, most of them have less than 30 years. The
gender distribution is similar to the distribution of the
whole network; almost 65% are males. Among their
features; getting good grades is important to them, but
there is no evident definition of being methodical when
studying; they could adapt to different styles of teaching,
motivate themselves when needed to, and they like the
teaching methods at the university; regarding attendance
to lectures, the distribution of the answers is uniform
among the statements, but most of them agree on
prefer watching the lecture recordings instead; finally,
regarding group working, there is no clear definition
when the teacher chooses the groups, but they tend to
slightly prefer to work alone rather than groups chosen
by the students themselves.
4.3. Statistical comparison of attributes’
distribution
Differences among the distribution of the
communities’ attributes were evaluated using the
Kruskal-Wallis test. Only four attributes showed
differences in their distributions considered statistically
significant; (i) Question 4 in Table 1: Being methodical,
(ii) Question 5 in Table 1: Adaptation to teaching
styles, (iii) Question 8 in Table 1: Lecture attendance,
and (iv) Question 9 in Table 1: Watching recorded
lectures. Among those attributes, the community
‘Star pupils’ has significant differences in Being
methodical for ‘Independent students’, ‘Versatile
students’, and ‘Female power’ communities. Regarding
Lecture attendance and Watching recorded lectures, a
significant difference was found between ‘Star pupils’
and ‘Independent students’. In contrast, for Adaptation
to teaching styles, a significant distribution difference
was found between ‘Star pupils’ and ‘Female power’.
5. Discussion & Conclusion
This study presents five different study profiles
among the undergraduate students in the Computer
Science department at Reykjav´
ık University. The data
were gathered from an online survey distributed in 2019
as an initial approach to understanding undergraduate
Figure 8. Median of the communities’ responses based on the Likert scale: Strongly Agree (5), Agree (4),
Neutral (3), Disagree (2), Strongly Disagree (1).
students’ learning patterns and behaviors. Relevant
features of the student network in this university before
the pandemic are; students with many connections
tend to interact with other students who also have
many connections, but also, the network has a high
percentage of singletons, students without connections.
As the second step in this analysis, the Girvan-Newman
algorithm was used to identify the communities; the
optimal partition was selected based on its modularity.
The largest communities were analyzed to identify the
features and learning preferences that characterize the
study profile of its members. In the third section,
we identified five communities with an evident and
particular profile. (1) ‘Star pupils’ is featured by
being those who always behave as expected and have
many of the best attributes. Being the community
with the youngest members (mostly less than 22
years), first-year students are likely allocated to this
community and keeping most of the study habits they
used to have at high school. (2) With students slightly
older than the first community presented (around 23),
‘Team players’ members consider the group work
an essential factor in their learning preferences, as
long as they can choose their groups. (3) The
third community, ‘Independent students’, has students
around 25 years. This community is featured by
preferring lecture recordings instead of always being
at the university’s venues. (4) ‘Versatile Students’
community has, on average, the oldest students (around
27 years). This community share features with the
previous communities. From the first four communities,
we can infer that the year of study and maturity level
play an important role in determining the study style
profile and preferences. As the students move on in
their undergraduate studies, they become able to adapt
their learning strategy to the needs and requirements
of each course, presumably more complex in the last
terms of their studies, becoming a ‘Versatile Student’.
Last but not least, the fifth community, ‘Female Power’,
is characterized by being the only community with
more females than males. In line with the results
obtained by [14] gender-based homophily is present.
Nevertheless, besides gender, the attribute that features
this community is how important getting good grades
is. Regarding singletons, most of them prefer lecture
recording, and as it could be expected, they prefer
working alone; if the teacher or the students choose the
groups, it does not matter.
The sample of the students examined in this
paper falls under STEM, which stands for Science,
Technology, Engineering and Mathematics. Within
the literature on STEM, there has been an ongoing
discussion on the issues related to students’ sense of
belonging. On that note, an extensive body of literature
has focused on solutions targeted towards developing
an increased sense of belonging, which is thought to
lower the impact of identity-related issues on education;
one such identity-related issue can be due to skewed
distribution between genders in STEM [32]. This
paper is a contribution to that literature through the five
communities.
Among the limitations of this study, the data was
gathered with an online survey. It does not allow linking
the students with the average grade, the number of
credits earned, or the year of study. That information
could be helpful to analyze how the communities
evolve through the years, to what extent the students
interact with and provide support to peers from other
years, and how the study style selection relates to
academic performance. Other drawbacks of this data
collection method are the (i) response rate [33], affecting
the performance of the Kruskal-Wallis tests due to
small group sizes and (ii) response biases related to
social desirability and the tendency to always select
extreme ends in the Likert scale [34]. Regarding the
sample used in this analysis, the students surveyed
belong to the Computer Science department; students
in other departments may have different study styles,
so the identified communities’ presence should not be
generalized to students in other departments. This
paper focused on the biggest communities; 30% of the
students with at least one connection are part of the
communities and profiles presented. Therefore, the
other communities should be analyzed to identify if their
study profiles are similar to those identified.
The results regarding the profiles of undergraduate
students lead to relevant implications and future work.
Before the pandemic arose, only one of the five
identified study profiles preferred lecture recordings
over the university venues. The students in the
community ‘Independent students’ could have fewer
difficulties during the ERT as they had previously
interacted with the lecture recording platform. In
contrast, most students were used to attending
face-to-face lectures, and their transition to distance
learning could be more complicated. Furthermore, the
impact of the forced distance learning period could
be bigger for the youngest students, ‘Star pupils’, as
they highly prefer university venues. Another affected
community was ‘Team players’, who highly valued
interacting with their peers; during the pandemic,
their interactions were limited due to the pandemic
restrictions to contain the spread of the virus. Recent
studies on the effects of the pandemic in education
provide insights into how the new normality in Higher
Education will be [35, 36, 37, 3]. On one side, the
use of the technology and the transition to hybrid
learning are expected to be present in the new normality
of higher education [36]. In contrast, the impact of
ERT on the students’ experience with online learning
may modify their preferences about enrolling in online
courses in the future [35, 38]. The communities and
study profiles analyzed in this study correspond to the
pre-pandemic period. Consequently, once the pandemic
finishes, it is essential to study and understand the
changes in the learning profile and preferences of the
students in the communities highly affected by the
ERT. Furthermore, understanding to what extent the
pre-pandemic communities’ structure remained or not
after the pandemic will allow redesigning teaching
strategies to provide better support to the students
according to their specific profiles.
The past term, Spring 2021, ran still with restrictions
due to the COVID-19 pandemic. Most of the schools
around the world remained closed. The term Autumn
2021 will be, maybe, the first term of Education
adaptation after the pandemic. Future work of
this research relates to analyzing the undergraduate
students’ new connections, study profiles, self-efficacy
perception, and goals during the adaptation to a new
normality in higher education. In addition, other
university departments and other data sources, such as
data generated from LMSs or external tools, such as
forum activity or recorded lectures, will be included
to enrich the SNA and student communities’ profiling.
The integration of those data sources will allow linking
the results obtained in this study with similar research
performed using SNA on LMSs data, making our
conclusions more generalizable. Finally, more research
is also needed in investigating the evolution of students’
social networks through their years of study at the
university and how their modifications relate to their
study profile and performance.
References
[1] R. Huang, J. M. Spector, and J. Yang, Educational
Technology. Springer, 2019.
[2] A. Nguyen, L. Gardner, and D. Sheridan, A Design
Methodology for Learning Analytics Information
Systems: Informing Learning Analytics Development
with Learning Design. 2020.
[3] N. G. L ´
opez Flores, A. S. Islind, and M. ´
Oskarsd´
ottir,
“Effects of the covid-19 pandemic on learning and
teaching: a case study from higher education,arXiv
preprint arXiv:2105.01432, 2021.
[4] D. Gaˇ
sevi´
c, S. Dawson, and G. Siemens, “Let’s
not forget: Learning analytics are about learning,”
TechTrends, vol. 59, no. 1, p. 64–71, 2015.
[5] M. Gaebel, T. Zhang, H. Stoeber, and A. Morrisroe,
“Digitally enhanced learning and teaching in european
higher education institutions,” tech. rep., European
University Association absl., 2021.
[6] A. Dutt, M. A. Ismail, and T. Herawan, “A systematic
review on educational data mining,IEEE Access, vol. 5,
2017.
[7] A. Pe˜
na-Ayala, “Learning analytics: A glance of
evolution, status, and trends according to a proposed
taxonomy,WIREs Data Mining and Knowledge
Discovery, vol. 8, May 2018.
[8] M. Brown, “Learning analytics: Moving from concept to
practice,” 2012.
[9] S. Roy and S. N. Singh, “Emerging trends in
applications of big data in educational data mining and
learning analytics,” in 2017 7th International Conference
on Cloud Computing, Data Science Engineering -
Confluence, pp. 193–198, 2017.
[10] A. Aleem and M. M. Gore, “Educational data mining
methods: A survey,” in 2020 IEEE 9th International
Conference on Communication Systems and Network
Technologies (CSNT), pp. 182–188, 2020.
[11] C. C. Aggarwal, Data Mining. Springer International
Publishing, 2015.
[12] G. Deeva, S. Willermark, A. S. Islind, and
M. Oskarsdottir, “Introduction to the minitrack on
learning analytics,” 2021.
[13] F. Menczer, S. Fortunato, and C. A. Davis, A first course
in network science. Cambridge University Press, 1 ed.,
2020.
[14] Q. Nguyen, O. Poquet, C. Brooks, and W. Li, “Exploring
homophily in demographics and academic performance
using spatial-temporal student networks,” in Proceedings
of the 13th International Conference on Educational
Data Mining, EDM 2020, Fully virtual conference, July
10-13, 2020 (A. N. Rafferty, J. Whitehill, C. Romero,
and V. Cavalli-Sforza, eds.), International Educational
Data Mining Society, 2020.
[15] M. S. Boroujeni, T. Hecking, H. U. Hoppe, and
P. Dillenbourg, “Dynamics of mooc discussion forums,”
in Proceedings of the Seventh International Learning
Analytics & Knowledge Conference, LAK ’17, (New
York, NY, USA), p. 128–137, Association for Computing
Machinery, 2017.
[16] E. G. Sturlud ´
ottir, E. Arnard´
ottir, G. Hj´
almt´
ysson, and
M. ´
Oskarsd´
ottir, “Gaining insights on student course
selection in higher education with community detection,”
2021.
[17] Y. Xu, C. Lynch, and T. Barnes, “How many friends can
you make in a week?: evolving social relationships in
moocs over time,” in EDM, 2018.
[18] S. Yassine, S. Kadry, and M.-A. Sicilia, “Application of
community detection algorithms on learning networks.
the case of khan academy repository, Computer
Applications in Engineering Education, vol. 29, no. 2,
pp. 411–424, 2020.
[19] N. Ahmad and Z. Tasir, “Threshold value in automatic
learning style detection,” Procedia - Social and
Behavioral Sciences, vol. 97, p. 346–352, 2013.
[20] R. Costaguta and M. de los Angeles Menini, “An
assistant agent for group formation in cscl based on
student learning styles,” in Proceedings of the 7th Euro
American Conference on Telematics and Information
Systems, EATIS ’14, (New York, NY, USA), Association
for Computing Machinery, 2014.
[21] R. Shobbrook, P. Branch, and P. Ling, “Using
educational data mining to test the validity of learning
style theory,” in 2020 IEEE International Conference
on Teaching, Assessment, and Learning for Engineering
(TALE), pp. 490–496, 2020.
[22] P. A. Kirschner, “Stop propagating the learning styles
myth,” Computers & Education, vol. 106, p. 166–171,
2017.
[23] M. Shirvani Boroujeni and P. Dillenbourg, “Discovery
and temporal analysis of mooc study patterns,” Journal
of Learning Analytics, vol. 6, no. 1, p. 16–33, 2019.
[24] K. Casey and D. Azcona, “Utilizing student activity
patterns to predict performance,” International Journal
of Educational Technology in Higher Education, vol. 14,
no. 1, 2017.
[25] S. Y. Lee, H. S. Chae, and G. Natriello, “Identifying
user engagement patterns in an online video discussion
platform,” in EDM, 2018.
[26] J.-C. Lemmens et al.,Students’ readiness for university
education. PhD thesis, University of Pretoria, 2011.
[27] K. S. Taber, “The use of cronbach’s alpha when
developing and reporting research instruments in science
education,” Research in Science Education, vol. 48, Dec
2018.
[28] C. Hodges, S. Moore, B. Lockee, T. Trust, and A. Bund,
“The difference between emergency remote teaching and
online learning,” 2020.
[29] L. Igual and S. Segu´
ı, Introduction to Data Science.
Springer International Publishing, 2017.
[30] A.-L. Barab´
asi and M. P´
osfai, Network science.
Cambridge: Cambridge University Press, 2016.
[31] M. E. J. Newman, “Assortative mixing in networks,
Physical Review Letters, vol. 89, no. 20, 2002.
[32] R. Robnett, “The role of peer support for girls and
women in stem: implications for identity and anticipated
retention,” International Journal of Gender, Science and
Technology, vol. 5, no. 3, pp. 232–253, 2013.
[33] B. Blumberg, D. R. Cooper, and P. S. Schindler, Business
research methods. McGraw-Hill Higher Education,
3 ed., 2011.
[34] Handbook of Learning Analytics. Society for Learning
Analytics Research (SoLAR), May 2017.
[35] E. M. Aucejo, J. French, M. P. Ugalde Araya, and
B. Zafar, “The impact of covid-19 on student experiences
and expectations: Evidence from a survey,Journal of
Public Economics, vol. 191, p. 104271, 2020.
[36] A. Benito, K. Dogan Yenisey, K. Khanna, M. F. Masis,
R. M. Monge, M. A. Tugtan, L. D. Vega Araya,
and R. Vig, “Changes that should remain in higher
education post covid-19: A mixed-methods analysis of
the experiences at three universities,Higher Learning
Research Communications, vol. 11, no. 0, 2021.
[37] S. Roy and B. Covelli, “Covid-19 induced transition
from classroom to online mid semester: Case study on
faculty and students’ preferences and opinions,” Higher
Learning Research Communications, vol. 11, no. 0,
2020.
[38] K. O’Neill, N. Lopes, J. Nesbit, S. Reinhardt, and
K. Jayasundera, “Modeling undergraduates’ selection of
course modality: A large sample, multi-discipline study,
The Internet and Higher Education, vol. 48, p. 100776,
2021.
... Studies have been conducted on the effect and impact of the ERT on teaching and learning in varied educational settings. For instance, changes in students' learning patterns, motivation and engagement levels have been researched (Means et al., 2020;Patricia Aguilera-Hermida, 2020); as well as the pre-pandemic students' study profiles and their relationship with the forced change in the teaching modality to online and distance learning (López Flores et al., 2022). There were multiple challenges teachers faced during this period, including feelings of ineffectiveness when it came to using digital platforms (Abilleira et al., 2021;Trust and Whalen, 2020), the lack of institutional support for professional development and workplace learning (Niu, 2024;Nikolopoulou and Kousloglou, 2022), and the need to nurture pedagogical and technological training (Tsegay et al., 2022). ...
Article
Purpose The acceleration of technology adoption in higher education, prompted by the global shift to online teaching during the COVID-19 pandemic, called for responsive programmes to address pedagogical challenges. This paper aims to present the design process and initial adoption of an institutional programme created to support instructors in providing educational resources for online and hybrid undergraduate courses at Reykjavik University. Design/methodology/approach By adopting a socio-technical perspective, the programme encompasses teacher support and digital platform use. Additionally, the programme aimed to enhance the student experience by increasing course consistency and facilitating data collection for future research on learning analytics. Findings The findings demonstrate the programme’s successful adoption, effectively strengthening teachers’ practices. Key contributions include a teacher-centric perspective on technology challenges and a socio-technical conceptualisation informed by teachers’ experiences during the pandemic. Originality/value This research provides valuable insights for teachers, administrators and researchers developing similar initiatives for effective professional development of faculty in online and hybrid teaching environments.
... Social Network Analysis (SNA) has been used to investigate several aspects of education; in most cases the students are represented as nodes in the network, and the edges connecting them represent different kinds of relationships or communication events, either online or face-to-face [4]. Among the educational aspects investigated using SNA, the most common are academic success and dropout [8,9], the influence of homophily on performance [10,11], Massive Open Online Courses (MOOCs) [12], study patterns [13], course selection [14,15], collaborative learning [16], and community detection [14,17,18]. Data sources commonly used to construct the networks include self-reports and surveys in face-to-face settings, as well as discussion logs and threads in online events and forums for online networks. ...
Conference Paper
Full-text available
The temporal component has been pointed out as an impactful dimension for educational research, as learning is not a static phenomena. In this paper, we investigate how the students' activity and interaction dynamics in an online forum differ among teaching modalities. The analysis is based on three years of forum interaction data in an undergraduate course where the only significant change was the teaching modality, from on-site (2019) to fully online (2021). We build and analyse temporal networks by studying changes in several networks' measures and features. Our preliminary results show changes in the students' and teachers' interaction dynamics as well as changes in the posts' content with fully online teaching. This ongoing research is focused on studying the impact of teaching modalities on the discussion forum interactions. Initial results are promising, and other features such as the students' closeness and betweenness, as well as the dynamics changes' relationship with the academic performance are being explored to improve our understanding on teaching modalities and help generalisation of Network Science and Learning Analytics research in wider educational contexts.
... A significant number of researchers have reported on the effect and impact of the ERT on teaching and learning in varied educational settings. For instance, changes in students' learning patterns, and motivation and engagement levels have been studied [2], [8]; as well as the pre-pandemic students' study profiles and their relationship with the forced change in the teaching modality to online and distance learning [9]. Teachers' challenges mentioned in the literature are related to the lack of guidance and training on distance learning modalities, work overload, lack of interaction with the students, and difficulty to uphold students' motivation levels. ...
... The results show that although the general view is that most are satisfied with learning online during extreme situations, there is a lower engagement level to be found for students in the virtual classroom in general. Moreover, Lopez Flores et al [41] analyze the swift move online and find that study patterns change significantly for students in higher education and show through click-logs that students are not shifting between subjects, but instead study each subject in a more focused manner, before moving over to the next subject, i.e., students are multi-tasking less. The long-term effects of these changes on students remain to be seen and some studies suggest that the sensible way forward, is through a blended learning environment. ...
Article
Full-text available
For many teachers, the COVID-19 pandemic meant an instant shift from teaching in traditional to a virtual classroom to reduce the spread of infection. It represents a widespread and intensive case of digitalization of teaching practice and many stakeholders are asking the imminent question of which transformations that ‘will stick’ and become a constant in the ‘new normal’ onwards. However, research of online teaching in a high school context remains limited. In this study, we analyze what happens when teaching is redirected from the traditional to the virtual classroom and explore what characterizes educational affordances in the virtual classroom. The context is 15 high schools in Sweden and the empirical data includes a survey with a total of 1103 teachers. Educational affordances are used as an analytic lens to conceptualize what teaching activities that the virtual classroom afford. The main contribution includes theorizing about what activities, interactions, and procedures that the virtual classroom affords by presenting seven educational affordances and contrast these with teaching in traditional classrooms. The affordances consist of 1) Structure 2) One-to-one communication 3) Formalized reconciliations 4) Peace and quiet 5) Hidden back channels 6) Right time and 7) Reaches certain students. The seven affordances can make a foundation for reflection and discussions of how to create a didactic design adapted for different classrooms. Furthermore, we contribute with implications to teachers and school leaders.
Chapter
Full-text available
In the field of social network analysis, understanding interactions and group structures takes a center stage. This chapter focuses on finding such groups, constellations or ensembles of actors who can be grouped together, a process often referred to as community detection, particularly in the context of educational research. Community detection aims to uncover tightly knit subgroups of nodes who share strong connectivity within a network or have connectivity patterns that demarcates them from the others. This chapter explores various algorithms and techniques to detect these groups or cohesive clusters. Using well-known R packages, the chapter presents the core approach of identifying and visualizing densely connected subgroups in learning networks.
Article
Full-text available
Objectives: The goal of the present study is to describe how the transition to remote emergency delivery was addressed in three universities during COVID-19 pandemic, to determine the satisfaction levels of their students and faculty with this new teaching-learning experience, and to gather their opinions about the future of Higher Education. Method: The study uses a mixed methods approach, including faculty and student surveys and focus groups Results: The study shows high satisfaction with the emergency remote delivery, and clearly reflects the relevancy of enhancing the digital components of the future learning experiences in Higher Education and a unanimous preference for hybrid education, providing some interesting recommendations to institutions regarding what students and faculty would like to keep for a more effective learning experience when the new normality comes. Conclusions: COVID-19 has had terrible consequences, however, the authors of this paper believe that this pandemic has brought along some positive effects and improvement opportunities in higher education, and if the results of the present study are any indication, the future of face to face higher education should be hybrid. Implication for Theory and / or Practice: This study may have some impact on future research initiatives, but the aspiration of the authors of this paper would be to inform decision making, and make direct recommendations to institutional leaders and policy makers regarding the necessary enhancement of the digital component of the teaching and learning process in Higher Education. Keywords: COVID-19; emergency remote delivery; hybrid higher education
Conference Paper
Full-text available
Network analysis in educational research has primarily relied on self-reported relationships or connections inferred from online learning environments, such as discussion forums. However, a large part of students' social connections through day-today on-campus encounters has remained underexplored. The paper examines spatial-temporal student networks using campus WiFi log data throughout a semester, and their relations to the student demographics and academic performance. A tie in the spatial-temporal network was inferred when two individuals connected to the same WiFi access point at the same time intervals at the 'beyond chance' frequency. Our findings revealed that students were more likely to co-locate with the individuals of similar gender, ethnic group identity, family income, and grades. Analysis of homophily over the semester showed that students of the same gender were more likely to co-locate as the semester progressed. However, co-location of the students similar on ethnic minority identity, family income, and grades remained consistent throughout the semester. Mixed-effect regression models demonstrated that features derived from spatial-temporal networks, such as degree, the grade of the most frequently co-located peer, and average grade of five most frequently co-located peers were positively associated with academic performance. This study offers a unique exploration of the potential use of WiFi log data in understanding of student relationships integral to the quality of college experience.
Article
Scholarly understanding is limited with regard to what influences students' choice to take a particular course fully online or in-person. We surveyed 650 undergraduates at a public Canadian university who were enrolled in courses that were offered in both modalities during the same semester, for roughly the same tuition cost. The courses spanned a wide range of disciplines, from archaeology to computing science. Twenty-five variables were gauged, covering areas including students' personal circumstances, their competence in the language of instruction, previous experience with online courses, grade expectations, and psychological variables including their regulation of their time and study environment, work avoidance and social goal orientation. Two logistic regression models (of modality of enrolment and modality of preference) both had good fit to the data, each correctly classifying roughly 75% of cases using different variables. Implications for instructional design and enrolment management are discussed.
Article
Objectives: The purpose of this study was to investigate faculty and students’ reactions to the COVID-19 emergency move to online classes. The goal was to better inform instructional strategies to be used in similar circumstances and to inform best practices in online pedagogy. Method: Online surveys were administered to students and faculty near the end of the semester to evaluate different aspects of the transition. Classes included in the study were scheduled as full-semester, on-campus classes but made an emergency switch to online post-spring break, after eight weeks. Results: Students’ and faculty’s comfort levels at the time of the switch depended on the amount of prior experience they had in online teaching and learning. Individual students and faculty experienced varying degrees of ease of adjustment to the switch in format from in-class to online. Faculty had to adapt quickly to determine the best way to replicate the in-class experience. Many faculty would depend on familiarity with technology and creativity with its usage. To varying degrees, comfort level improved as the semester progressed for both faculty and students. Still, a majority of students expressed less interest than before in taking online classes. Conclusions: The level of preparedness of faculty and students determined the outcome of this natural experiment. The adjustment was easier for those with prior experience with the online format and/or for those who felt comfortable with the format. Implication for Practice: As faculty and students prepare to return to the classroom, consideration can be given to best practices in online pedagogy to support students and faculty. Our findings point to the need for institutional preparedness for unforeseen circumstances.
Article
In order to understand the impact of the COVID-19 pandemic on higher education, we surveyed approximately 1,500 students at one of the largest public institutions in the United States using an instrument designed to recover the causal impact of the pandemic on students’ current and expected outcomes. Results show large negative effects across many dimensions. Due to COVID-19: 13% of students have delayed graduation, 40% have lost a job, internship, or job offer, and 29% expect to earn less at age 35. Moreover, these effects have been highly heterogeneous. One quarter of students increased their study time by more than 4 hours per week due to COVID-19, while another quarter decreased their study time by more than 5 hours per week. This heterogeneity often followed existing socioeconomic divides; lower-income students are 55% more likely than their higher-income peers to have delayed graduation due to COVID-19. Finally, we show that the economic and health related shocks induced by COVID-19 vary systematically by socioeconomic factors and constitute key mediators in explaining the large (and heterogeneous) effects of the pandemic.
Conference Paper
Educational Data Mining (EDM) is an emerging inter-disciplinary research area that involves education and computer science. EDM employs data mining tools and techniques, on large datasets related to education, to extract meaningful and useful information. EDM works toward the improvement of educational processes by introducing better and effective learning practices. EDM methods refer to the set of methods that are used for building models/applications. This article presents an extensive literature survey of EDM methods. The article also discusses research trends and challenges in EDM. This insight into EDM attempts to provide useful and valuable information to researchers interested in furthering the field of EDM.
Article
The rapid development of online learning networks has resulted in the widespread use of recorded educational contents. While the community structure of those networks may have an influence on the use of contents, research on detecting online learning communities and investigating their structures using social network analysis (SNA) methods is scarce. The purpose of the research presented here is to investigate the structure of online learning networks and their users’ engagement patterns. In this study, Khan Academy, a widely used video learning repository, will be used as a case. Community detection algorithms are used to detect the development of online learning communities and network performance and effectiveness measures are applied to assess the network structure, effectiveness, and efficiency of a large dataset consisting of 359,163 users that interacted with Khan Academy's videos with over 3M questions and answers. The results demonstrate that different community detection algorithms can be implemented on learning networks and produce good learning communities which are not necessarily related to a domain or a topic. Measures such as density can be used to measure social presence while centrality measures are used to define central users and hubs in the communities. This study complements previous research that shed the light on the power and potential of SNA measures to structurally evaluate and detect online learning communities.
Book
Cambridge Core - Statistical Physics - A First Course in Network Science - by Filippo Menczer