Alessandro Galeazzi’s research while affiliated with Ca' Foscari University of Venice and other places

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (39)


Decoding Musical Evolution Through Network Science
  • Preprint
  • File available

January 2025

·

6 Reads

Niccolo' Di Marco

·

Edoardo Loru

·

Alessandro Galeazzi

·

[...]

·

Music has always been central to human culture, reflecting and shaping traditions, emotions, and societal changes. Technological advancements have transformed how music is created and consumed, influencing tastes and the music itself. In this study, we use Network Science to analyze musical complexity. Drawing on 20,000\approx20,000 MIDI files across six macro-genres spanning nearly four centuries, we represent each composition as a weighted directed network to study its structural properties. Our results show that Classical and Jazz compositions have higher complexity and melodic diversity than recently developed genres. However, a temporal analysis reveals a trend toward simplification, with even Classical and Jazz nearing the complexity levels of modern genres. This study highlights how digital tools and streaming platforms shape musical evolution, fostering new genres while driving homogenization and simplicity.

Download

Fig. S4 Boxplots of the observed number of retweets received by each influencer category and for each debate, compared with two different per-debate randomizations. For the 'Shuffled' randomization, we randomly permute the category label across all influencers; for the 'Uniform' randomization, we randomly sample the labels from a uniform distribution where each category has p = 1/6 of being sampled. To compare the observed distributions with the two randomizations, we conduct a two-sample two-sided Wilcoxon rank-sum test for each pair and report the corresponding P -values adjusted with the Holm-Bonferroni method (see Table S2 for a full breakdown of the test results).
Fig. S5 Distribution of the Gini Index of users' preference in retweeting influencer categories. The vertical lines indicate, from left to right, the 0.25, 0.5, and 0.75 quantiles of the distribution.
Who Sets the Agenda on Social Media? Ideology and Polarization in Online Debates

December 2024

·

18 Reads

The abundance of information on social media has reshaped public discussions, shifting attention to the mechanisms that drive online discourse. This study analyzes large-scale Twitter (now X) data from three global debates -- Climate Change, COVID-19, and the Russo-Ukrainian War -- to investigate the structural dynamics of engagement. Our findings reveal that discussions are not primarily shaped by specific categories of actors, such as media or activists, but by shared ideological alignment. Users consistently form polarized communities, where their ideological stance in one debate predicts their positions in others. This polarization transcends individual topics, reflecting a broader pattern of ideological divides. Furthermore, the influence of individual actors within these communities appears secondary to the reinforcing effects of selective exposure and shared narratives. Overall, our results underscore that ideological alignment, rather than actor prominence, plays a central role in structuring online discourse and shaping the spread of information in polarized environments.


Fig. 1: Centrality. Rescaled adjacency matrix showing the ratio between the observed and expected number of URLs pointing from one platform to another. Green (red) cells indicate values greater (smaller) than one. A value of 0 indicate no observed URLs.
Fig. 3: Cosine similarity network based on the platforms' 20 most linked domains. The size of the different nodes is proportional to the volume of links shared in the platform, while the colors of the pies indicate the fraction of questionable or reliable content shared. We observe two cliques with high similarity: one made up of mainstream platforms (Facebook, Twitter, Reddit) that share a majority of reliable news sources, and one made up of alt-tech ones (Gab, Parler, Voat) sharing a higher fraction of questionable sources. Scored, BitChute, and, to an extent, YouTube remain fairly separated from the rest of the platforms.
Fig. 4: Distributions of users' political leaning for each platform. Each unique user gets assigned a leaning score between −1 and +1, according to their posting activity. The bars are colored according to the number of "questionable" sources shared by users of a specific leaning. We notice how Facebook and Twitter have a polarized user base with two distinct groups, one sharing mostly reliable content and the other sharing mostly content with low factual reporting. The situation is more homogeneous regarding all of the fringe platforms, and the remaining two mainstream platforms.
Fig. 5: Adjacency matrix describing the number of links from and to each of the nine different platforms.
Timeframe and number of URLs collected (before and after processing) for each platform's data set.
Characterizing the Fragmentation of the Social Media Ecosystem

November 2024

·

43 Reads

The entertainment-driven dynamics of social media platforms encourage users to engage with like-minded individuals and consume content aligned with their beliefs. These dynamics may amplify polarization by reinforcing shared perspectives and reducing exposure to diverse viewpoints. Simultaneously, users migrate from one platform to another, either forced by moderation policies, such as de-platforming, or spontaneously seeking environments more aligned with their preferences. These migrations foster the specialization and differentiation of the social media ecosystem, with platforms increasingly organized around specific user communities and shared content preferences. This shift marks an evolution from echo chambers enclosed within platforms to "echo platforms", i.e., entire platforms functioning as ideologically homogeneous niches. This study introduces an operational framework to systematically analyze these dynamics, by examining three key dimensions: platform centrality (central vs. peripheral), news consumption (reliable vs questionable), and user base composition (uniform vs diverse). To this aim, we leverage a dataset of 126M URLs posted by nearly 6M users on nine social media platforms, namely Facebook, Reddit, Twitter (now X), YouTube, BitChute, Gab, Parler, Scored, and Voat. We find a clear separation between mainstream and alt-tech platforms, with the second category being characterized by a peripheral role in the social media ecosystem, a greater prevalence of unreliable content, and a heightened ideological uniformity. These findings outline the main dimensions defining the fragmentation and polarization of the social media ecosystem.


Fig. 1. Biases in the information chain.
Fig. 2. Estimated news outlets' propensity to report on positive events against adverse events (left) and distributions of distances and angles of the point from the balanced selection line (right). The 45-degree line represents the set of all points showing a balanced selection of news, i.e. equal propensity of reporting on positive and adverse events.
Fig. 3. News outlets' narrative bias in reporting positive events compared to their estimated stance in reporting adverse events, as estimated by the Latent Space Bayesian Model. Points are colored according to the classification retrieved from third-party data. The asymmetry in axis values is due to different framing strategies adopted when reporting events of different natures (positive or negative).
Fig. 4. Propensity vs narrative values for positive, adverse, and neutral events. Points represent news outlets' scores for narrative and propensity computed with the Latent Space Bayesian Model. Questionable outlets (top row) exhibit a moderate correlation (Pearson's coefficients: 0.460, −0.514, 0.429, P-value < 0.001) between propensity and narrative for all three types of events. In contrast, reliable outlets (bottom row) show weak correlations (Pearson's coefficients: 0.266, −0.274, −0.198, P-value < 0.001), suggesting that, for the latter, higher values of selection bias do not necessarily imply higher values of narrative bias.
Breakdown of the dataset.
Unveiling the hidden agenda: Biases in news reporting and consumption

October 2024

·

28 Reads

·

2 Citations

PNAS Nexus

Recognizing the presence and impact of news outlets’ biases on public discourse is a crucial challenge. Biased news significantly shapes how individuals perceive events, potentially jeopardizing public and individual well-being. In assessing news outlet reliability, the focus has predominantly centered on narrative bias, sidelining other biases such as selecting events favoring specific perspectives (selection bias). Leveraging machine learning techniques, we have compiled a six-year dataset of articles related to vaccines, categorizing them based on narrative and event types. Employing a Bayesian latent space model, we quantify both selection and narrative biases in news outlets. Results show third-party assessments align with narrative bias but struggle to identify selection bias accurately. Moreover, extreme and negative perspectives attract more attention, and consumption analysis unveils shared audiences among ideologically similar outlets, suggesting an echo chamber structure. Quantifying news outlets’ selection bias is crucial for ensuring a comprehensive representation of global events in online debates.


Figure 1: P-score distribution of original content. Colors represent the different categories of Tweets, highlighting the different contributions of each category depending on the p-score. The volume of content pointing to Twitter itself has little contribution to the overall distribution.
Revealing The Secret Power: How Algorithms Can Influence Content Visibility on Social Media

October 2024

·

127 Reads

Online social media platforms significantly influence public debates by shaping the information users encounter. Content visibility on these platforms is regulated by recommendation algorithms designed to maximize user engagement using individual-level data, including personal preferences and interactions. These algorithms play a crucial role in information dissemination, yet their inner workings are often undisclosed, raising concerns about potential manipulation of visibility. While algorithms may be intended to limit the spread of harmful content, they can also be exploited to suppress dissenting voices without users' awareness. The suspicion that platforms deliberately reduce the visibility of certain users or content - commonly known as shadow banning - has garnered significant public attention, with numerous figures advocating for greater transparency around this practice. In this study, we perform a quantitative study geared to identify suspicious changes in content visibility on Twitter (now known as X). We build and study a dataset of over 13 million tweets from more than 5 million users discussing the Ukraine conflict, including each tweet's number of views and metadata, aiming to detect reduced or inflated visibility patterns. We investigate how visibility correlates with factors such as authors' stance, role, interaction networks, and content narratives. Our findings reveal significant variations in visibility, likely driven by algorithmic interventions. These results highlight the need for greater transparency in regulating online information ecosystems to prevent algorithmic manipulation that could undermine public discourse and the fairness of debates.


Figure 2. Low factuality users tend to have higher tweet count than high factuality users. Panel a: Comparing the distributions of tweet count between low (orange curve) and high factuality (blue curve) users, we find that low factuality users have a significantly higher number of tweets compared to high factuality users (p-value < 0.0001). The median value for low factuality users (orange dotted line) is 2.99 tweets per day, whereas for high factuality users (blue dotted line) it is 1.78. Panel b: The MWU test score obtained from the empirical data (red dotted line) is higher than all the MWU test scores calculated on the 1,000 shuffled datasets (green bars).
Figure 4. Low factuality users tend to have lower number of days since registration than high factuality users. Panel a: Comparing the distributions of the number of days since registration between low (orange curve) and high factuality (blue curve) users, we find that low factuality users have a significantly lower number of days compared to high factuality users (p-value < 0.0001). The median value for low factuality users (orange dotted line) is 3134 days since registration, whereas for high factuality users (blue dotted line) it is 3493. Panel b: The MWU test score obtained from the empirical data (red dotted line) is lower than all the MWU test scores calculated on the 1,000 shuffled datasets (green bars).
Figure 5. Low factuality users tend to have higher followed account count than high factuality users. Comparing the distributions of followed account count between low (orange curve) and high factuality (blue curve) users, we find that low factuality users have a significantly higher number of followed accounts compared to high factuality users (p-value < 0.0001) in each of the tested datasets. The median value for low factuality users (orange dotted line) is higher than for high factuality users (blue dotted line) in each of the tested datasets. The peak around 5000 is due to an X policy that limits the number of new followed accounts until the user obtains more followers.
Easy-access online social media metrics can effectively identify misinformation sharing users

August 2024

·

23 Reads

Misinformation poses a significant challenge studied extensively by researchers, yet acquiring data to identify primary sharers is costly and challenging. To address this, we propose a low-barrier approach to differentiate social media users who are more likely to share misinformation from those who are less likely. Leveraging insights from previous studies, we demonstrate that easy-access online social network metrics -- average daily tweet count, and account age -- can be leveraged to help identify potential low factuality content spreaders on X (previously known as Twitter). We find that higher tweet frequency is positively associated with low factuality in shared content, while account age is negatively associated with it. We also find that some of the effects, namely the effect of the number of accounts followed and the number of tweets produced, differ depending on the number of followers a user has. Our findings show that relying on these easy-access social network metrics could serve as a low-barrier approach for initial identification of users who are more likely to spread misinformation, and therefore contribute to combating misinformation effectively on social media platforms.


Sampled Datasets Risk Substantial Bias in the Identification of Political Polarization on Social Media

June 2024

·

49 Reads

Following recent policy changes by X (Twitter) and other social media platforms, user interaction data has become increasingly difficult to access. These restrictions are impeding robust research pertaining to social and political phenomena online, which is critical due to the profound impact social media platforms may have on our societies. Here, we investigate the reliability of polarization measures obtained from different samples of social media data by studying the structural polarization of the Polish political debate on Twitter over a 24-hour period. First, we show that the political discussion on Twitter is only a small subset of the wider Twitter discussion. Second, we find that large samples can be representative of the whole political discussion on a platform, but small samples consistently fail to accurately reflect the true structure of polarization online. Finally, we demonstrate that keyword-based samples can be representative if keywords are selected with great care, but that poorly selected keywords can result in substantial political bias in the sampled data. Our findings demonstrate that it is not possible to measure polarization in a reliable way with small, sampled datasets, highlighting why the current lack of research data is so problematic, and providing insight into the practical implementation of the European Union's Digital Service Act which aims to improve researchers' access to social media data.


News and misinformation consumption: A temporal comparison across European countries

May 2024

·

62 Reads

·

2 Citations

The Internet and social media have transformed the information landscape, democratizing content access and production. While making information easily accessible, these platforms can also act as channels for spreading misinformation, posing crucial societal challenges. To address this, understanding news consumption patterns and unraveling the complexities of the online information environment are essential. Previous studies highlight polarization and misinformation in online discussions, but many focus on specific topics or contexts, often overlooking comprehensive cross-country and cross-topic analyses. However, the dynamics of debates, misinformation prevalence, and the efficacy of countermeasures are intrinsically tied to socio-cultural contexts. This work aims to bridge this gap by exploring information consumption patterns across four European countries over three years. Analyzing the Twitter activity of news outlets in France, Germany, Italy, and the UK, this study seeks to shed light on how topics of European significance resonate across these nations and the role played by misinformation sources. The results spotlight that while reliable sources predominantly shape the information landscape, unreliable content persists across all countries and topics. Though most users favor trustworthy sources, a small percentage predominantly consumes content from questionable sources, with even fewer maintaining a mixed information diet. The cross-country comparison unravels disparities in audience overlap among news sources, the prevalence of misinformation, and the proportion of users relying on questionable sources. Such distinctions surface not only across countries but also within various topics. These insights underscore the pressing need for tailored studies, crucial in designing targeted and effective countermeasures against misinformation and extreme polarization in the digital space.


Cumulative number of unique posts about ChatGPT discussion across various platforms (a) and distribution of interaction volume versus the number of posts on different platforms (b). The nature of interactions varies among platforms; For instance, on Twitter, interactions are the sum of likes, quotes, retweets and replies, while on Instagram and YouTube, interactions are the sum of likes and comments.
(a) Proportion of comments for each topic by platform. The cell color intensity corresponds to the proportion of comments discussing a given topic; a higher percentage results in a darker hue. (b) Box plots distributions of the sentiment tone across topics. On the x-axis, sentiment tones are represented as values. A negative value indicates a negative sentiment, while a positive value suggests the opposite. The further away from zero the value is, the stronger the sentiment. The vertical red dashed line at the 0 mark, differentiates positive tones from negative tones. Black diamonds inside the boxes indicate the average sentiment tone for each topic.
Cumulative number of unique users with logistic fits by platform. Each plot shows the cumulative count of unique users engaged in ChatGPT and COVID-19 vaccination-related topics over time. The fitted curve corresponds to a logistic function used to model the diffusion of unique users.
Cross-platform social dynamics: an analysis of ChatGPT and COVID-19 vaccine conversations

February 2024

·

62 Reads

·

9 Citations

The role of social media in information dissemination and agenda-setting has significantly expanded in recent years. By offering real-time interactions, online platforms have become invaluable tools for studying societal responses to significant events as they unfold. However, online reactions to external developments are influenced by various factors, including the nature of the event and the online environment. This study examines the dynamics of public discourse on digital platforms to shed light on this issue. We analyzed over 12 million posts and news articles related to two significant events: the release of ChatGPT in 2022 and the global discussions about COVID-19 vaccines in 2021. Data was collected from multiple platforms, including Twitter, Facebook, Instagram, Reddit, YouTube, and GDELT. We employed topic modeling techniques to uncover the distinct thematic emphases on each platform, which reflect their specific features and target audiences. Additionally, sentiment analysis revealed various public perceptions regarding the topics studied. Lastly, we compared the evolution of engagement across platforms, unveiling unique patterns for the same topic. Notably, discussions about COVID-19 vaccines spread more rapidly due to the immediacy of the subject, while discussions about ChatGPT, despite its technological importance, propagated more gradually.


Leading topics in hurricane related news articles, and key news terminology by reliability of the news sources
Climate change coverage is among the most covered topics in news articles about hurricanes. (a) The most prominent topics in the news dataset. Dark green bars correspond to hurricane specific topics, light green topics are not specific to an individual hurricane. (b) The terminology used by reliable (green) and unreliable (magenta) media outlets in the news articles which fall under the “climate change” topic. Words are ranked in descending order by the relative frequency within the two sets. The score shift indicates whether the term is disproportionately used by reliable (left) or unreliable (right) news outlets.
The impact of hurricanes on Twitter attention towards climate change in affected and unaffected regions, relative to a random baseline
We show the percentage change in the number of tweets after a hurricane impacts, with respect to the average number of tweets in the 30 days before the hurricane. We compare the in location (pink line) and the out of location curve (blue line) with respect to the random baseline (orange line). The shaded region around each curve is the standard deviations of the mean across all hurricanes.
Number of tweets in location and out of location aggregated over one month before and after the impact of each hurricane
The Damage column lists the cost of damages caused by each hurricane in US Dollars [52].
How does extreme weather impact the climate change discourse? Insights from the Twitter discussion on hurricanes

November 2023

·

81 Reads

·

5 Citations

The public understanding of climate change plays a critical role in translating climate science into climate action. In the public discourse, climate impacts are often discussed in the context of extreme weather events. Here, we analyse 65 million Twitter posts and 240 thousand news media articles related to 18 major hurricanes from 2010 to 2022 to clarify how hurricanes impact the public discussion around climate change. First, we analyse news content and show that climate change is the most prominent non hurricane-specific topic discussed by the news media in relation to hurricanes. Second, we perform a comparative analysis between reliable and questionable news media outlets, finding that unreliable outlets frequently refer to climate-related conspiracies and preferentially use the term “global warming” over “climate change”. Finally, using geolocated data, we show that accounts in regions affected by hurricanes discuss climate change at a significantly higher rate than accounts in unaffected areas, with references to climate change increasing by, on average, 80% after impact, and up to 200% for the largest hurricanes. Our findings demonstrate how hurricanes have a key impact on the public awareness of climate change.


Citations (19)


... News consumption has become a shared social experience, with individuals exchanging links and recommendations within their networks and treating news as a form of cultural currency [40]. As a result, traditional hierarchical models of media influence struggle to explain the dynamics of public discourse in this decentralized environment [41][42][43], where narratives and participants compete for prominence. ...

Reference:

Who Sets the Agenda on Social Media? Ideology and Polarization in Online Debates
Unveiling the hidden agenda: Biases in news reporting and consumption

PNAS Nexus

... For example, studies on the Brazilian elections have documented the strategic use of bots and coordinated networks to shape public opinion and influence voter perceptions, highlighting the risks of computational propaganda in electoral processes [56]. Similarly, research on European elections has illustrated how misinformation campaigns spread across social media can alter public perceptions, as seen in countries such as France, Germany, and Italy [7,18,35,53]. In the United States, coordinated online activities around elections have garnered significant attention, with investigations revealing how coordinated bot activities and foreign information operations have sought to manipulate voter beliefs and amplify divisive content across multiple election cycles [8,28,39,64]. ...

News and misinformation consumption: A temporal comparison across European countries

... The Facebook posts collection was facilitated through CrowdTangle, a digital platform owned by Facebook and designed for monitoring social media content (CrowdTangle Team, 2023). Recently, the CrowdTangle database of social media posts has been widely utilized by scholars across various fields, such as public health (Harper and Attwell, 2022;Alipour et al., 2024), politics (Giglietto et al., 2020;Ngo et al., 2022), communication (Angus et al., 2023), and finance (Ferretti and Sciandra, 2022;Ngo et al., 2023a;Bai and Lee, 2024) to investigate public sentiment and reactions to global or market events (e.g., the COVID-19 pandemic, elections). ...

Cross-platform social dynamics: an analysis of ChatGPT and COVID-19 vaccine conversations

... Previous research has focused on the origins of this polarization [67], its relation to political affiliations [68], and the influential roles played by media outlets [69] and corporations [70]. Recently, an increasing polarization was reported in locations suffering from extreme weather events [71]. ...

How does extreme weather impact the climate change discourse? Insights from the Twitter discussion on hurricanes

... To this end, the World Health Organization's Early Artificial Intelligence-Supported Response With Social Listening Platform (EARS, White et al. (2023)) used semi-supervised machine learning for classifying social media content into topics, which offered real-time analytics to public health researchers during the COVID-19 pandemic. Other works have used unsupervised methods, in particular, topic modeling (Blei, 2012), which represents topics as word distributions through generative probabilistic modeling (e.g., Rowe et al. (2021)), and text clustering (Willett, 1988), which represents topics as groups of semantically similar texts (e.g., Santoro et al. (2023)). In addition, sentiment analysis (Liu, 2012) can improve the understanding of the public perception of healthrelated topics by classifying sentiments expressed in texts (Boender et al., 2023;Briand et al., 2023). ...

Analyzing the changing landscape of the Covid-19 vaccine debate on Twitter

Social Network Analysis and Mining

... The information reflects how consumers interact with content. Comments and retweets show a more active and engaged audience, whereas likes and shares show passive acknowledgement and endorsement supported by this study (Baqir et al., 2023). The analysis demonstrates how language, engagement and user behaviour interact in complicated ways on social media. ...

Beyond Active Engagement: The Significance of Lurkers in a Polarized Twitter Debate

... Twitter is a popular venue for political discussions [4,5], such as referendums [6][7][8][9]. Ireland has a reasonably lengthy history of dealing with referendums [10], and two of them-the Irish Same-sex Marriage and the Irish Abortion Referendums-have received a lot of attention in social media, where millions of tweets were shared, and many public figures expressed their opinions [11,12]. ...

Political polarization of news media and influencers on Twitter in the 2016 and 2020 US presidential elections

Nature Human Behaviour

... The role of social media in political polarization has become a subject of intense debate, characterized by opposing views that, ironically, reflect polarization within the academic and public discourse itself. Some researchers argue that social media platforms amplify divisions by fragmentation [7][8][9], echo chambers [10] and, eventually, echo platforms [11,12]. Others contend that these platforms provide opportunities for democratic engagement and cross-partisan dialogue [13], fostering interactions that might not occur in offline settings [14]. ...

Growing polarization around climate change on social media

Nature Climate Change

... The role of social media in political polarization has become a subject of intense debate, characterized by opposing views that, ironically, reflect polarization within the academic and public discourse itself. Some researchers argue that social media platforms amplify divisions by fragmentation [7][8][9], echo chambers [10] and, eventually, echo platforms [11,12]. Others contend that these platforms provide opportunities for democratic engagement and cross-partisan dialogue [13], fostering interactions that might not occur in offline settings [14]. ...

Conspiracy theories and social media platforms
  • Citing Article
  • June 2022

... It has provided sustained connection and access to information [7]. However, it has also been characterized by an overburden of information, accurate and inaccurate, which has made it difficult for people to find trustworthy sources and reliable guidance [8,9]. False reports about COVID-19 vaccines may undermine public confidence in vaccination [7]. ...

COVID-19 infodemic on Facebook and containment measures in Italy, United Kingdom and New Zealand