Conference PaperPDF Available

Sentiment Analysis of Media in German on the Refugee Crisis in Europe

Authors:
  • HENSOLDT Analytics

Abstract and Figures

Since the summer of 2015, the refugee crisis in Europe has grown to be one of the biggest challenges Europe has faced since WW2. The development of this humanitarian crisis are the topic of discussions throughout Europe and covered by media on a daily basis. Germany in particular has been the focus of migration. Over time, in Germany and the neighboring German speaking countries a shift could be observed, from the initial hospitable Willkommenskultur (welcome culture), to more reserved and skeptical points of view. These factors - Germany as the prime-destination for migrants, as well as a shift in public perception and media coverage - are the motivation for our analysis. The current article investigates the coverage of this crisis on traditional and social media, employing sentiment analysis to detect tendencies and relates these to real-world events. To this end, sentiment analysis was applied to textual documents of a data-set collected from relevant and highly circulated German, Austrian and Swiss traditional media sources and from social media in the course of six months from October 2015 to March of 2016.
Content may be subject to copyright.
Sentiment Analysis of Media in German on the
Refugee Crisis in Europe
Gerhard Backfried and Gayane Shalunts
SAIL LABS Technology GmbH
Vienna, Austria
{gerhard.backfried, gayane.shalunts}@sail-labs.com
Abstract. Since the summer of 2015, the refugee crisis in Europe has
grown to be one of the biggest challenges Europe has faced since WW2.
The development of this humanitarian crisis are the topic of discussions
throughout Europe and covered by media on a daily basis. Germany
in particular has been the focus of migration. Over time, in Germany
and the neighboring German speaking countries a shift could be ob-
served, from the initial hospitable Willkommenskultur (welcome culture),
to more reserved and skeptical points of view. These factors - Germany as
the prime-destination for migrants, as well as a shift in public perception
and media coverage - are the motivation for our analysis. The current
article investigates the coverage of this crisis on traditional and social
media, employing sentiment analysis to detect tendencies and relates
these to real-world events. To this end, sentiment analysis was applied
to textual documents of a data-set collected from relevant and highly cir-
culated German, Austrian and Swiss traditional media sources and from
social media in the course of six months from October 2015 to March of
2016.
Key words: Sentiment analysis, media analysis, refugee crisis
1 Introduction
Sentiment Analysis (SA) tackles the problem of determining the objectivity or
polarity of the input. The main parameters defining the scope of an SA method
are the target language, domain and media type (traditional or social media).
The most common application is the monitoring of public opinions in marketing
(product reviews) and politics (election campaigns). Whereas the research field
is active, most publications are limited to the domains of movie and product
reviews in English. SA-approaches can be divided into two broad categories:
machine learning and lexicon-based ones. Machine learning methods are im-
plemented as supervised binary (positive/negative) classification approaches, in
which classifiers are trained on labeled data [1, 2]. The dependency on a labeled
dataset is considered a major drawback, as labeling is usually costly and im-
possible in some cases. In contrast, lexicon-based methods [3] use a predefined
set of patterns (sentiment dictionary or lexicon) associating each entry with a
specific sentiment and score and do not require any labeled training data. Here
2 Gerhard Backfried and Gayane Shalunts
the challenge lies in designing an appropriate lexicon for the target domain. A
comparison of eight state-of-the-art SA methods (SentiWordNet [4], SASA [5],
PANAS-t [6], Emoticons, SentiStrength [7], LIWC [8], SenticNet [9] and Happi-
ness Index [10]) is performed in [1]. All experiments are carried out using two
English datasets of Online Social Networks messages. The authors report that
the examined methods have different levels of applicability on real-world events
and vary widely in their agreement on the predicted polarity. The authors in [11]
also limit their work to English, but target the domain of news. SA is applied
in the context of the refugee crisis to tweets in English by [12]. In general, the
number of SA approaches for languages other than English is limited. Sentimen-
tWS [13] and [14] analyze textual data in German. In the present paper, the
state-of-the-art SA tool SentiSAIL [15] is employed as it supports the process-
ing of content in German and has been adapted to the domain of news articles,
specifically to news on disasters and crises [16].
The analysis of sources in German is motivated by the fact that Germany
and Austria are affected by the refugee crises to a great extent. Swiss sources in
German were also included due to their proximity, even though the situation in
Switzerland is different. The period of time covered by this paper corresponds to
an important period, when the initial, enthusiastic welcome-culture was slowly
fading and being replaced by more concerned opinions as to whether the affected
countries would be able to cope with the massive influx of refugees. The sources
covered reflect traditional and social media and include the leading news outlets
of the three countries, as well as a variety of accounts from Twitter and Facebook
(only publicly available information was processed!).
The current article makes the following contributions: (i) presents an auto-
matically compiled corpus of texts from traditional and social media in German,
covering the refugee crisis over a period of six months from October 2015 to
March 2016, (ii) investigates the temporal development of sentiment across the
different sources and types of media, (iii) identifies the most prominent sources
and differences in their behavior across the period. The remainder of the paper is
organized as follows: Section 2 clarifies the methodology of SentiSAIL. Section 3
presents the corpora, empirical setup and findings. Section 4 concludes the work
and proposes alleys for further research.
2 SentiSAIL Lexicon-based Approach
SentiSAIL is a multilingual SA tool addressing the domain of general news and
particularly the coverage of disasters/crises [17, 15]. It is based on one of the
state-of-the-art SA methods, SentiStrength [7] and integrated into the SAIL
LABS Media Mining System (MMS) for Open Source Intelligence (OSINT) [18].
SentiSAIL addresses content from traditional and social media in a variety of lan-
guages (English, German, Russian, Spanish, French and Arabic). Performance
of SentiSAIL on a trilingual traditional media corpus is reported in [15]. Sen-
tiSAIL was also used to analyze social media data in German concerning the
European floods 2013 [17]. Like [3], it employs a lexicon-based approach, us-
ing lexicons of words associated with scores of positive or negative orientation.
Sentiment Analysis of Media in German on the Refugee Crisis in Europe 3
Features such as stemming, boosting (intensification or weakening), negation as
well as the scoring of phrases and idioms aim to model the structure and se-
mantics of the language. Whereas SentiStrength is optimized for and evaluated
on social media content, SentiSAIL targets both social and traditional media
data. Social media features are parameterized and may be disabled during tra-
ditional media processing. SentiStrength and SentiSAIL features are compared
in [15] on a proprietary traditional media corpus, reporting SentiSAIL’s perfor-
mance improvement to be moderate for English and considerable for German
and Russian. SentiSAIL, like [19], solves a dual classification task by assigning a
text into one of the following 4 classes: positive,negative,mixed (both positive
and negative) or neutral (neither positive, nor negative). The dual classification
scheme is motivated by the ability of humans to experience positive and negative
emotions simultaneously [20]. The classification of input text is performed in a
3-step process as outlined in [15].
3 Experimental Setup and Results
The corpus of documents in German covering the humanitarian crisis of refugees
in Europe was compiled using the SAIL LABS MMS, a system for the collection
and processing of data from open sources [18]. It spans documents from tradi-
tional media (Web-Feeds and -pages) and from social media (Twitter, Facebook)
covering the period from October 2015 to March 2016. Documents from tradi-
tional media comprise 48733 articles from 68 of the most circulated traditional
media sources in Germany, Austria and Switzerland. The social media corpus
contains 16593 tweets, posts and comments from Twitter and Facebook from a
total of 5996 accounts. All documents were selected by using keywords based
on the German word Fl¨uchtling (refugee). The same words were subsequently
excluded from SA to avoid negative bias.
Fig. 1 and Fig. 2 present a break-down of the percentages of documents
of three sentiment classes for traditional and social media. All texts pertain-
ing to the mixed class are considered as half-positive and half-negative for all
evaluations and visualizations. They clearly display the higher percentage of
sentiment-laden content on social media, where the neutral class only accounts
for 21% compared to 67% on the traditional media. Fig. 3 displays the percentage
of positive, negative and neutral articles compared to the overall volume of tra-
ditional media articles per day. Over time, a slight upward trend can be observed
for both, the negative and positive classes, with a more pronounced rise of nega-
tive documents possibly indicating more polarized reporting. The percentage of
neutral documents decreases accordingly. Fig. 4 is the equivalent chart of Fig. 3
for social media. A slight rise in negative posts can be observed over time, while
positive posts decline and neutral ones stay on approximately the same level.
The percentages of positive and negative posts are both constantly higher than
on traditional media, confirming that sentiment is generally expressed more ac-
tively on social media (a comments section of traditional news typically behaves
like social media in this respect). Overall, the dominating temporal sentiment
in traditional media is neutral whereas social media are dominated by negative
4 Gerhard Backfried and Gayane Shalunts
Fig. 1. Sentiment distribution in tra-
ditional media
Fig. 2. Sentiment distribution in social
media
sentiment. This trend persists throughout the observed period and may be ex-
plained by the tendency of traditional media to provide rather unbiased coverage,
whereas content on social media tends to be sentiment-laden. This tendency is
in line with the findings of [21], who report on social media reactions to news on
traditional media.
It is difficult to precisely relate all positive and negative peaks in sentiment for
traditional and social media to real-world events; however, the following events
may be related to those peaks 1.
– October 29 2015: (negative) Pegida 2demonstrations attacking Germany
politicians Angela Merkel and Sigmar Gabriel.
November 21 2015: (negative) cancelation of the soccer-match between Ger-
many and France in Hannover
– December 15 2015 (negative) left-extremist demonstrations and clashes be-
tween demonstrators and police in Leipzig
– January 1 2016: (negative) sexual assaults by migrants during New Year’s
celebrations in Cologne
February 7 2016: (negative) Pegida Aktionstag (day of action)
March 22 2016: (negative) terror attacks at Brussels Airport
Of the above events, the sexual assaults committed during the New Year’s
celebrations in Cologne likely had the greatest impact on media coverage and
also resulted in legal action by the German state. However, several other key
events which happened during the period - e.g. Austria’s introduction of upper-
limits (Feb 19), the effective closing of the Balkan route (March 10) or a summit
with Turkey (March 18) - do not seem to have left direct traces on sentiment.
Table 1 displays an overview of the five most active (most articles) sources
per country 3. The ratio of negative to positive articles is most pronounced for
1Information about these events has been taken from http://zeitstrahl-
߬uchtlingskrise.org, providing excellent coverage and history of events concerning
the refugee crisis, accessed on 2016/06/08
2Pegida: Patriotische Europ¨aer gegen Islamisierung des Abendlandes (Patriotic Eu-
ropeans Against the Islamisation of the West), www.pegida.de
3Germany: Passauer Neue Presse, Frankfurter Allgemeine, Focus, Welt, Spiegel; Aus-
tria: Der Standard, Kleine Zeitung, Salzburger Nachrichten, Die Presse, Wiener
Sentiment Analysis of Media in German on the Refugee Crisis in Europe 5
Fig. 3. Relative sentiment of traditional media
Fig. 4. Relative sentiment social media
Germany with Swiss and Austrian newspapers exhibiting a much lower ratio.
The percentage of neutral articles is similar for all three countries, indicating
that objectiveness is approximately the same for the most active sources. The
percentage of positive articles is approximately the same for the three countries;
the percentage of negative articles slightly higher for Germany. Passau, at the
border of Germany and Austria became a hot-spot for migrants crossing into
Germany, which may explain the unusually high number of articles produced
by the Passauer Neue Presse. Based in the Austrian province bordering Ger-
many, the Ober¨osterreiche Nachrichten is the Austrian paper with the highest
percentage of positive articles. The General-Anzeiger Bonn and 20 Minuten are
Zeitung; Switzerland: Neue Z¨urcher Zeitung, Aargauer Zeitung, Tagesanzeiger,
Basler Zeitung, 20 Minuten
Germany Austria Switzerland
Number of articles top-5 sources 13280 7737 3966
Avg ratio negative/positive 120.08 24.31 58.29
Avg % neutral 65% 69% 70%
Avg % negative 34% 30% 29%
Avg % positive 1% 2% 1%
Table 1. Statistics for top-5 active sources per country
6 Gerhard Backfried and Gayane Shalunts
the most positive papers for Germany and Switzerland respectively. The three
papers with the highest circulation - Bild (Germany), Kronenzeitung (Austria)
and Tagesanzeiger (Switzerland) - dominate news distribution and are known
to have a large impact on public opinion. Bild and Kronenzeitung exhibit a
slightly more positive tendency than the top-5 most active papers in the respec-
tive countries, whereas Tagesanzeiger is slightly more negative than the average
Swiss papers.
4 Conclusion and Future Work
The paper presented the results of applying SA to a corpus of textual data cov-
ering the European refugee crisis during the period of October 2015 to March
2016. The distribution of sentiment in traditional media was substantially dif-
ferent from that of social media, with more neutral content being published in
traditional media. Both types of media show a tendency for negative content
to increase over time. On social media, a decline of positive posts could be ob-
served. These changes may be related to the general shift of attitudes towards
refugees over the observed period. The percentages of positive, negative and
neutral sentiment for the five most active news sources in Germany, Austria and
Switzerland are similar, with German sources exhibiting slightly more negative
articles. Several real-world events could be connected to the local maxima in sen-
timent values. Other notable events were not directly reflected in the sentiment
of articles and posts. Future work will include the analysis of actors and voices
on social media to gain further insights on how these are linked to traditional
media and on differences detected between actors from the different countries.
References
1. Gon¸calves, P., Ara´ujo, M., Benevenuto, F., Cha, M.: Comparing and Combining
Sentiment Analysis Methods. In: Proc. of the 1st ACM Conference on Online
Social Networks (COSN 2013), Boston, USA, ACM (2013) 27–38
2. Pang, B., Lee, L., Vaithyanathan, S.: Thumbs up?: Sentiment Classification Us-
ing Machine Learning Techniques. In: Proc. of the ACL conference on Empirical
Methods in Natural Language Processing (EMNLP ’02), Philadelphia, PA, USA
(2002) 79–86
3. Taboada, M., Brooke, J., Tofiloski, M., Voll, K., Stede, M.: Lexicon-based Methods
for Sentiment Analysis. Computational Linguistics 37(2) (2011) 267–307
4. Esuli, A., Sebastiani, F.: SENTIWORDNET: A Publicly Available Lexical Re-
source for Opinion Mining. In: Proc. of the 5th Conference on Language Resources
and Evaluation (LREC06). (2006) 417–422
5. Wang, H., Can, D., Kazemzadeh, A., Bar, F., Narayanan, S.: A System for Real-
time Twitter Sentiment Analysis of 2012 U.S. Presidential Election Cycle. In: ACL
(System Demonstrations). (2012) 115–120
6. Gon¸calves, P., Benevenuto, F., Cha, M.: PANAS-t: A Pychometric Scale for Mea-
suring Sentiments on Twitter. CoRR abs/1308.1857 (2013)
7. Thelwall, M., Buckley, K., Paltoglou, G., Cai, D., Kappas, A.: Sentiment Strength
Detection in Short Informal Text. J. American Society for Information Science
and Technology 61(12) (2010) 2544–2558
Sentiment Analysis of Media in German on the Refugee Crisis in Europe 7
8. Tausczik, Y.R., Pennebaker, J.W.: The Psychological Meaning of Words: LIWC
and Computerized Text Analysis Methods. Journal of Language and Social Psy-
chology 29(1) (2010) 25–54
9. Cambria, E., Speer, R., Havasi, C., Hussain, A.: SenticNet: A Publicly Available
Semantic Resource for Opinion Mining. In: AAAI Fall Symposium: Commonsense
Knowledge. (2010) 14–18
10. Dodds, P.S., Danforth, C.M.: Measuring the happiness of large-scale written ex-
pression: songs, blogs, and presidents. Journal of Happiness Studies 11(4) (2009)
441–456
11. Balahur, A., Steinberger, R., Kabadjov, M., Zavarella, V., van der Goot, E., Halkia,
M., Pouliquen, B., Belyaeva, J.: Sentiment Analysis in the News. In: Proc. of the
7th International Conference on Language Resources and Evaluation (LREC’10),
Valletta, Malta, ELRA (2010)
12. Coletto, M., Esuli, A., Lucchese, C., Muntean, C.I., Nardini, F.M., Perego, R.,
Renso, C.: Sentiment-enhanced multidimensional analysis of online social networks:
Perception of the mediterranean refugees crisis. In: Workshop on Social Network
Analysis Surveillance Technologies (SNAST 16), San Francisco, USA (2016)
13. Remus, R., Quasthoff, U., Heyer, G.: SentiWS - a German-language Resource for
Sentiment Analysis. In: Proc. of the 7th International Conference on Language
Resources and Evaluation (LREC), Valletta, Malta (2010) 1168–1171
14. Momtazi, S.: Fine-grained German Sentiment Analysis on Social Media. In:
Proc. of the 8th International Conference on Language Resources and Evaluation
(LREC’12), Istanbul, Turkey, ELRA (2012) 1215–1220
15. Shalunts, G., Backfried, G.: SentiSAIL: Sentiment Analysis in English, German
and Russian. In: Proc. of the 11th International Conference on Machine Learning
and Data Mining. MLDM ’15, Hamburg, Germany (2015) 87–97
16. Backfried, G., G¨ollner, J., Quirchmayr, G., Rainer, K., Kienast, G., G.Thallinger,
Schmidt, C., Peer, A.: Integration of Media Sources for Situation Analysis in the
Different Phases of Disaster Management: The QuOIMA Project. In: Proc. of
European Intelligence and Security Informatics Conference (EISIC ’13), Uppsala,
Sweden (2013) 143–146
17. Shalunts, G., Backfried, G., Prinz, K.: Sentiment Analysis of German Social Media
Data for Natural Disasters. In: Proc. of the 11th International Conference on
Information Systems for Crisis Response and Management (ISCRAM), University
Park, Pennsylvania, USA (2014) 752–756
18. Backfried, G., Schmidt, C., Pfeiffer, M., Quirchmayr, G., Glanzer, M., Rainer,
K.: Open source intelligence in disaster management. In: Proc. of the European
Intelligence and Security Informatics Conference (EISIC), Odense, Denmark, IEEE
(2012) 254–258
19. Wilson, T., Wiebe, J., Hoffmann, P.: Recognizing Contextual Polarity: An Explo-
ration of Features for Phrase-Level Sentiment Analysis. Computational Linguistics
(2009) 399–433
20. Norman, G.J., Norris, C.J., Gollan, J., Ito, T.A., Hawkley, L.C., Larsen, J.T.,
Cacioppo, J.T., Berntson, G.G.: Current Emotion Research in Psychophysiology:
The Neurobiology of Evaluative Bivalence. Emotion Review 3(3) (2011) 349–359
21. Tan, C., Friggeri, A., Adamic, L.A.: Lost in Propagation? Unfolding News Cycles
from the Source. In: Proc. of the 10th International AAAI Conference Web and
Social Media (ICWSM 2016), Cologne, Germany (2016)
... A small number of machine learning tools, frameworks and libraries were also identified in the included research. Some of thesse include Apache OpenNLP [7], Tensor-Flow [6], SentiSAIL [12], CoreNLP [117] and NLTK [16,94,139,176,189]. The Natural Language Toolkit (NLTK) [18], was fairly prevalent with about 5 included research articles using this tool. ...
... Many of the researchers have elected to collect data manually, or through application programming interfaces such as the Twitter API [138,190,196], or by downloading preexisting data sets, such as UNSW-NB15 [103] or AmritaDGA [264]. There are some examples of OSINT tools being used in the included research in combination with AI, such as Tor Crawler [108], Shodan [267], ReKognition boto3 [251], SAIL LABS Media Mining System [12] and the Tweepy API [94]. In [136] Maltego is used in the prototype Twitter cyberbullying detection system to graph relationships between the various accounts in the collected data. ...
Article
Full-text available
This paper presents a systematic review to identify research combining artificial intelligence (AI) algorithms with Open source intelligence (OSINT) applications and practices. Currently, there is a lack of compilation of these approaches in the research domain and similar systematic reviews do not include research that post dates the year 2019. This systematic review attempts to fill this gap by identifying recent research. The review used the preferred reporting items for systematic reviews and meta-analyses and identified 163 research articles focusing on OSINT applications leveraging AI algorithms. This systematic review outlines several research questions concerning meta-analysis of the included research and seeks to identify research limitations and future directions in this area. The review identifies that research gaps exist in the following areas: Incorporation of pre-existing OSINT tools with AI, the creation of AI-based OSINT models that apply to penetration testing, underutilisation of alternate data sources and the incorporation of dissemination functionality. The review additionally identifies future research directions in AI-based OSINT research in the following areas: Multi-lingual support, incorporation of additional data sources, improved model robustness against data poisoning, integration with live applications, real-world use, the addition of alert generation for dissemination purposes and incorporation of algorithms for use in planning.
... Many studies have documented that real-world events surrounding the refugee crisis significantly impact public perceptions and sentiments, suggesting that the sentiment patterns and event occurrences are inextricably linked (Backfried and Shalunts 2016). Öztürk and Ayvaz (2018) revealed that opinions expressed in Turkish tweets shifted over time due to external events, such as the Syrian government's chemical gas assault and the US airstrike in response. ...
Article
Full-text available
The Rohingya crisis has been a significant issue for national and international news media, capturing their attention for an extended period and documenting various phases of the crisis. Previous research exploring the tones and portrayal of the Rohingyas in the news lacks comparative and temporal analysis of news sentiments. In this study, we aim to fill this gap by analyzing 8,074 news stories on Rohingya issues published in 10 news media outlets from 10 countries between 2009 and 2023. Our computational sentiment analysis reveals that Rohingya-related news sentiments are predominantly negative and fluctuate over the years across different countries, showing little identifiable patterns. An ANOVA suggests significant variation in news sentiments among countries, with some countries exhibiting more similar sentiments than others, thus creating distinguishable groups. Some of our findings contradict previous scholarships, warranting further research and novel frameworks. Additionally, we encourage scrutiny of academic insights to address potential biases against news media’s journalistic integrity.
... Sentiment analysis is a dictionary-based text-mining technique for extracting opinions (Lei & Liu, 2021), which has been used to grasp the public sentiment towards refugees (Backfried & Shalunts, 2016;Öztürk & Ayvaz, 2018) or migration (Heidenreich et al., 2020). This chapter uses sentiment analysis to identify tendencies in the media's partisan leanings and to look at the portrayals related to them. ...
Chapter
Full-text available
This chapter examines the influences of the Taiwanese media’s political leaning on their coverage of Hong Kong migration by analysing news articles from the five major pro-Blue/Green printed media outlets between 2014 and 2022. Using structural topic modelling, this chapter demonstrates a politicalised trend relating to Hong Kong migration, involving two themes: migration safety and Political Security. Even though the migration safety theme has gained much more attention than the Political Security theme since the 2019 Protests in Hong Kong, the polarisation of the depiction of Hong Kong migration becomes particularly evident when issues related to the People's Republic of China (PRC) are factored in through topic comparison. In other words, compared to the pro-Blue media, which is inclined to portray the Hong Kong migration from a human rights perspective, the pro-Green media, which favours a nativist viewpoint, offer a more security-based narrative, such as cultural Sinicisation and the possibility of Chinese spies entering through Hong Kong migration. The results of sentiment analysis further highlight both media camps’ differences by detecting a positive or negative tendency.
... Finally, numerous studies have used sentiment analysis to address the representation of migrants and refugees, both in the mass media (Backfried & Shalunts, 2016) and on social networks (Heidenreich et al., 2020). The study of feelings associated with the discourse around migrants and refugees -the connection of which to hate speech has already been identified by Arcila-Calderón et al. (2021) matter. ...
Article
Full-text available
Hate speech directed towards migrants and refugees poses one of the greatest challenges in communication on social networks. By means of an automated analysis of 124,337 messages about migration collected on Twitter in Spain between 2015 and 2020, the presence of hate and the underlying sentiment in the discourse are analysed, as is their development and the possible differences between Spanish regions. It has been observed that, although greater attention was paid to migration in 2015 and 2016, the core years of the Mediterranean refugee crisis, the greatest volume of hate was detected in 2019 and 2020, following the rise of Vox and its anti-immigration rhetoric. In general, the sentiment in these messages was negative, although the difference is slight between those expressing hate and those which do not. Finally, differences have been observed between Spanish regions, with Asturias having the greatest presence of hate and Cantabria showing the most negative sentiment; such differences, however, are not great and no clear patterns have been detected to explain them.
... La cobertura de los medios de comunicación sobre procesos de migración puede llegar a tener serias consecuencias sobre las actitudes hacia con las personas migrantes o refugiadas (Quinsaat, 2014;Robin, 2004). Según Backfried & Shalunts (2016), por lo general, los medios de comunicación, generan mensajes sobre migración más neutros que el contenido producido y difundido en las redes sociales por usuarios comunes (Backfried & Shalunts 2016, pp. 239), esto claro en el contexto de los medios en Alemania y en el periodo de la crisis de refugiados en Europa en 2015. ...
Thesis
Full-text available
Los flujos migratorios en Centroamérica, el Caribe y México vieron un aumento durante el 2018. El fenómeno de las llamadas caravanas migrantes tomó relevancia en las agendas mediáticas, sociales y políticas a nivel nacional en México y en diferentes países del continente americano. Esta tesis aborda la discusión del hashtag #CaravanaMigrante durante cuatro de los episodios más presentes en la agenda de los medios nacionales en México, ocurridos entre octubre y noviembre del 2018. Las unidades de análisis extraídas desde Twitter se analizan desde la Teoría de la Identidad Social y pone especial énfasis en el concepto de la gestión de la amenaza a la identidad social en redes sociales digitales. Los resultados permiten observar que la discusión en Twitter no fue mayormente de rechazo y xenofobia por parte de los usuarios en México, contrario a lo que la narrativa de los medios de comunicación señaló en ese momento. Los resultados arrojan también las diferentes dimensione identitarias de los usuarios que participaron en el debate en dicha etiqueta. Permiten también observar los procesos de gestión, desplazamientos, ajustes y reforzamientos identitarios desempeñados en el espacio público digital. Migratory flows in Central America, the Caribbean and Mexico saw an increase during 2018. The phenomenon of the so-called migrant caravans became relevant in the media, social and political agendas at the national level in Mexico and in different countries of the American continent. This thesis addresses the discussion of the hashtag #CaravanaMigrante during four of the episodes most present in the national media agenda in Mexico, which occurred between October and November 2018. The units of analysis extracted from Twitter are analysed from the Social Identity Theory and place special emphasis on the concept of the management of the threat to social identity in digital social networks. The results show that the discussion on Twitter was not largely one of rejection and xenophobia on the part of users in Mexico, contrary to what the media narrative pointed out at the time. The results also show the different identity dimensions of the users who participated in the debate on the hashtag. They also allow us to observe the processes of management, displacement, adjustment and reinforcement of identities played out in the digital public space.
... In 2013, Allen and Blinders used widespread British newspapers from 2010-2012 to understand the patterns in coverage of migrant news [23]. Backfried and Shalunts [24] applied sentiment analysis to text documents from relevant media sources to study the migrant crisis in Europe and indicated the more neutral content being published in traditional media. Vazquez et al. [25] used topic modelling to extract the most important issues surrounding the migration crisis in Venezuela using online news articles. ...
Article
Full-text available
Implementing countrywide lockdown measures in India, from March 2020 to May 2020 was a major step to deal with the COVID -19 pandemic crisis. The decision of country lockdown adversely affected the urban migrant population, and a large section of them was compelled to move out of the urban areas to their native places. The reverse migration garnered widespread media attention and coverage in electronic as well as print media. The present study focuses on the coverage of the issue by print media using descriptive natural language text mining. The study uses topic modelling, clustering, and sentiment analysis to examine the articles on migration issues during the lockdown period published in two leading English newspapers in India- The Times of India and The Hindu. The sentiment analysis results indicate that the majority of articles have neutral sentiment while very few articles show high negative or positive polarity. Descriptive topic modelling results show that transport, food security, special services, and employment with migration and migrants are the majorly covered topics after employing Bag of Words and TF-IDF models. Clustering is performed to group the article titles based on similar traits using agglomerative hierarchical clustering.
Article
Full-text available
Sosyal medya platformları aracılığıyla toplumlar bilgilerini, isteklerini, ihtiyaçlarını ve tepkilerini yansıtabilmektedir. Sosyal medyanın tüm bu süreçlere açık yapısı ise toplumların bu ortamları sıkça kullanmasını sağlamaktadır. Özellikle toplumları etkileyen bir durum meydana geldiğinde insanlar oldukça hızlı bir şekilde sosyal medya aracılığıyla düşüncelerini paylaşabilmekte ve kimi durumlarda da başkalarının düşüncelerini etkileyebilmektedir. Bu doğrultuda, 6 Şubat 2023 tarihinde Kahramanmaraş merkezli meydana gelen deprem felaketi sonrasında insanlar öncelikle bilgilendirme ve yardım faaliyetlerini yürütme amacıyla sosyal medya ortamlarını kullanmışlardır. Daha sonrasında ise deprem bölgesine destek göndermeyen, depremle ilgili destekleyici paylaşımlar yapmayan ve yapılan destekler hakkında sosyal medya üzerinden toplumu bilgilendirmeyen kurumlar hakkında boykotlar başlamıştır. Bu boykotların devamlılığının ölçülmesi ve devamında da içeriklerinin doğru bir şekilde analiz edilmesi markalar açısından kriz iletişimi bağlamında oldukça önemlidir. Kriz anlarında on binlerce içeriğin takip edilmesi, analiz edilmesi ve sonuçlarının sunulması çok da mümkün görülmemektedir. Çalışmada, 6 Şubat tarihinde meydana gelen deprem sonrasında boykot edilen markalardan birisi olan Starbucks’a yönelik paylaşımlar veri madenciliği yöntemi ile analiz edilmiş ve hesaplamalı sosyal bilimler yöntemi kullanılmıştır. Bu yöntem dâhilinde depremin ilk günlerinde sessiz kalan Starbucks’ın açıklama yaptığı ilk tarih olan 9 Şubat 2023 ile 19 Mart 2023 tarih aralığındaki tweetler araştırmaya dâhil edilmiştir. İlgili veriler, veri temizleme süreçlerinden geçirildikten sonra BERT duygu analizine tabii tutulmuş ve sonrasında veri görselleştirmeleri yapılmıştır. Yirmi bin veri üzerinden gerçekleştirilen duygu analizi sonuçları negatif tweetlerin fazlalığına dikkat çekmektedir. Analiz verilerine göre 13621 negatif, 5059 pozitif ve 1320 nötr tweetin olduğu sonucuna ulaşılmıştır. Tweetler duygu durumları bakımından yüzdesel olarak %68 negatif, %7 nötr ve %25 pozitif dağılım göstermektedir.
Article
Full-text available
El discurso de odio dirigido hacia personas migrantes y refugiadas plantea uno de los mayores desafíos en la comunicación en redes so- ciales. A través del análisis automatizado de 124.337 mensajes sobre migración recogidos en Twitter en España entre 2015 y 2020, se anali- za la presencia de odio y el sentimiento subyacente en el discurso, así como su evolución y las posibles diferencias entre comunidades autó- nomas. Se ha observado que, aunque la atención prestada a la cues- tión migratoria fue superior en 2015 y 2016, años centrales de la crisis de refugiados del Mediterráneo, el mayor volumen de odio se detectó en 2019 y 2020, tras el ascenso de Vox y su retórica anti-inmigración. En general, el sentimiento de estos mensajes fue negativo, si bien la diferencia entre aquellos que tienen odio y los que no es reducida. Por último, se han observado diferencias entre regiones autónomas, sien- do Asturias la de mayor presencia de odio y Cantabria la que muestra un sentimiento más negativo; estas diferencias, no obstante, no son elevadas y no se han detectado patrones claros que las expliquen.
Article
The 2022 Russian invasion of Ukraine sent shockwaves through Europe and led to rapid policy changes concomitant with variations in citizen perceptions. This article analyses how EU public opinion on security and defence matters has reacted to the war: what patterns of change and continuity can be detected, what differences are visible between Member States, and how might those be explained? Our analysis draws on big data-based sentiment analysis of news sources, reflecting a widely recognized connection between media coverage and public opinion – especially during crisis times – and complementing more traditional measurements of citizen perceptions such as opinion polls. Broadly speaking, we find that the invasion has heightened rather than fundamentally altered underlying trends. Our article contributes to a growing literature on the acceptability of European integration in security and defence, showing that publics are generally supportive of it, and regard it as complementary to NATO.
Conference Paper
Full-text available
We propose an analytical framework able to investigate discussions about polarized topics in online social networks from many different angles. The framework supports the analysis of social networks along several dimensions: time, space and sentiment. We show that the proposed analytical framework and the methodology can be used to mine knowledge about the perception of complex social phenomena. We selected the refugee crisis discussions over Twitter as the case study. This difficult and controversial topic is an increasingly important issue for the EU. The raw stream of tweets is enriched with space information (user and mentioned locations), and sentiment (positive vs. negative) w.r.t. refugees. Our study shows differences in positive and negative sentiment in EU countries, in particular in UK, and by matching events, locations and perception it underlines opinion dynamics and common prejudices regarding the refugees.
Article
Full-text available
We propose an analytical framework able to investigate discussions about polarized topics in online social networks from many different angles. The framework supports the analysis of social networks along several dimensions: time, space and sentiment. We show that the proposed analytical framework and the methodology can be used to mine knowledge about the perception of complex social phenomena. We selected the refugee crisis discussions over Twitter as the case study. This difficult and controversial topic is an increasingly important issue for the EU. The raw stream of tweets is enriched with space information (user and mentioned locations), and sentiment (positive vs. negative) w.r.t. refugees. Our study shows differences in positive and negative sentiment in EU countries, in particular in UK, and by matching events, locations and perception it underlines opinion dynamics and common prejudices regarding the refugees.
Article
Full-text available
Expressing opinions and emotions on social media becomes a frequent activity in daily life. People express their opinions about various targets via social media and they are also interested to know about other opinions on the same target. Automatically identifying the sentiment of these texts and also the strength of the opinions is an enormous help for people and organizations who are willing to use this information for their goals. In this paper, we present a rule-based approach for German sentiment analysis. The proposed model provides a fine-grained annotation for German texts, which represents the sentiment strength of the input text using two scores: positive and negative. The scores show that if the text contains any positive or negative opinion as well as the strength of each positive and negative opinions. To this aim, a German opinion dictionary of 1,864 words is prepared and compared with other opinion dictionaries for German. We also introduce a new dataset for German sentiment analysis. The dataset contains 500 short texts from social media about German celebrities and is annotated by three annotators. The results show that the proposed unsupervised model outper-forms the supervised machine learning techniques. Moreover, the new dictionary performs better than other German opinion dictionaries.
Conference Paper
Full-text available
Several messages express opinions about events, products, and services, political views or even their author's emotional state and mood. Sentiment analysis has been used in several applications including analysis of the repercussions of events in social networks, analysis of opinions about products and services, and simply to better understand aspects of social communication in Online Social Networks (OSNs). There are multiple methods for measuring sentiments, including lexical-based approaches and supervised machine learning methods. Despite the wide use and popularity of some methods, it is unclear which method is better for identifying the polarity (i.e., positive or negative) of a message as the current literature does not provide a method of comparison among existing methods. Such a comparison is crucial for understanding the potential limitations, advantages, and disadvantages of popular methods in analyzing the content of OSNs messages. Our study aims at filling this gap by presenting comparisons of eight popular sentiment analysis methods in terms of coverage (i.e., the fraction of messages whose sentiment is identified) and agreement (i.e., the fraction of identified sentiments that are in tune with ground truth). We develop a new method that combines existing approaches, providing the best coverage results and competitive agreement. We also present a free Web service called iFeel, which provides an open API for accessing and comparing results across different sentiment methods for a given text.
Conference Paper
Full-text available
This paper describes a system for real-time analysis of public sentiment toward presidential candidates in the 2012 U.S. election as expressed on Twitter, a micro-blogging service. Twitter has become a central site where people express their opinions and views on political parties and candidates. Emerging events or news are often followed almost instantly by a burst in Twitter volume, providing a unique opportunity to gauge the relation between expressed public sentiment and electoral events. In addition, sentiment analysis can help explore how these events affect public opinion. While traditional content analysis takes days or weeks to complete, the system demonstrated here analyzes sentiment in the entire Twitter traffic about the election, delivering results instantly and continuously. It offers the public, the media, politicians and scholars a new and timely perspective on the dynamics of the electoral process and public opinion.
Article
The news media play an important role in informing the public on current events. Yet it has been difficult to understand the comprehensiveness of news media coverage on an event and how the reactions that the coverage evokes may diverge, because this requires identifying the origin of an event and tracing the information all the way to individuals who consume the news. In this work, we pinpoint the information source of an event in the form of a press release and investigate how its news cycle unfolds. We follow the news through three layers of propagation: the news articles covering the press release, shares of those articles in social media, and comments on the shares. We find that a news cycle typically lasts two days. Although news media in aggregate cover the information contained in the source, a single news article will typically only provide partial coverage. Sentiment, while dampened in news coverage relative to the source, again rises in social media shares and comments. As the information propagates through the layers, it tends to diverge from the source: while some ideas emphasized in the source fade, others emerge or gain in importance. We also discover how far the news article is from the information source in terms of sentiment or language does not help predict its popularity.
Article
Today millions of web-users express their opinions about many topics through blogs, wikis, fora, chats and social networks. For sectors such as e-commerce and e-tourism, it is very useful to automatically analyze the huge amount of social information available on the Web, but the extremely unstructured nature of these contents makes it a difficult task. SenticNet is a publicly available resource for opinion mining built exploiting AI and Semantic Web techniques. It uses dimensionality reduction to infer the polarity of common sense concepts and hence provide a public resource for mining opinions from natural language text at a semantic, rather than just syntactic, level. Copyright © 2010, Association for the Advancement of Artificial Intelligence. All rights reserved.
Conference Paper
Sentiment analysis has been well in the focus of researchers in recent years. Nevertheless despite the considerable amount of literature in the field the majority of publications target the domains of movie and product reviews in English. The current paper presents a novel sentiment analysis method, which extends the state-of-the-art by trilingual sentiment classification in the domains of general news and particularly the coverage of natural disasters in general news. The languages examined are English, German and Russian. The approach targets both traditional and social media content. The extensive experiments demonstrate that the performance of the proposed approach outperforms human annotators, as well as the original method, on which it is built and extended.
Conference Paper
Analysis of social media and traditional media provides significant information to first responders in times of natural disasters. Sentiment analysis , particularly of social media originating from the affected population, forms an integral part of multifaceted media analysis. The current paper extends an existing methodology to the domain of natural disasters, broadens the support of multiple languages and introduces a new man ner of classification. The performance of the approach is evaluated on a recently collected dataset manually annotated by three human annotators as a reference. The experiments show a high agreement rate between the approach taken and the annotators. Furthermore, the paper presents the initial application of the resulting technology and models to sentiment analysis of social media data in German, covering data collected during the Central European floods of 2013.
Conference Paper
In this paper we discuss the role of Open Source Intelligence (OSINT) in Disaster Management. In particular we present the use of the Sail Labs Media Mining System in the context of disaster relief operations and use samples to point out advantages and strengths of the MM-System. Future challenges in research and further development of this field are addressed towards the end of the paper.