Content uploaded by Mohsen Ali Alawami
Author content
All content in this area was uploaded by Mohsen Ali Alawami on Apr 02, 2021
Content may be subject to copyright.
ConTheModel: Can We Modify Tweets
to Confuse Classifier Models?
Aishwarya Ram Vinay , Mohsen Ali Alawami , and Hyoungshick Kim(B
)
Department of Electrical and Computer Engineering, Sungkyunkwan University,
Suwon, South Korea
{aishwarya,mohsencomm,hyoung}@skku.edu
Abstract. News on social media can significantly influence users,
manipulating them for political or economic reasons. Adversarial manip-
ulations in the text have proven to create vulnerabilities in classifiers,
and the current research is towards finding classifier models that are not
susceptible to such manipulations. In this paper, we present a novel tech-
nique called ConTheModel, which slightly modifies social media news
to confuse machine learning (ML)-based classifiers under the black-box
setting. ConTheModel replaces a word in the original tweet with its syn-
onym or antonym to generate tweets that confuse classifiers. We evaluate
our technique on three different scenarios of the dataset and perform a
comparison between five well-known machine learning algorithms, which
includes Support Vector Machine (SVM), Naive Bayes (NB), Random
Forest (RF), eXtreme Gradient Boosting (XGBoost), and Multilayer
Perceptron (MLP) to demonstrate the performance of classifiers on the
modifications done by ConTheModel. Our results show that the classi-
fiers are confused after modification with the utmost drop of 16.36%. We
additionally conducted a human study with 25 participants to validate
the effectiveness of ConTheModel and found that the majority of par-
ticipants (65%) found it challenging to classify the tweets correctly. We
hope our work will help in finding robust ML models against adversarial
examples.
Keywords: Machine learning ·Social media ·Adversarial examples ·
Tweets
1 Introduction
Various social media platforms are used by a large number of population world-
wide for communication as it is easily accessible. Statistics show that in 2020,
approximately 3.6 billion people were using social media, and this number would
increase by almost another billion by 2025 [6]. All is fine unless otherwise, when
what is passed off as “news” on social media is often disinformation. Contrary to
real news, fake news develops stories instead of reporting facts. Last October, a
new law was passed in Singapore, which bans the spreading of false information.
This law does so by allowing the government to instruct popular online social
c
Springer Nature Switzerland AG 2021
Y. Park et al. (Eds.): SVCC 2020, CCIS 1383, pp. 205–219, 2021.
https://doi.org/10.1007/978-3-030-72725-3_15