Srimayi Gupta’s scientific contributions

What is this page?


This page lists works of an author who doesn't have a ResearchGate profile or hasn't added the works to their profile yet. It is automatically generated from public (personal) data to further our legitimate goal of comprehensive and accurate scientific recordkeeping. If you are this author and want this page removed, please let us know.

Publications (1)


Figure 4. Precision trend plot for racist tweets across model classifiers and data sets. Balancing the data set improved the precision.
Figure 6. F1 score trend plot for racist tweets across model classifiers and data sets. Balancing the data set improved the precision considerably.
Prediction metrics for raw tweets data
Prediction metrics for cleaned data post stop words removal and stemming
Prediction metrics for cleaned pre-processed and downsampled data
Twitter Sentiment Analysis Using Machine Learning
  • Article
  • Full-text available

May 2024

·

492 Reads

·

1 Citation

Journal of Student Research

Srimayi Gupta

·

Padmavathy Jawahar

In an age of social media, online forums, and chats, cyberbullying is a prevalent issue. On Twitter (now X), approximately 500 million tweets are shared per day (Antonakaki et.al., 2021). It is the job of the moderators to ensure these tweets follow standard community guidelines. However, the sheer number of tweets makes it difficult to sort manually and ensure they are following protocol. Sentiment analysis and machine learning algorithms can be used to classify these texts automatically as positive or negative. Normally, these machine learning models are much more efficient and may provide higher accuracy rates in identifying hate speech in Twitter. In this paper, we are exploring the use of five classical machine learning algorithms to classify Twitter hate speech as neutral, racist, or sexist. Model performance was compared after using raw tweet data versus pre-processed tweets through data cleanup. Furthermore, we highlight two methods to deal with imbalanced datasets to improve the prediction rates. Overall, we were able to achieve a 96% accuracy in correctly classifying tweets into the different labels.

Download