Laiba Mehnaz's research while affiliated with Delhi Technological University and other places

Publications (8)

Preprint
Full-text available
Code-switching is the communication phenomenon where speakers switch between different languages during a conversation. With the widespread adoption of conversational agents and chat platforms, code-switching has become an integral part of written conversations in many multi-lingual communities worldwide. This makes it essential to develop techniqu...
Preprint
Full-text available
In this paper we present our approach and the system description for Sub Task A of SemEval 2019 Task 9: Suggestion Mining from Online Reviews and Forums. Given a sentence, the task asks to predict whether the sentence consists of a suggestion or not. Our model is based on Universal Language Model Fine-tuning for Text Classification. We apply variou...
Preprint
Full-text available
In this paper we present our approach and the system description for Sub-task A and Sub Task B of SemEval 2019 Task 6: Identifying and Categorizing Offensive Language in Social Media. Sub-task A involves identifying if a given tweet is offensive or not, and Sub Task B involves detecting if an offensive tweet is targeted towards someone (group or an...

Citations

... Humans, on the other hand, may sometimes code-switch between languages, especially when there is no appropriate translation, or when readers are more familiar with foreign entities. There have been summarization resources addressing the code-switching phenomenon (Mehnaz et al., 2021), but they focus on summarizing from already code-switched source texts. ...
... After training on the WIkitext 103 dataset, the model is finetuned for a new dataset. Based on ULMFiT, Anand et al. [42] proposed a suggestion-mining approach for forums and online reviews. They also presented a system description for SubTask A of SemEval 2019 Task 9. ...
... However, shows limited ability with domains containing high types and slang such as social media [36]. The embedding size affects the quality of results, large embeddings provide consistent results [28] while higher dimensions may not improve the performance [100]. Another limitation of word embeddings resides in handling out-of-vocabulary (OOV) words which can be partially overcome using character-level embeddings information [101]. ...
... Methods based on neural networks became a more common choice [25,17]. During the latest instance of SMM4H, co-located with ACL 2019, transformersbased architectures such as BERT [6] and BioBERT [13] were the building blocks of the top performing systems [2,14,15]. However, these recent shared tasks have focused on a very particular type of social media texts: tweets, which are short, highly informal and noisy texts. ...
... Social media applications such as Twitter were effective in raising blood donation requests and dissemination, reducing the gap between blood donors and the people in need. 14,15 Among the various social media applications, WhatsApp was effective in managing the blood donation process. 16 However, a similar study in Brazil identified there was no significant impact of WhatsApp in increasing the number of blood donors and the retention rates of existing donors. ...