Movies can be tagged with various details such as genre, plot structure, soundtracks, and visual and emotional experiences. This information can be used to build automatic systems to extract similar movies, enhance user experience, and improve the recommendations. In this paper, a machine learning-based approach is proposed to predict the tags which can be associated with a given movie. The
... [Show full abstract] problem is posed as a multi-label classification problem. To solve this, firstly, we created a fine-tuned set of various tags that exposed the varied characteristics of movie plots (a textual summary of a movie). Then, using different textual representation techniques such as TF-IDF, AVGW2V (averaged word vector), and several machine learning classifiers are used to predict the tags associated with a movie. It is believed that the proposed machine learning-based tag prediction can be useful in other tasks related to narrative analysis.KeywordsMulti-label classificationTF–IDFWord2VecPattern recognitionBig data