Please use this identifier to cite or link to this item: http://hdl.handle.net/10263/7295
Title: Author Identification and Analysis of Bollywood Song Lyrics
Authors: Pai, Deepesh
Keywords: Natural Language Processing; Authorship identification; GRU; BERT
Issue Date: Jul-2021
Publisher: Indian Statistical Institute, Kolkata
Citation: 29p.
Series/Report no.: Dissertation; CS1920
Abstract: Research in Natural Language Processing (NLP) is expanding across multiple domains and applications, and with every advancement the variety of texts being processed grows. One such domain is lyrics processing. Songs are vital to the music and film industries and are analyzed to extract information such as the genre, theme, mood, and author of a song. Bollywood, the Indian film industry, earns substantial revenue from songs. The number of songs produced by this industry is massive, making it a rich source of textual data for NLP tasks. One important topic in NLP is authorship identification: the task of identifying the author of a given text from a set of candidate authors. It is applied to tasks such as identifying anonymous authors, detecting plagiarism, and finding ghostwriters. This work also gives us an opportunity to work on data in Devanagari, which is a relatively less explored area. The main concern of this task is to define an appropriate characterization of texts that captures the writing style of authors. Although deep learning models such as LSTM and GRU have been used in various author identification tasks, BERT has not been used for this purpose (to the best of our knowledge). This study aims to build a system that identifies the lyricist of a song from its lyrics. We have built a BERT-based model that takes the lyrics of a song as input and predicts its lyricist based on their content. The results show that the proposed system outperforms its counterparts.
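Illustrative note: the abstract frames authorship identification as choosing one author from a fixed candidate set. The sketch below shows that task framing only. It is not the dissertation's BERT model; it swaps in a much simpler character n-gram profile with cosine similarity so that it runs with the Python standard library alone. The function names, the author labels (`lyricist_a`, `lyricist_b`), and the sample lines are all hypothetical placeholders, not data from the dissertation.

```python
from collections import Counter
from math import sqrt


def char_ngrams(text, n=3):
    """Count overlapping character n-grams; works on any Unicode script."""
    text = " ".join(text.split())  # normalize whitespace
    return Counter(text[i:i + n] for i in range(len(text) - n + 1))


def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(count * b[gram] for gram, count in a.items())
    norm_a = sqrt(sum(c * c for c in a.values()))
    norm_b = sqrt(sum(c * c for c in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


def build_profiles(samples, n=3):
    """Merge every known text of each author into one n-gram profile."""
    profiles = {}
    for author, text in samples:
        profiles.setdefault(author, Counter()).update(char_ngrams(text, n))
    return profiles


def identify_author(text, profiles, n=3):
    """Return the candidate author whose profile is most similar to the text."""
    query = char_ngrams(text, n)
    return max(profiles, key=lambda author: cosine(query, profiles[author]))


# Hypothetical toy data: placeholder author labels and made-up lines.
samples = [
    ("lyricist_a", "chanda chamke chamke raat bhar"),
    ("lyricist_a", "chanda re chanda chamke tu"),
    ("lyricist_b", "dil dhadke dhadke zor se"),
    ("lyricist_b", "dil ki dhadkan dhadke re"),
]
profiles = build_profiles(samples)
prediction = identify_author("chamke chanda saari raat", profiles)
```

Character n-grams are used here because they need no tokenizer and apply equally to romanized or Devanagari text. A learned model such as the BERT-based classifier described in the abstract would replace the profile and similarity steps while keeping the same input (lyrics) and output (one lyricist from the candidate set).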
Description: Dissertation under the supervision of Debapriyo Majumdar
URI: http://hdl.handle.net/10263/7295
Appears in Collections: Dissertations - M Tech (CS)

Files in This Item:
File: Deepesh Pai-19-21.pdf
Size: 494.84 kB
Format: Adobe PDF