Please use this identifier to cite or link to this item: http://hdl.handle.net/10263/7295
Full metadata record
DC Field | Value | Language
dc.contributor.author | Pai, Deepesh | -
dc.date.accessioned | 2022-03-22T10:31:08Z | -
dc.date.available | 2022-03-22T10:31:08Z | -
dc.date.issued | 2021-07 | -
dc.identifier.citation | 29p. | en_US
dc.identifier.uri | http://hdl.handle.net/10263/7295 | -
dc.description | Dissertation under the supervision of Debapriyo Majumdar | en_US
dc.description.abstract | Research in Natural Language Processing (NLP) is expanding into multiple domains and applications. With every advancement, the variety of texts being processed grows. One such domain is lyrics processing. Songs are vital to the music and film industries and are analyzed to extract information such as the genre, theme, mood, and author of a song. Bollywood, the Indian film industry, earns substantial revenue from songs. The number of songs produced by this industry is massive, making it a rich source of textual data for NLP tasks. One important topic in NLP is authorship identification: the task of identifying the author of a given text from a set of candidate authors. It is applied to tasks such as identifying anonymous authors, detecting plagiarism, and finding ghostwriters. This work also gives us an opportunity to work with data in Devanagari, which is a relatively less explored area. The main concern of this task is to define an appropriate characterization of texts that captures the writing style of authors. Although deep learning models such as LSTM and GRU have been used in various author identification tasks, BERT has not, to the best of our knowledge. This project aims to build a system that identifies the lyricist of a song from its lyrics. We have built a BERT-based model that takes the lyrics of a song as input and predicts its lyricist from their content. The results show that the proposed system outperforms its counterparts. | en_US
dc.language.iso | en | en_US
dc.publisher | Indian Statistical Institute, Kolkata | en_US
dc.relation.ispartofseries | Dissertation;;CS1920 | -
dc.subject | Natural Language Processing | en_US
dc.subject | Authorship identification | en_US
dc.subject | GRU | en_US
dc.subject | BERT | en_US
dc.title | Author Identification and Analysis of Bollywood Song Lyrics | en_US
dc.type | Other | en_US
Appears in Collections: Dissertations - M Tech (CS)

Files in This Item:
File | Description | Size | Format
Deepesh Pai-19-21.pdf | | 494.84 kB | Adobe PDF

