Please use this identifier to cite or link to this item: http://hdl.handle.net/10263/7340
Full metadata record
DC FieldValueLanguage
dc.contributor.authorBaksi, Arkadeep-
dc.date.accessioned2022-07-18T10:17:32Z-
dc.date.available2022-07-18T10:17:32Z-
dc.date.issued2022-07-
dc.identifier.citation50p.en_US
dc.identifier.urihttp://hdl.handle.net/10263/7340-
dc.descriptionDissertation under the supervision of Dr. Debapriyo Majumdaren_US
dc.description.abstractAnswer generation for a question, given a context has gained tremendous popularity in the NLP research space. Benchmark datasets like SQuAD[9] have propelled the research and recent years have seen many transformer based models achieving state of the art (SOTA) results on Question Answering tasks even beating human level accuracy. However the second step to a Question Answering System that Contextual Answer Validation is a much less attempted space in NLP. For the past few years India has seen a tremendous growth in the Edtech industry. These edtech firms are sitting on a gold mine of data primarily in Question Answer- ing space. As a result there is a growing demand for automatic Answer Validation Systems as well which can bypass the norm of human evaluation, automating the process. Apart from these, demand for such systems is also there in the Chatbot space to validate junk/spam responses and smoothen the chatbot experience overall. In our work we attempted the answer validation problem with the additional con- straints of the answer being single sentence long and having 10 words atleast. However due to the unavailability of exact datasets we had to generate synthetic data based on the SQuAD dataset. We build our model inspired from paraphrase detection and fine-tuned it against various datasets clubbed with the synthetic data we generated. Our model on final evaluation even hit an accuracy of 0.83 on the highly complex PAWS dataset which typically contains lexically highly overlapped examplesen_US
dc.language.isoenen_US
dc.publisherIndian Statistical Institute, Kolkataen_US
dc.relation.ispartofseriesDissertation;2022-1-
dc.subjectAnswer Validationen_US
dc.subjectDeep Learning in NLPen_US
dc.subjectRecurrent Neural Networks (RNN)en_US
dc.subjectLong Short Term Memory (LSTM)en_US
dc.titleContextual Answer Validationen_US
dc.typeOtheren_US
Appears in Collections:Dissertations - M Tech (CS)

Files in This Item:
File Description SizeFormat 
Arkadeep_Thesis-dissertation-18-7-22.pdf860.08 kBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.