DSpace Repository

On the Choice of Appropriate Combination of Classifier and Decomposition Scheme for Multiclass Imbalanced Data Classification : A Comparative Analysis Sayantan


dc.contributor.author Kumar, Sayantan
dc.date.accessioned 2022-02-01T09:11:22Z
dc.date.available 2022-02-01T09:11:22Z
dc.date.issued 2019-07
dc.identifier.citation 54p. en_US
dc.identifier.uri http://hdl.handle.net/10263/7259
dc.description Dissertation under the supervision of Prof. en_US
dc.description.abstract Classifying a multiclass data set with an imbalanced distribution of class representatives is a challenging problem which is prevalent in many real-world applications. In this study, we have made a comparative analysis of different decomposition techniques like OneVsAll (OVA), OneVsOne (OVO), Error Correcting Output Codes (ECOC), All-and-One (A&O) and One-Against-Lower-Order (OALO) to deal with the multiclass imbalance. While OVA and OVO have been used widely in the multiclass imbalance domain, our work is the first to explore the remaining binarization approaches in this field. We have examined the performance of these decomposition methods on two types of learning: an algorithmic approach and a hybrid approach of both data-level and algorithmic solutions to solve the binary class imbalance classification problem. For the algorithmic approach we have used Hellinger Distance Decision Trees (HDDT), and for the hybrid method we propose Balanced Ensemble Models (BEM) that combine both sampling and algorithm-level modifications. We have analyzed how effectively the decomposition methods, when applied to our approach, can counter the challenges of multiclass imbalance. A detailed experimental study, supported by statistical analysis, has been carried out to determine which combination of classifier (between HDDT and our proposed ensemble method) and decomposition scheme works best to produce satisfactory classification performance on a multiclass imbalanced data set. From our research we conclude that the ECOC decomposition strategy, when applied to our proposed BEM, outperforms all the other algorithms in dealing with the multiclass imbalance problem. en_US
dc.language.iso en en_US
dc.publisher Indian Statistical Institute, Kolkata en_US
dc.relation.ispartofseries Dissertation;;2019:11
dc.subject Multiclass Imbalanced data classification en_US
dc.subject Decision tree en_US
dc.title On the Choice of Appropriate Combination of Classifier and Decomposition Scheme for Multiclass Imbalanced Data Classification : A Comparative Analysis Sayantan en_US
dc.type Other en_US

