000 04533nam a22003137a 4500
001 th647
003 ISI Library, Kolkata
005 20250916145853.0
008 250916b |||||||| |||| 00| 0 eng d
040 _aISI Library
_bEnglish
082 0 4 _223rd
_a616.0754
_bG427
100 1 _aGhosh, Susmita
_eauthor
245 1 0 _aEnhancing medical image analysis through deep learning: a comprehensive study on classification, segmentation, and multitask learning/
_cSusmita Ghosh
260 _aKolkata:
_bIndian Statistical Institute,
_c2025
300 _aix, 181 pages
_ccharts, ills
502 _aThesis (Ph. D.) - Indian Statistical Institute, 2025
504 _aIncludes bibliography
505 0 _aFoundations of Deep Learning in Medical Imaging: A Prelude -- A Deep Learning Framework Integrating the Spectral and Spatial Features for Image-assisted Medical Diagnostics -- An Improved Vision Transformer Model for Medical Image based Diagnostic Solution -- Multi-scale Morphology-aided Deep Medical Image Segmentation -- MA-DTNet: Multi-task Learning with Morphological Attention for Medical Image Analysis -- Conclusion
508 _aGuided by Prof. Swagatam Das
520 _aMedical image analysis has become indispensable for accurate diagnosis and treatment planning. However, despite advances in deep learning, several critical challenges persist, ranging from more efficient models to the integration of multiple tasks within a unified framework. This thesis addresses these challenges by proposing innovative deep learn- ing architectures that enhance medical image classification, segmentation, and multitask learning. At the heart of this research is the goal of developing models that deliver high performance and tackle the nuanced complexities of medical data. Existing clas- sification models often overlook valuable information hidden in the spectral domain of images. I address this by integrating spatial and spectral features, demonstrating their complementary power to detect diseases such as COVID-19 from chest radiographs. This approach facilitates a more holistic understanding of medical images, improving the ac- curacy and reliability of diagnostic systems. To further enhance image classification, I explore hybrid architectures that combine convolutional and transformer-based models. These models leverage the strengths of both architectures, capturing fine-grained visual details and long-range dependencies. This significantly improves various medical imaging datasets, offering deeper interpretability and superior classification accuracy, particularly in complex diagnostic scenarios. Moving beyond classification, I tackle the fundamen- tal challenge of segmenting complex and irregular regions within medical images, where traditional deep learning models often struggle. To overcome this, I introduce a novel segmentation framework that combines the power of deep neural networks with trainable morphological operations. This leads to a more precise delineation of regions of inter- est, even in challenging clinical scenarios, setting a new benchmark for medical image segmentation. One of the most pressing issues in medical imaging is the inefficiency of current multitask learning models, which often require vast computational resources and struggle to generalize across different tasks. I present a lightweight multitask learn- ing framework that excels at both segmentation and classification, particularly in breast tumor analysis. Using novel morphological attention mechanisms and the sharing of task- specific knowledge, proposed model significantly reduces computational complexity while improving performance. Importantly, this framework demonstrates versatility across various medical imaging domains, from gland segmentation and malignancy detection in histology images to skin lesion analysis, demonstrating its robustness and applicability in real-world settings. Altogether, this thesis offers solutions to some of the most pressing problems in medical image analysis, providing models that are not only more accurate but also computationally efficient, making them suitable for deployment in clinical practice.
650 4 _aMedical Image Analysis
650 4 _aMedical Image Classification
650 4 _aHybrid Architectures
650 4 _aVision Transformer (ViT)
650 4 _aConvolutional Neural Networks (CNNs)
650 4 _aDiscrete Wavelet Transform
856 _uhttps://dspace.isical.ac.in/jspui/handle/10263/7597
_yFull text
942 _2ddc
_cTH
999 _c437311
_d437311