000 02956nam a22002177a 4500
001 th651
003 ISI Library, Kolkata
005 20250917160509.0
008 250917b |||||||| |||| 00| 0 eng d
040 _aISI Library
_bEnglish
082 0 4 _223rd
_a006.31
_bS131
100 1 _aSaha, Soumadeep
_eauthor
245 1 0 _aDomain Obedient Deep Learning/
_cSoumadeep Saha
260 _aKolkata:
_bIndian Statistical Institute,
_c2025
300 _axvii, 135 pages,
_bgraphs
502 _aThesis (Ph.D.)- Indian Statstical Institute, 2025
505 _aIntroduction -- Do vision systems learn rules? -- faithful language modeling -- Domain aware learning & evaluation -- Constrained inference -- Conclusion
520 _aDeep learning, a family of data-driven artificial intelligence techniques, has shown immense promise in a plethora of applications, and it has even outpaced experts in several domains. However, unlike symbolic approaches to learning, these methods fall short when it comes to abiding by and learning from pre-existing established principles. This is a significant deficit for deployment in critical applications such as robotics, medicine, industrial automation, etc. For a decision system to be considered for adoption in such fields, it must demonstrate the ability to adhere to specified constraints, an ability missing in deep learning-based approaches. Exploring this problem serves as the core tenet of this dissertation. This dissertation starts with an exploration of the abilities of conventional deep learning-based systems vis-à-vis domain coherence. A large-scale rule-annotated dataset is introduced to mitigate the pronounced lack of suitable constraint adherence evaluation benchmarks, and with its aid, the rule adherence abilities of vision systems are analyzed. Additionally, this study probes language models to elicit their performance characteristics with regard to domain consistency. Examination of these language models with interventions illustrates their ineptitude at obeying domain principles, and a mitigation strategy is proposed. This is followed by an exploration of techniques for imbuing deep learning systems with domain constraint information. Also, a comprehensive study of standard evaluation metrics and their blind spots pertaining to domain-aware performance estimation is undertaken. Finally, a novel technique to enforce constraint compliance in models without training is introduced, which pairs a search strategy with large language models to achieve cutting-edge performance. A key highlight of this dissertation is the emphasis on addressing pertinent real-world problems with scalable and practicable solutions. We hope the results presented here pave the way for wider adoption of deep learning-based systems in pivotal situations with enhanced confidence in their trustworthiness.
856 _uhttps://dspace.isical.ac.in/jspui/handle/10263/7608
_yFull text
942 _2ddc
_cTH
999 _c437316
_d437316