Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/579599
Full metadata record
DC FieldValueLanguage
dc.coverage.spatial
dc.date.accessioned2024-07-30T05:02:36Z-
dc.date.available2024-07-30T05:02:36Z-
dc.identifier.urihttp://hdl.handle.net/10603/579599-
dc.description.abstractHealthcare has been undergoing a data-driven transformation, further accelerated by the COVID-19 pandemic. A significant amount of healthcare data is unstructured and underutilized. The success of Large Language Models (LLMs) in achieving human-like conversations has unlocked their potential in healthcare. For example, language models can help improve patient outcomes through temporal decision support, early warning systems, and clinical risk assessment. Through our work, we have explained how language models can assist in pandemic preparedness and support decision-making processes in critical care. Integrated frameworks incorporating machine learning, deep learning, and language models have been developed to effectively track and analyze temporal changes in unstructured healthcare data, to make informed decisions, and to enhance patient outcomes in a dynamic healthcare landscape. In this thesis, my first contribution was a deep learning based language model for modeling the spike region of COVID-19 genome sequences. This led to novel knowledge discovery and real-world implementation for predicting pandemic progression, StrainFlow, which successfully captured COVID-19 caseloads two months ahead of their occurrence. The integrative framework for language models, statistical features and machine learning to capture the temporal changes in the semantics of the genomic sequence was deployed as a publicly available web-application. In my second contribution, I constructed language models on COVID-19 scientific literature to track and predict emerging scientific evidence. The findings of this contribution illustrated that temporal changes in unsupervised word embeddings of scientific literature effectively captured and tracked new knowledge. Additionally, my work leveraged machine learning techniques and predicted emerging themes based on evolving word associations. This was also implemented as an openly available web application called EvidenceFlow. In my third contribution, I developed language models on unstructured clinical notes data from intensive care units (ICU) for prognosticating critical outcomes. Shock Index (SI) is a commonly employed prognostic indicator used in intensive care units (ICU) and emergency settings to assess patient outcomes. We developed a comprehensive multimodal early warning system (EWS) utilizing an integrated framework combining machine learning, deep learning, and language models. The framework leverages routinely available vital signs and clinical notes data to detect abnormal shock index and provide timely alerts for potential deteriorations in patient health. This model is planned to be evaluated prospectively for real-world clinical decision making, which is outside the scope of my thesis. In our final contribution, I contributed to the development and deployment of an end-to-end language model pipeline and android application, WashKaro, for raising WASH awareness during the COVID-19 pandemic. This was one of the first AI-based information dissemination applications built during COVID-19, which provided both Hindi and English bite sized text and audio based upon text summarization, word embedding similarities and text-to-speech technologies using advanced NLP methods. The application and research publication also demonstrated the user-feedback based improvement of our AI model, providing pointers for designing public health intervention systems for pandemic preparedness. Overall, my thesis contributed to the development, evaluation, and deployment of language model based technologies in ICU and pandemic preparedness settings, specifically in the setting of future predictions and early warning systems using temporal data. The findings contribute to advancing knowledge and methodologies while assisting medical practitioners and policymakers in effectively responding to disease outbreaks and formulating data and AI-augmented policy for healthcare settings.
dc.format.extent159 p.
dc.languageEnglish
dc.relation
dc.rightsuniversity
dc.titleLanguage models for temporal decisions in health datasets
dc.title.alternative
dc.creator.researcherPal, Ridam
dc.subject.keywordBiology
dc.subject.keywordBiology and Biochemistry
dc.subject.keywordLife Sciences
dc.description.note
dc.contributor.guideSethi, Tavpritesh
dc.publisher.placeDelhi
dc.publisher.universityIndraprastha Institute of Information Technology, Delhi (IIIT-Delhi)
dc.publisher.institutionDepartment of Computational Biology
dc.date.registered
dc.date.completed2024
dc.date.awarded2024
dc.format.dimensions
dc.format.accompanyingmaterialNone
dc.source.universityUniversity
dc.type.degreePh.D.
Appears in Departments:Department of Computational Biology

Files in This Item:
File Description SizeFormat 
01_title.pdfAttached File73.86 kBAdobe PDFView/Open
02_prelim pages.pdf217.58 kBAdobe PDFView/Open
03_content.pdf126.01 kBAdobe PDFView/Open
04_abstract.pdf137.36 kBAdobe PDFView/Open
05_chapter 1.pdf466.57 kBAdobe PDFView/Open
06_chapter 2.pdf688.85 kBAdobe PDFView/Open
07_chapter 3.pdf3.06 MBAdobe PDFView/Open
08_chapter 4.pdf1.62 MBAdobe PDFView/Open
09_chapter 5.pdf2.04 MBAdobe PDFView/Open
10_annexures.pdf463.18 kBAdobe PDFView/Open
11_chapter 6.pdf684.29 kBAdobe PDFView/Open
80_recommendation.pdf172.97 kBAdobe PDFView/Open


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).

Altmetric Badge: