Please use this identifier to cite or link to this item: http://hdl.handle.net/10603/595259
Title: A model for churn prediction based on qualitative support interaction features for hotel technology provider
Researcher: De, Soumi
Guide(s): Prabu, P
Keywords: Active Learning,
Churn Prediction,
Computer Science
Computer Science Artificial Intelligence
Engineering and Technology
Topic Classification,
Uncertainty Sampling.
University: CHRIST University
Completed Date: 2024
Abstract: Customer retention is a significant driver of a company's growth. Machine learning has gained immense popularity as a means to predict customers at risk of churn. Churn prediction models are capable of highlighting customers who are at high risk of churn well in advance. A popular approach to improving the performance of churn prediction models is to use input variables that are mainly quantitative and structured in nature. There are limited works in the literature that investigate smart means to effectively utilize and integrate unstructured data into churn prediction models, and study the impact on model efficacy. One of the roadblocks to effectively utilizing unstructured data is the associated cost of annotation, which is both time-consuming and requires intensive manual effort. To overcome this obstacle, researchers often adopt a semi-supervised approach called active learning that aims to achieve state-of-the-art performance using a minimal number of samples. Although active learning boosts classifier performance, the underlying query strategies are unable to eliminate redundancy in the samples selected for manual annotation. Redundant samples lead to increased cost and sub-optimal performance of the learner. Inspired by this challenge, the study proposes a new representation-based query strategy that selects highly informative and representative subsets of samples for manual annotation. The data comprises messages sent by a set of customers to a service provider. A series of experiments is conducted to analyse the effectiveness of the proposed query strategy, called Entropy-based Min Max Similarity (E-MMSIM), in the context of topic classification for churn prediction. The foundation of E-MMSIM is an algorithm that is popularly used to sequence proteins in protein databases. The algorithm is modified and utilized to select the most representative and informative samples. The performance is evaluated using F1-score, AUC and accuracy.
Pagination: x, 89p.;
URI: http://hdl.handle.net/10603/595259
Appears in Departments:Department of Data Science

Files in This Item:
File                                 Description    Size       Format
01_title.pdf                         Attached File  166.39 kB  Adobe PDF
02_prelim pages.pdf                                 920.33 kB  Adobe PDF
03_abstract.pdf                                     127.74 kB  Adobe PDF
04_table_of_contents.pdf                            129.31 kB  Adobe PDF
05_introduction_1.pdf                               172.41 kB  Adobe PDF
06_literature_review_2.pdf                          406.25 kB  Adobe PDF
07_research_methodology_3.pdf                       453.99 kB  Adobe PDF
08_model_for_churn_prediction_4.pdf                 426.97 kB  Adobe PDF
09_results_and_discussion_5.pdf                     2 MB       Adobe PDF
10_conclusion_6.pdf                                 152.55 kB  Adobe PDF
11_annexures.pdf                                    202.56 kB  Adobe PDF
80_recommendation.pdf                               314.99 kB  Adobe PDF


Items in Shodhganga are licensed under Creative Commons Licence Attribution-NonCommercial-ShareAlike 4.0 International (CC BY-NC-SA 4.0).
