Classification of patients with chronic disease by activation level using machine learning methods
Publication Date
Advisor
Institution Author
Sakarya, Sibel
Güneş, Evrim Didem
Co-Authors
Demiray, Onur
Kulak, Ercan
Dogan, Emrah
Karaketir, Seyma Gorcin
Cifcili, Serap
Akman, Mehmet
Journal Title
Journal ISSN
Volume Title
Publisher:
Springer
Type
Abstract
Patient Activation Measure (PAM) measures the activation level of patients with chronic conditions and correlates well with patient adherence behavior, health outcomes, and healthcare costs. PAM is increasingly used in practice to identify patients needing more support from the care team. We define PAMlevels 1 and 2 as low PAM and investigate the performance of eight machine learning methods (Logistic Regression, Lasso Regression, Ridge Regression, Random Forest, Gradient Boosted Trees, Support Vector Machines, Decision Trees, Neural Networks) to classify patients. Primary data collected from adult patients (n=431) with Diabetes Mellitus (DM) or Hypertension (HT) attending Family Health Centers in Istanbul, Turkey, is used to test the methods. 44.5% of patients in the dataset have a low PAM level. Classification performance with several feature sets was analyzed to understand the relative importance of different types of information and provide insights. The most important features are found as whether the patient performs self-monitoring, smoking and exercise habits, education, and socio-economic status. The best performance was achieved with the Logistic Regression algorithm, with Area Under the Curve (AUC)=0.72 with the best performing feature set. Alternative feature sets with similar prediction performance are also presented. The prediction performance was inferior with an automated feature selection method, supporting the importance of using domain knowledge in machine learning.
Description
Subject
Health policy, Services