Interpretable cancer stage classification using sparse bayesian neural networks

Publication:
Interpretable cancer stage classification using sparse bayesian neural networks

dc.contributor.advisor	Gönen, Mehmet
dc.contributor.department	Graduate School of Sciences and Engineering
dc.contributor.kuauthor	Yurdakul, Hazal Hasret
dc.contributor.program	Computer Sciences and Engineering
dc.contributor.schoolcollegeinstitute	GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.coverage.spatial	İstanbul
dc.date.accessioned	2025-06-30T04:36:18Z
dc.date.available	2025-03-25
dc.date.issued	2024
dc.description.abstract	Cancer requires an in-depth exploration of its molecular characteristics. For targeted treatment strategies, distinguishing cancer stages is essential. This thesis has focused on the differentiation of early- and late-stage cancers using gene expression profiles. With the integration of computational techniques into medical research, machine learning models excel in this task, offering insights into biological mechanisms. To harness these insights, we proposed a novel approach, which is Bayesian Neural Networks (BNNs) with sparsity-inducing priors. The proposed sparse BNNs are designed to deliver high predictive performance in identifying cancer stages while maintaining a high level of interpretability. To evaluate our sparse BNN models, we benchmarked them against three machine learning algorithms across 15 different cancer cohorts. The results of our study revealed that our sparse BNN models achieve predictive performances comparable to traditional benchmark models. Additionally, we addressed the black-box issue of neural networks in medicine, which obscures which input features are crucial for predictions, a serious issue in decision-making with significant implications. To address this issue, our primary contribution has been the development of a novel BNN architecture that considerably enhances data interpretability. In our approach, we have integrated three types of sparsity inducing priors, namely, Laplace, Student's t, and Spike-and-Slab. Each prior has a mean of zero and low variance, promoting a reduction in connections and thus enabling a focused feature selection process. This methodology allows us to identify and concentrate on the most influential gene expressions. Our analysis revealed that sparse BNNs show a distinct preference for specific gene sets. In conclusion, the development of sparse BNNs offers a biologically informative and interpretative tool, enhancing the field of cancer research by shedding light on key gene pathways and significantly improving the process of cancer staging.
dc.description.abstract	Kanser moleküler özelliklerinin derinlemesine incelenmesini gerektirir. Hedeflenen tedavi stratejileri için kanser evrelerinin ayırt edilmesi esastır. Bu tez, gen ifade profillerini kullanarak erken ve geç evre kanserlerin ayrımına odaklanmıştır. Hesaplama tekniklerinin tıbbi araştırmalarla bütünleşmesiyle, makine öğrenimi modelleri bu görevde üstün başarı göstermektedir ve biyolojik mekanizmalara dair içgörüler sunmaktadır. Bu içgörüleri kullanabilmek için, seyreklik teşvik eden öncüllerle Bayesian Sinir Ağları (BNN) adında yeni bir yaklaşım öneriyoruz. Önerilen seyrek BNN'ler, kanser evrelerini belirlemede yüksek tahmin performansı sunarken yüksek düzeyde yorumlanabilirlik sağlamak için tasarlanmıştır. Seyrek BNN modellerimizi değerlendirmek için, onları 15 farklı kanser kohortunda üç makine öğrenimi algoritması ile karşılaştırdık. Çalışmamızın sonuçları, seyrek BNN modellerimizin geleneksel referans modellerle karşılaştırılabilir tahmin performanslarına ulaştığını göstermektedir. Ayrıca, sinir ağlarının tıpta kara kutu sorununu ele alıyoruz. Bu sorun, hangi girdi özelliklerinin tahminler için kritik olduğunu belirsiz bırakır ve önemli sonuçları olan karar verme sürecinde ciddi bir sorundur. Bu sorunu ele almak için, ana katkımız veri yorumlanabilirliğini önemli ölçüde artıran yeni bir BNN mimarisi geliştirmek olmuştur. Yaklaşımımızda, üç tür seyreklik teşvik eden öncülleri entegre ettik: Laplace, Student t ve Sivri ve Taban. Her bir öncülün sıfır ortalaması ve düşük varyansı vardır, bu da bağlantıların azaltılmasını teşvik eder ve böylece odaklanmış bir özellik seçimi sürecini mümkün kılar. Bu metodoloji, en etkili gen ifadelerini belirlememize ve üzerine odaklanmamıza olanak tanır. Analizimiz, seyrek BNN'lerin belirli gen setlerine belirgin bir tercih gösterdiğini ortaya koymaktadır. Sonuç olarak, seyrek BNN'lerin geliştirilmesi, kanser araştırmaları alanını anahtar gen yollarını aydınlatarak ve kanser evreleme sürecini önemli ölçüde iyileştirerek biyolojik olarak bilgilendirici ve yorumlanabilir bir araç sunar.
dc.description.fulltext	Yes
dc.format.extent	xiii, 65 leaves : illustrations ; 30 cm.
dc.identifier.embargo	No
dc.identifier.endpage	78
dc.identifier.filenameinventoryno	T_2024_017_GSSE
dc.identifier.uri	https://hdl.handle.net/20.500.14288/29812
dc.identifier.yoktezid	851075
dc.identifier.yoktezlink	https://tez.yok.gov.tr/UlusalTezMerkezi/TezGoster?key=cr4SkWLaRMhkDRBjqthpsXBQxDPbcxKL_Jp5xlSpl3vYX0Cw4iBsw_1APqWL43K0
dc.language.iso	eng
dc.publisher	Koç University
dc.relation.collection	KU Theses and Dissertations
dc.rights	restrictedAccess
dc.rights.copyrightsnote	© All Rights Reserved. Accessible to Koç University Affiliated Users Only!
dc.subject	Machine learning
dc.subject	Biology, Research
dc.subject	Medicine, Research
dc.subject	Computational intelligence
dc.title	Interpretable cancer stage classification using sparse bayesian neural networks
dc.title.alternative	Seyrek bayesian sinir ağları ile yorumlanabilir kanser evre sınıflandırması.
dc.type	Thesis
dspace.entity.type	Publication
local.contributor.kuauthor	Yurdakul, Hazal Hasret

Collections

Theses & Dissertations

Publication: Interpretable cancer stage classification using sparse bayesian neural networks

Files

Collections

Publication:
Interpretable cancer stage classification using sparse bayesian neural networks