Publication: Discriminating early- and late-stage cancers using multiple kernel learning on gene sets
dc.contributor.coauthor | N/A | |
dc.contributor.department | N/A | |
dc.contributor.department | Department of Industrial Engineering | |
dc.contributor.kuauthor | Rahimi, Arezou | |
dc.contributor.kuauthor | Gönen, Mehmet | |
dc.contributor.kuprofile | PhD Student | |
dc.contributor.kuprofile | Faculty Member | |
dc.contributor.other | Department of Industrial Engineering | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.contributor.yokid | N/A | |
dc.contributor.yokid | 237468 | |
dc.date.accessioned | 2024-11-10T00:07:16Z | |
dc.date.issued | 2018 | |
dc.description.abstract | Motivation: Identifying molecular mechanisms that drive cancers from early to late stages is highly important to develop new preventive and therapeutic strategies. Standard machine learning algorithms could be used to discriminate early- and late-stage cancers from each other using their genomic characterizations. Even though these algorithms would get satisfactory predictive performance, their knowledge extraction capability would be quite restricted due to the highly correlated nature of genomic data. That is why we need algorithms that can also extract relevant information about these biological mechanisms using our prior knowledge about pathways/gene sets. Results: In this study, we addressed the problem of separating early- and late-stage cancers from each other using their gene expression profiles. We proposed to use a multiple kernel learning (MKL) formulation that makes use of pathways/gene sets (i) to obtain satisfactory/improved predictive performance and (ii) to identify biological mechanisms that might have an effect in cancer progression. We extensively compared our proposed MKL on gene sets algorithm against two standard machine learning algorithms, namely, random forests and support vector machines, on 20 diseases from The Cancer Genome Atlas cohorts for two different sets of experiments. Our method obtained statistically significantly better or comparable predictive performance on most of the datasets using significantly fewer gene expression features. We also showed that our algorithm was able to extract meaningful and disease-specific information that gives clues about the progression mechanism. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.indexedby | PubMed | |
dc.description.issue | 13 | |
dc.description.openaccess | YES | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | TÜBİTAK | |
dc.description.sponsorship | Scientific and Technological Research Council of Turkey (TUBITAK) [EEEAG 117E181]; Turkish Academy of Sciences (TUBA-GEBIP; The Young Scientist Award Program); Science Academy of Turkey (BAGEP; The Young Scientist Award Program) | |
dc.description.sponsorship | This work was supported by the Scientific and Technological Research Council of Turkey (TUBITAK) under Grant EEEAG 117E181. Mehmet Gonen was supported by the Turkish Academy of Sciences (TUBA-GEBIP; The Young Scientist Award Program) and the Science Academy of Turkey (BAGEP; The Young Scientist Award Program). | |
dc.description.volume | 34 | |
dc.identifier.doi | 10.1093/bioinformatics/bty239 | |
dc.identifier.eissn | 1460-2059 | |
dc.identifier.issn | 1367-4803 | |
dc.identifier.quartile | Q1 | |
dc.identifier.scopus | 2-s2.0-85050821994 | |
dc.identifier.uri | http://dx.doi.org/10.1093/bioinformatics/bty239 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/16757 | |
dc.identifier.wos | 438247800047 | |
dc.keywords | Breast-cancer | |
dc.language | English | |
dc.publisher | Oxford Univ Press | |
dc.source | Bioinformatics | |
dc.subject | Biochemical research methods | |
dc.subject | Biotechnology | |
dc.subject | Applied microbiology | |
dc.subject | Computer science | |
dc.subject | Mathematical and computational biology | |
dc.subject | Statistics | |
dc.subject | Probability | |
dc.title | Discriminating early- and late-stage cancers using multiple kernel learning on gene sets | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.authorid | N/A | |
local.contributor.authorid | 0000-0002-2483-075X | |
local.contributor.kuauthor | Rahimi, Arezou | |
local.contributor.kuauthor | Gönen, Mehmet | |
relation.isOrgUnitOfPublication | d6d00f52-d22d-4653-99e7-863efcd47b4a | |
relation.isOrgUnitOfPublication.latestForDiscovery | d6d00f52-d22d-4653-99e7-863efcd47b4a |