Publication:
Discriminating early- and late-stage cancers using multiple kernel learning on gene sets

dc.conference.dateJUL 06-10, 2018
dc.conference.locationChicago, IL
dc.conference.organizer26th Annual Conference on Intelligent Systems for Molecular Biology (ISMB)
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.departmentDepartment of Industrial Engineering
dc.contributor.departmentSchool of Medicine
dc.contributor.facultymemberYes
dc.contributor.kuauthorRahimi, Arezou
dc.contributor.kuauthorGönen, Mehmet
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.contributor.schoolcollegeinstituteSCHOOL OF MEDICINE
dc.date.accessioned2024-11-10T00:07:16Z
dc.date.issued2018
dc.description.abstractMotivation: Identifying molecular mechanisms that drive cancers from early to late stages is highly important to develop new preventive and therapeutic strategies. Standard machine learning algorithms could be used to discriminate early-and late-stage cancers from each other using their genomic characterizations. Even though these algorithms would get satisfactory predictive performance, their knowledge extraction capability would be quite restricted due to highly correlated nature of genomic data. That is why we need algorithms that can also extract relevant information about these biological mechanisms using our prior knowledge about pathways/gene sets. Results: In this study, we addressed the problem of separating early- and late-stage cancers from each other using their gene expression profiles. We proposed to use a multiple kernel learning (MKL) formulation that makes use of pathways/gene sets (i) to obtain satisfactory/improved predictive performance and (ii) to identify biological mechanisms that might have an effect in cancer progression. We extensively compared our proposed MKL on gene sets algorithm against two standard machine learning algorithms, namely, random forests and support vector machines, on 20 diseases from the Cancer Genome Atlas cohorts for two different sets of experiments. Our method obtained statistically significantly better or comparable predictive performance on most of the datasets using significantly fewer gene expression features. We also showed that our algorithm was able to extract meaningful and disease-specific information that gives clues about the progression mechanism.
dc.description.fulltextNo
dc.description.harvestedfromManual
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.indexedbyPubMed
dc.description.openaccessYES
dc.description.peerreviewstatusN/A
dc.description.publisherscopeInternational
dc.description.readpublishN/A
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipScientific and Technological Research Council of Turkey (TUBITAK) [EEEAG 117E181]
dc.description.sponsorshipTurkish Academy of Sciences (TUBA-GEB_IP; The Young Scientist Award Program)
dc.description.sponsorshipScience Academy of Turkey (BAGEP; The Young Scientist Award Program)
dc.description.studentonlypublicationNo
dc.description.studentpublicationYes
dc.description.versionN/A
dc.identifier.doi10.1093/bioinformatics/bty239
dc.identifier.eissn1460-2059
dc.identifier.embargoN/A
dc.identifier.endpage421
dc.identifier.grantno117E181
dc.identifier.issn1367-4803
dc.identifier.pubmed29949993
dc.identifier.quartileQ1
dc.identifier.scopus2-s2.0-85050821994
dc.identifier.startpage412
dc.identifier.urihttps://doi.org/10.1093/bioinformatics/bty239
dc.identifier.urihttps://hdl.handle.net/20.500.14288/16757
dc.identifier.wos000438247800047
dc.keywordsBreast-cancer
dc.language.isoeng
dc.publisherOxford University Press
dc.relation.affiliationKoç University
dc.relation.collectionKoç University Institutional Repository
dc.relation.ispartofBioinformatics
dc.relation.openaccessN/A
dc.rightsN/A
dc.subjectBiochemical research methods
dc.subjectBiotechnology
dc.subjectApplied microbiology
dc.subjectComputer science
dc.subjectMathematical and computational biology
dc.subjectStatistics
dc.subjectProbability
dc.titleDiscriminating early- and late-stage cancers using multiple kernel learning on gene sets
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorRahimi, Arezou
local.contributor.kuauthorGönen, Mehmet
relation.isGoalOfPublicationa9786601-9431-4553-9a46-013bb366fb87
relation.isGoalOfPublication.latestForDiscoverya9786601-9431-4553-9a46-013bb366fb87
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublicationd6d00f52-d22d-4653-99e7-863efcd47b4a
relation.isOrgUnitOfPublicationd02929e1-2a70-44f0-ae17-7819f587bedd
relation.isOrgUnitOfPublication.latestForDiscoveryd6d00f52-d22d-4653-99e7-863efcd47b4a
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublication17f2dc8e-6e54-4fa8-b5e0-d6415123a93e
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files