Publication:
Optimal nonlinearities improve generalization performance of random features

dc.contributor.departmentDepartment of Electrical and Electronics Engineering
dc.contributor.departmentGraduate School of Sciences and Engineering
dc.contributor.departmentKUIS AI (Koç University & İş Bank Artificial Intelligence Center)
dc.contributor.kuauthorDemir, Samet
dc.contributor.kuauthorDoğan, Zafer
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.contributor.schoolcollegeinstituteGRADUATE SCHOOL OF SCIENCES AND ENGINEERING
dc.contributor.schoolcollegeinstituteResearch Center
dc.date.accessioned2024-12-29T09:36:26Z
dc.date.issued2023
dc.description.abstractRandom feature model with a nonlinear activation function has been shown to perform asymptotically equivalent to a Gaussian model in terms of training and generalization errors. Analysis of the equivalent model reveals an important yet not fully understood role played by the activation function. To address this issue, we study the "parameters" of the equivalent model to achieve improved generalization performance for a given supervised learning problem. We show that acquired parameters from the Gaussian model enable us to define a set of optimal nonlinearities. We provide two example classes from this set, e.g., second-order polynomial and piecewise linear functions. These functions are optimized to improve generalization performance regardless of the actual form. We experiment with regression and classification problems, including synthetic and real (e.g., CIFAR10) data. Our numerical results validate that the optimized nonlinearities achieve better generalization performance than widely-used nonlinear functions such as ReLU. Furthermore, we illustrate that the proposed nonlinearities also mitigate the so-called double descent phenomenon, which is known as the non-monotonic generalization performance regarding the sample size and the model size.
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipWe acknowledge that this work was supported in part by TUBITAK 2232 International Fellowship for Outstanding Researchers Award (No. 118C337) and an AI Fellowship provided by Koc University &. Is Bank Artificial Intelligence (KUIS AI) Research Center.
dc.description.volume222
dc.identifier.issn2640-3498
dc.identifier.quartileN/A
dc.identifier.scopus2-s2.0-85189637296
dc.identifier.urihttps://hdl.handle.net/20.500.14288/22063
dc.identifier.wos1221095300017
dc.keywordsRandom feature model
dc.keywordsGeneralization performance
dc.keywordsActivation functions
dc.keywordsGaussian equivalence conjecture
dc.keywordsUniversality
dc.keywordsDouble descent phenomenon
dc.language.isoeng
dc.publisherJMLR-Jornal Machine Learning Research
dc.relation.ispartofAsian Conference on Machine Learning Vol 222
dc.subjectComputer science, artificial intelligence
dc.subjectComputer science, theory and methods
dc.subjectStatistics and probability
dc.titleOptimal nonlinearities improve generalization performance of random features
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorDemir, Samet
local.contributor.kuauthorDoğan, Zafer
local.publication.orgunit1College of Engineering
local.publication.orgunit1GRADUATE SCHOOL OF SCIENCES AND ENGINEERING
local.publication.orgunit1Research Center
local.publication.orgunit2Department of Electrical and Electronics Engineering
local.publication.orgunit2KUIS AI (Koç University & İş Bank Artificial Intelligence Center)
local.publication.orgunit2Graduate School of Sciences and Engineering
relation.isOrgUnitOfPublication21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isOrgUnitOfPublication3fc31c89-e803-4eb1-af6b-6258bc42c3d8
relation.isOrgUnitOfPublication77d67233-829b-4c3a-a28f-bd97ab5c12c7
relation.isOrgUnitOfPublication.latestForDiscovery21598063-a7c5-420d-91ba-0cc9b2db0ea0
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication434c9663-2b11-4e66-9399-c863e2ebae43
relation.isParentOrgUnitOfPublicationd437580f-9309-4ecb-864a-4af58309d287
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files

Original bundle

Now showing 1 - 1 of 1
Thumbnail Image
Name:
IR04696.pdf
Size:
1.67 MB
Format:
Adobe Portable Document Format