Publication: Correlative information maximization: a biologically plausible approach to supervised deep neural networks without weight symmetry
dc.contributor.coauthor | Pehlevan, Cengiz | |
dc.contributor.department | Department of Electrical and Electronics Engineering | |
dc.contributor.kuauthor | Bozkurt, Barışcan | |
dc.contributor.kuauthor | Erdoğan, Alper Tunga | |
dc.contributor.other | Department of Electrical and Electronics Engineering | |
dc.contributor.researchcenter | Koç Üniversitesi İş Bankası Yapay Zeka Uygulama ve Araştırma Merkezi (KUIS AI)/ Koç University İş Bank Artificial Intelligence Center (KUIS AI) | |
dc.contributor.schoolcollegeinstitute | Graduate School of Sciences and Engineering | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.date.accessioned | 2024-12-29T09:36:09Z | |
dc.date.issued | 2023 | |
dc.description.abstract | The backpropagation algorithm has experienced remarkable success in training large-scale artificial neural networks;however, its biological plausibility has been strongly criticized, and it remains an open question whether the brain employs supervised learning mechanisms akin to it. Here, we propose correlative information maximization between layer activations as an alternative normative approach to describe the signal propagation in biological neural networks in both forward and backward directions. This new framework addresses many concerns about the biological-plausibility of conventional artificial neural networks and the backpropagation algorithm. The coordinate descent-based optimization of the corresponding objective, combined with the mean square error loss function for fitting labeled supervision data, gives rise to a neural network structure that emulates a more biologically realistic network of multi-compartment pyramidal neurons with dendritic processing and lateral inhibitory neurons. Furthermore, our approach provides a natural resolution to the weight symmetry problem between forward and backward signal propagation paths, a significant critique against the plausibility of the conventional backpropagation algorithm. This is achieved by leveraging two alternative, yet equivalent forms of the correlative mutual information objective. These alternatives intrinsically lead to forward and backward prediction networks without weight symmetry issues, providing a compelling solution to this long-standing challenge. | |
dc.description.indexedby | WoS | |
dc.description.indexedby | Scopus | |
dc.description.publisherscope | International | |
dc.description.sponsors | This research was supported by KUIS AI Center Research Award. B. Bozkurt acknowledges the support by Gatsby PhD programme, which is supported by the Gatsby Charitable Foundation (GAT3850). C. Pehlevan is supported by NSF Award DMS-2134157, NSF CAREER Award IIS-2239780, and a Sloan Research Fellowship. This work has been made possible in part by a gift from the Chan Zuckerberg Initiative Foundation to establish the Kempner Institute for the Study of Natural and Artificial Intelligence. | |
dc.identifier.issn | 1049-5258 | |
dc.identifier.quartile | N/A | |
dc.identifier.scopus | 2-s2.0-85191197285 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/21968 | |
dc.identifier.wos | 1220600006015 | |
dc.keywords | Biological neural networks | |
dc.keywords | Coordinate descent | |
dc.keywords | Forward-and-backward | |
dc.keywords | Information maximization | |
dc.keywords | Large-scales | |
dc.keywords | Learning mechanism | |
dc.keywords | Means square errors | |
dc.keywords | Optimisations | |
dc.keywords | Signal propagation | |
dc.keywords | Weight symmetry | |
dc.language | en | |
dc.publisher | Neural Information Processing Systems 36 | |
dc.source | Advances in Neural Information Processing Systems 36 (NeurIPS 2023) | |
dc.subject | Computer science | |
dc.subject | Artificial intelligence | |
dc.subject | Information systems | |
dc.title | Correlative information maximization: a biologically plausible approach to supervised deep neural networks without weight symmetry | |
dc.type | Conference proceeding | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Bozkurt, Barışcan | |
local.contributor.kuauthor | Erdoğan, Alper Tunga | |
relation.isOrgUnitOfPublication | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 | |
relation.isOrgUnitOfPublication.latestForDiscovery | 21598063-a7c5-420d-91ba-0cc9b2db0ea0 |