Publication:
Low-overhead reuse distance profiling tool for multicore

dc.contributor.coauthorChabbi, Milind
dc.contributor.departmentDepartment of Computer Engineering
dc.contributor.kuauthorErten, Didem Unat
dc.contributor.kuauthorSasongko, Muhammad Aditya
dc.contributor.schoolcollegeinstituteCollege of Engineering
dc.date.accessioned2024-11-09T22:57:06Z
dc.date.issued2022
dc.description.abstractWith the increase in core count in multicore systems, data movement is one of the main sources of performance slowdown in parallel applications and data locality has become a critical factor in application optimization. One of the important locality metrics is reuse distance, which shows the likelihood of a memory access to be a cache hit. In this work, we propose a low-overhead reuse distance profiling tool for multi-threaded applications. Our method relies on available hardware features in commodity CPUs, namely, Performance Monitoring Units (PMUs) and debug registers, to detect data reuse in private and shared caches by considering inter-thread cache line invalidations. Unlike prior approaches, our tool is fast, accurate, does not change the program behavior and can also handle shared cache accesses. Though it has low runtime (2.9x) and memory overheads (2.8x), our tool achieves 92% accuracy.
dc.description.indexedbyWOS
dc.description.indexedbyScopus
dc.description.openaccessNO
dc.description.publisherscopeInternational
dc.description.sponsoredbyTubitakEuTÜBİTAK
dc.description.sponsorshipScientific and Technological Research Council of Turkey (TUBITAK) [120E492]
dc.description.sponsorshipRoyal Society-Newton Advanced Fellowship The work is supported by the Scientific and Technological Research Council of Turkey (TUBITAK), Grant no. 120E492. Dr. Didem Unat is supported by the Royal Society-Newton Advanced Fellowship.
dc.description.volume13098
dc.identifier.doi10.1007/978-3-031-06156-1_49
dc.identifier.eissn1611-3349
dc.identifier.isbn978-3-031-06156-1
dc.identifier.isbn978-3-031-06155-4
dc.identifier.issn0302-9743
dc.identifier.scopus2-s2.0-85132978136
dc.identifier.urihttps://doi.org/10.1007/978-3-031-06156-1_49
dc.identifier.urihttps://hdl.handle.net/20.500.14288/7481
dc.identifier.wos851509300049
dc.keywordsReuse distance
dc.keywordsHardware performance counters
dc.keywordsDebug registers
dc.keywordsAddress sampling
dc.language.isoeng
dc.publisherSpringer International Publishing Ag
dc.relation.ispartofEuro-Par 2021: Parallel Processing Workshops
dc.subjectComputer science
dc.titleLow-overhead reuse distance profiling tool for multicore
dc.typeConference Proceeding
dspace.entity.typePublication
local.contributor.kuauthorSasongko, Muhammad Aditya
local.contributor.kuauthorErten, Didem Unat
local.publication.orgunit1College of Engineering
local.publication.orgunit2Department of Computer Engineering
relation.isOrgUnitOfPublication89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isOrgUnitOfPublication.latestForDiscovery89352e43-bf09-4ef4-82f6-6f9d0174ebae
relation.isParentOrgUnitOfPublication8e756b23-2d4a-4ce8-b1b3-62c794a8c164
relation.isParentOrgUnitOfPublication.latestForDiscovery8e756b23-2d4a-4ce8-b1b3-62c794a8c164

Files