Publication: Precise event sampling on AMD versus intel: quantitative and qualitative comparison
dc.contributor.coauthor | Chabbi, Milind | |
dc.contributor.coauthor | Kelly, Paul H. J. | |
dc.contributor.department | Department of Computer Engineering | |
dc.contributor.kuauthor | Sasongko, Muhammad Aditya | |
dc.contributor.kuauthor | Erten, Didem Unat | |
dc.contributor.schoolcollegeinstitute | College of Engineering | |
dc.date.accessioned | 2025-01-19T10:34:09Z | |
dc.date.issued | 2023 | |
dc.description.abstract | Precise event sampling is a profiling feature in commodity processors that can sample hardware events and accurately locate the instructions that trigger the events. This feature has been used in a large number of tools to detect application performance issues. Although precise event sampling is readily supported in modern multicore architectures, vendor supports exhibit great differences that affect their accuracy, stability, overhead, and functionality. This work presents the most comprehensive study to date on benchmarking the event sampling features of Intel PEBS and AMD IBS and performs in-depth analysis on key differences through series of microbenchmarks. Our qualitative and quantitative analysis shows that PEBS allows finer-grained and more accurate sampling of hardware events, while IBS offers richer set of information at each sample though it suffers from lower accuracy and stability. Moreover, OS signal delivery, which is a common method used by the profiling software, introduces significant time overhead to the original overhead incurred by the hardware mechanisms in both PEBS and IBS. We also found that both PEBS and IBS have bias in sampling events across multiple different locations in a code. Lastly, we demonstrate how our findings on microbenchmarks under different thread counts hold for a full-fledged profiling tool that runs on the state-of-the-art Intel and AMD machines. Overall our detailed comparisons serve as a great reference and provide invaluable information for hardware designers and profiling tool developers. | |
dc.description.indexedby | WOS | |
dc.description.indexedby | Scopus | |
dc.description.issue | 5 | |
dc.description.openaccess | Green Published, hybrid | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | Manuscript received 24 June 2022; revised 17 February 2023; accepted 9 March 2023. Date of publication 14 March 2023; date of current version 24 March 2023. This work was supported in part by the European Research Council (ERC) under the European Union's Horizon 2020 research and innovation programme under Grant 949587 and in part by the Scientific and Technological Research Council of Turkey (TUBITAK), under Grant 120E492. The work of Didem Unat was supported in part by the Royal Society-Newton Advanced Fellowship. Recommended for acceptance by T. Estrada. (Corresponding author: Muhammad Aditya Sasongko.) | |
dc.description.volume | 34 | |
dc.identifier.doi | 10.1109/TPDS.2023.3257105 | |
dc.identifier.eissn | 1558-2183 | |
dc.identifier.issn | 1045-9219 | |
dc.identifier.quartile | Q1 | |
dc.identifier.scopus | 2-s2.0-85151425065 | |
dc.identifier.uri | https://doi.org/10.1109/TPDS.2023.3257105 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/26745 | |
dc.identifier.wos | 958579600003 | |
dc.keywords | Registers | |
dc.keywords | Hardware | |
dc.keywords | Monitoring | |
dc.keywords | Phasor measurement units | |
dc.keywords | Instruction sets | |
dc.keywords | Benchmark testing | |
dc.keywords | Software | |
dc.keywords | Precise event sampling | |
dc.keywords | PMUs | |
dc.keywords | Profiling | |
dc.language.iso | eng | |
dc.publisher | IEEE Computer Soc | |
dc.relation.grantno | European Research Council (ERC) - uropean Union's Horizon 2020 research and innovation programme [949587]; Scientific and Technological Research Council of Turkey (TUBITAK) [120E492]; Royal Society-Newton Advanced Fellowship; European Research Council (ERC) [949587] Funding Source: European Research Council (ERC) | |
dc.relation.ispartof | IEEE Transactions on Parallel and Distributed Systems | |
dc.subject | Computer Science | |
dc.title | Precise event sampling on AMD versus intel: quantitative and qualitative comparison | |
dc.type | Journal Article | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Sasongko, Muhammad Aditya | |
local.contributor.kuauthor | Erten, Didem Unat | |
local.publication.orgunit1 | College of Engineering | |
local.publication.orgunit2 | Department of Computer Engineering | |
relation.isOrgUnitOfPublication | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isOrgUnitOfPublication.latestForDiscovery | 89352e43-bf09-4ef4-82f6-6f9d0174ebae | |
relation.isParentOrgUnitOfPublication | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 8e756b23-2d4a-4ce8-b1b3-62c794a8c164 |
Files
Original bundle
1 - 1 of 1