Publication: A comparison of scoring algorithms for the NIH Toolbox executive function tasks in a US norming sample
dc.contributor.coauthor | Shono, Yusuke | |
dc.contributor.coauthor | Ece, Berivan | |
dc.contributor.coauthor | Ho, Emily H. | |
dc.contributor.coauthor | Kaat, Aaron J. | |
dc.contributor.coauthor | La Forte, Erica M. | |
dc.contributor.coauthor | Gershon, Richard | |
dc.contributor.department | Department of Psychology | |
dc.contributor.kuauthor | Aytürk, Ezgi | |
dc.contributor.schoolcollegeinstitute | College of Social Sciences and Humanities | |
dc.date.accessioned | 2025-03-06T21:00:22Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Executive function (EF) has been extensively linked to various behavioral, clinical, and educational outcomes. There have been, however, few systematic investigations into how best to score EF tasks using speed and accuracy performance, particularly how to generate a summary and norm-referenced score. Using data from an updated norming study for the NIH Toolbox Version 3 (NIHTB V3) with the general U.S. population aged between 3 and 85 (N = 3,794;52.3% female;M-age = 25.06, SDage = 22.92), we empirically evaluated and compared several scoring algorithms for two EF tests: The Dimensional Change Card Sort (a test of cognitive flexibility) and Flanker (a test of inhibitory control) Tests. Results showed that joint scoring algorithms integrating speed and accuracy into single scores (namely, rate-correct score, linear integrated speed-accuracy score, and speed-accuracy additive score) provided more robust psychometric evidence for the EF tests than single-index scores of accuracy and speed. These integrated speed-accuracy scores were consistent and stable within and across tasks and time;similar to that of another well-validated EF measure, but as predicted, not related to a crystallized intelligence measure score;and increased rapidly from early childhood through late adolescence/early adulthood and then declined toward late adulthood. The rate-correct score was particularly free from ceiling effects and sensitive to age-related changes and variability in EF performance. Among various scoring algorithms, we recommend rate-correct score, which served as the basis for generating new NIHTB V3 norm-referenced scores, with good test-retest reliability (Dimensional Change Card Sort = .77, Flanker = .81) and acceptable convergent and discriminant validity. | |
dc.description.indexedby | WOS | |
dc.description.indexedby | Scopus | |
dc.description.indexedby | PubMed | |
dc.description.publisherscope | International | |
dc.description.sponsoredbyTubitakEu | N/A | |
dc.description.sponsorship | This work was supported by National Institutes of Health Office of the Director (U24OD023319, principal investigators: Richard Gershon and David Cella), National Institute on Aging (U2CAG057441, principal investigators: Richard Gershon and Sandra Weintraub and U2CAG060426, principal investigators: Richard Gershon, Aaron J. Kaat, Lara Mangravite, May Dorene, and Michael Weiner), and National Institute of Neurological Disorders and Stroke (NIHDS;UG3NS105562, principal investigators: Richard Gershon and Michael Wolf). The authors thank Hubert Adam for his work on the original data curation. | |
dc.identifier.doi | 10.1037/pas0001350 | |
dc.identifier.eissn | 1939-134X | |
dc.identifier.grantno | National Institutes of Health Office of the Director [U24OD023319];National Institute on Aging [U2CAG057441, U2CAG060426];National Institute of Neurological Disorders and Stroke (NIHDS) [UG3NS105562] | |
dc.identifier.issn | 1040-3590 | |
dc.identifier.issue | 12 | |
dc.identifier.quartile | Q1 | |
dc.identifier.scopus | 2-s2.0-85212594139 | |
dc.identifier.uri | https://doi.org/10.1037/pas0001350 | |
dc.identifier.uri | https://hdl.handle.net/20.500.14288/27861 | |
dc.identifier.volume | 36 | |
dc.identifier.wos | 1376961500001 | |
dc.keywords | NIH Toolbox | |
dc.keywords | Executive function | |
dc.keywords | Speed-accuracy tradeoff | |
dc.keywords | Psychometrics | |
dc.keywords | Norming | |
dc.language.iso | eng | |
dc.publisher | American Psychological Association | |
dc.relation.ispartof | Psychological Assessment | |
dc.subject | Psychology, clinical | |
dc.title | A comparison of scoring algorithms for the NIH Toolbox executive function tasks in a US norming sample | |
dc.type | Journal Article | |
dspace.entity.type | Publication | |
local.contributor.kuauthor | Aytürk, Ezgi | |
local.publication.orgunit1 | College of Social Sciences and Humanities | |
local.publication.orgunit2 | Department of Psychology | |
relation.isOrgUnitOfPublication | d5fc0361-3a0a-4b96-bf2e-5cd6b2b0b08c | |
relation.isOrgUnitOfPublication.latestForDiscovery | d5fc0361-3a0a-4b96-bf2e-5cd6b2b0b08c | |
relation.isParentOrgUnitOfPublication | 3f7621e3-0d26-42c2-af64-58a329522794 | |
relation.isParentOrgUnitOfPublication.latestForDiscovery | 3f7621e3-0d26-42c2-af64-58a329522794 |