A comparison of scoring algorithms for the NIH Toolbox executive function tasks in a US norming sample

dc.contributor.coauthor	Shono, Yusuke
dc.contributor.coauthor	Ece, Berivan
dc.contributor.coauthor	Ho, Emily H.
dc.contributor.coauthor	Kaat, Aaron J.
dc.contributor.coauthor	La Forte, Erica M.
dc.contributor.coauthor	Gershon, Richard
dc.contributor.department	Department of Psychology
dc.contributor.kuauthor	Aytürk, Ezgi
dc.contributor.schoolcollegeinstitute	College of Social Sciences and Humanities
dc.date.accessioned	2025-03-06T21:00:22Z
dc.date.issued	2024
dc.description.abstract	Executive function (EF) has been extensively linked to various behavioral, clinical, and educational outcomes. There have been, however, few systematic investigations into how best to score EF tasks using speed and accuracy performance, particularly how to generate a summary and norm-referenced score. Using data from an updated norming study for the NIH Toolbox Version 3 (NIHTB V3) with the general U.S. population aged between 3 and 85 (N = 3,794; 52.3% female; M_age = 25.06, SD_age = 22.92), we empirically evaluated and compared several scoring algorithms for two EF tests: the Dimensional Change Card Sort (a test of cognitive flexibility) and Flanker (a test of inhibitory control) Tests. Results showed that joint scoring algorithms integrating speed and accuracy into single scores (namely, rate-correct score, linear integrated speed-accuracy score, and speed-accuracy additive score) provided more robust psychometric evidence for the EF tests than single-index scores of accuracy and speed. These integrated speed-accuracy scores were consistent and stable within and across tasks and time; similar to that of another well-validated EF measure, but as predicted, not related to a crystallized intelligence measure score; and increased rapidly from early childhood through late adolescence/early adulthood and then declined toward late adulthood. The rate-correct score was particularly free from ceiling effects and sensitive to age-related changes and variability in EF performance. Among various scoring algorithms, we recommend rate-correct score, which served as the basis for generating new NIHTB V3 norm-referenced scores, with good test-retest reliability (Dimensional Change Card Sort = .77, Flanker = .81) and acceptable convergent and discriminant validity.
dc.description.indexedby	WOS
dc.description.indexedby	Scopus
dc.description.indexedby	PubMed
dc.description.publisherscope	International
dc.description.sponsoredbyTubitakEu	N/A
dc.description.sponsorship	This work was supported by National Institutes of Health Office of the Director (U24OD023319, principal investigators: Richard Gershon and David Cella), National Institute on Aging (U2CAG057441, principal investigators: Richard Gershon and Sandra Weintraub; and U2CAG060426, principal investigators: Richard Gershon, Aaron J. Kaat, Lara Mangravite, May Dorene, and Michael Weiner), and National Institute of Neurological Disorders and Stroke (NINDS; UG3NS105562, principal investigators: Richard Gershon and Michael Wolf). The authors thank Hubert Adam for his work on the original data curation.
dc.identifier.doi	10.1037/pas0001350
dc.identifier.eissn	1939-134X
dc.identifier.grantno	National Institutes of Health Office of the Director [U24OD023319]; National Institute on Aging [U2CAG057441, U2CAG060426]; National Institute of Neurological Disorders and Stroke (NINDS) [UG3NS105562]
dc.identifier.issn	1040-3590
dc.identifier.issue	12
dc.identifier.quartile	Q1
dc.identifier.scopus	2-s2.0-85212594139
dc.identifier.uri	https://doi.org/10.1037/pas0001350
dc.identifier.uri	https://hdl.handle.net/20.500.14288/27861
dc.identifier.volume	36
dc.identifier.wos	1376961500001
dc.keywords	NIH Toolbox
dc.keywords	Executive function
dc.keywords	Speed-accuracy tradeoff
dc.keywords	Psychometrics
dc.keywords	Norming
dc.language.iso	eng
dc.publisher	American Psychological Association
dc.relation.ispartof	Psychological Assessment
dc.subject	Psychology, clinical
dc.title	A comparison of scoring algorithms for the NIH Toolbox executive function tasks in a US norming sample
dc.type	Journal Article
dspace.entity.type	Publication
local.contributor.kuauthor	Aytürk, Ezgi
local.publication.orgunit1	College of Social Sciences and Humanities
local.publication.orgunit2	Department of Psychology
relation.isOrgUnitOfPublication	d5fc0361-3a0a-4b96-bf2e-5cd6b2b0b08c
relation.isOrgUnitOfPublication.latestForDiscovery	d5fc0361-3a0a-4b96-bf2e-5cd6b2b0b08c
relation.isParentOrgUnitOfPublication	3f7621e3-0d26-42c2-af64-58a329522794
relation.isParentOrgUnitOfPublication.latestForDiscovery	3f7621e3-0d26-42c2-af64-58a329522794

Collections

Publications without Fulltext
