Evaluating the reliability of functional near-infrared spectroscopy data in the context of a reasoning paradigm

doi:10.64898/2026.01.16.699971

Evaluating the reliability of functional near-infrared spectroscopy data in the context of a reasoning paradigm

2026 · doi:10.64898/2026.01.16.699971

preprint OA: closed

Full text JSON View at publisher

Full text 2,353 characters · extracted from oa-doi-fallback · click to expand

Abstract Functional near-infrared spectroscopy (fNIRS) is a portable, motion-tolerant neuroimaging method particularly well suited for developmental and naturalistic research. To evaluate the utility of fNIRS for studying individual differences and longitudinal changes, we measured activation and functional connectivity during a relational reasoning task in young adults (N = 73). We sought to (1) establish whether fNIRS captures frontoparietal activation patterns consistent with prior fMRI studies using similar paradigms, (2) assess the effect of the amount of data (number of task blocks) on signal strength and precision, (3) assess the paradigm’s measurement properties in the form of intra- and interindividual stability of activation and functional connectivity within and across testing sessions, and (4) examine whether grouping channels into anatomical regions of interest (ROIs) conferred benefits to the above. We observed robust task-evoked activation across lateral prefrontal and parietal cortices, with effect sizes on par with prior fMRI studies. Generally, we observed diminishing returns in effect size and measurement precision beyond ∼7 minutes. Internal consistency and test-retest reliability varied across metrics; while they were very low for a specific task contrast, they were extremely high for functional connectivity, confirming the robustness of channel- and ROI-level connectivity as a stable marker of functional architecture. Exploratory analyses supported prior observations of lower signal quality in participants with darker skin tones and hair, underscoring the need for inclusive methodological strategies. Together, these findings highlight key design considerations for optimizing longitudinal and individual-differences research on higher-level cognition, particularly in diverse and developmentally variable populations. Highlights We measured within- and between-session reliability of fNIRS metrics Collecting more data yielded diminishing returns in effect size and precision There were tradeoffs to aggregating channel data into regions of interest General task activation was more reliable than a specific task contrast Functional connectivity showed extremely high test–retest reliability Competing Interest Statement The authors have declared no competing interest. Footnotes ↵* Joint senior authors

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

⚙ Ask this paper AI returns verbatim quotes from the full text · source: oa-doi-fallback ⓘ

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2026) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc: last seen: 2026-05-20T01:45:00.602351+00:00