Is noxious stimulus-evoked electroencephalography response a reliable, valid, and interpretable outcome measure to assess analgesic efficacy in neonates? A systematic review and individual participant data (IPD) meta-analysis protocol | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Is noxious stimulus-evoked electroencephalography response a reliable, valid, and interpretable outcome measure to assess analgesic efficacy in neonates? A systematic review and individual participant data (IPD) meta-analysis protocol Luke Baxter, Marianne van der Vaart, Maria M Cobo, Patricia Gunawan, and 14 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-4464161/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 26 Jul, 2025 Read the published version in Systematic Reviews → Version 1 posted 5 You are reading this latest preprint version Abstract Background : There are several major challenges limiting our ability to test analgesic efficacy for treatment of neonatal pain, and progress in analgesic drug studies in neonates has stalled. One significant issue is the reliance of clinical pain assessments on traditional behavioural and vital signs-based measures and the exclusion of novel brain-based biomarkers. In this review protocol, we outline our strategy to assess the reliability, validity, and interpretability of an electroencephalography (EEG)-based response biomarker for assessment of acute somatic nociceptive pain in neonates. Methods : To standardise EEG analysis and generate the outcome of interest, we will perform an individual participant data (IPD) meta-analysis using data from neonates aged 34-44 weeks postmenstrual age that have had EEG recorded during acute somatic nociceptive skin-breaking procedures. Relevant data from both published and grey literature will be identified by searching six databases (MEDLINE, Embase, CINAHL, Web of Science, Scopus, Google Scholar), two clinical trial registry platforms (ClinicalTrials.gov, WHO ICTRP), and by consulting expert opinion. We will assess availability bias, data accuracy, and data quality by cross-referencing provided data with data descriptions in the literature, identifying duplicates and non-sensical values, and extracting quality control metrics. Data will be synthesised via a two-stage IPD meta-analysis using a random effects modelling approach grouped by site. Reliability (inter- and intra-rater) outcomes will be measured as Gwet's AC1 coefficient. Validity (known-groups and known-stimuli) outcomes will be measured as EEG response magnitude differences between clinically meaningfully different stimuli. Interpretability will be addressed by providing normative values, in both original and standardised units. Discussion : The purpose of this study is to establish the reliability, validity, and interpretability of a specific EEG-based response biomarker for assessing acute somatic nociceptive pain in neonates. It will provide an overview of available data and how EEG is being used globally to assess acute neonatal pain. If sufficient IPD are made available and the outcome is reliable, valid, and interpretable, this work will support the use of EEG-based outcome measures as primary endpoints in clinical trials assessing analgesic efficacy in neonates. Systematic review registration : The protocol was registered with PROSPERO on 14-July-2023: CRD42023444809. Neonate pain brain electroencephalography biomarkers meta-analysis individual participant data validity reliability interpretability Background Rationale: Hospitalised neonates undergo numerous painful procedures each day as part of routine clinical care, including heel lances, cannulation, intubation, and peripheral arterial/venous line insertions [ 1 ]. Unfortunately, analgesic provision for neonates is ad-hoc, off-label, un-standardized, and suboptimal [ 2 ]. This is primarily due to difficulties in assessing acute pain in neonates due to the subjective nature of pain and the non-verbal nature of the neonatal population, with current pain outcome measures designed for older populations (children as well as adults) not being suitable. These challenges have limited our ability to test the efficacy and safety of analgesics for use in acute pain in this vulnerable population. As such, the progress in analgesic drug studies in neonates has stalled [ 3 ]. There is a need for pain assessment tools in neonates that can accurately evaluate pain and that are specifically intended for use in this vulnerable patient population. More than 40 different multicomponent pain scales have been developed to assess neonatal pain [ 4 ]. This includes some that have been accepted by the U.S. Food and Drug Administration (FDA) as trial endpoints (e.g. LNPS [Leuven Neonatal Pain Score], FLACC [Face, Legs, Activity, Cry, and Consolability], Alder Hey Triage Pain Score) [ 5 , 6 ]. However, there is a clear acknowledgement that these measures are only acceptable in the absence of better alternatives [ 7 ]. To advance the development and licensing of analgesics and improve the success of trials assessing analgesic efficacy, there is an urgent need for better (i.e. greater validity, reliability, and interpretability) primary endpoints to be developed for neonates. In this review, we will assess the potential value of a brain-based pain-relevant response recorded using electroencephalography (EEG). A well-characterized EEG pattern of noxious stimulus-evoked brain activity exhibiting a positive deflection at approximately 560 ms post-stimulation has been recorded in peri-term-aged neonates in response to acute somatic nociceptive skin-breaking procedures such as heel lancing and intramuscular injections [ 8 , 9 ]. The magnitude of this positive deflection can be quantified using a standardized waveform that is scaled to the amplitude of the evoked signal [ 10 ] using linear regression. This can be used to study the influence of analgesic interventions on the magnitude of noxious stimulus-evoked brain activity in randomised clinical trials [ 11 , 12 ]. We suggest that this specific EEG metric of noxious stimulus-evoked brain activity can provide a reliable, valid, and interpretable proxy measure of neonatal pain that is not behaviour-, autonomic-, or haemodynamic-based. This could serve as a useful endpoint to evaluate the efficacy of analgesics in neonatal clinical trials. To assess the reliability, validity, and interpretability of noxious stimulus-evoked EEG activity in neonates for use as an analgesic efficacy endpoint in clinical trials, it is highly desirable to assess these properties across multiple geographic sites and independent research centres. Aggregating the data in the literature using meta-analysis is the gold standard for this research aim. While there are several research centres around the world that are working with neonatal noxious stimulus-evoked EEG data, there are a multitude of EEG pre-processing and analytic methods adopted in the literature. Only one research centre is currently using the EEG methodology outlined above [ 10 ]. Thus, a traditional aggregate data meta-analysis approach is not feasible. Instead, we will adopt the individual participant data (IPD) meta-analysis approach to request relevant raw EEG data that can subsequently be pre-processed and analysed using a standardised analysis pipeline. This will allow the derivation of a single standardised metric across all retrieved data. To assess the reliability, validity, and interpretability of the EEG metric of neonatal acute pain assessment, we will adopt the COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) terminology framework [ 13 , 14 ]. This terminology is closely related to that of the FDA [ 15 ]; the measurement properties of reliability and validity map directly, while the FDA measurement property ‘ability to detect change’ maps onto COSMIN’s measurement property ‘responsiveness’, which COSMIN classify as a subcategory of validity [ 13 ]. However, the FDA framework has been developed with patient-reported outcome (PRO) measures in mind, while the specific EEG metric under investigation is a pharmacodynamic or response biomarker. We have adopted the COSMIN framework due to its more expansive literature base, its broader coverage of both validity and reliability subtypes, as well as its greater clarity around the concept of measurement interpretability. We also provide a Glossary in the Supplementary Information (Table S1 ) to clarify relevant terminology. Objectives: We aim to establish the extent to which the noxious stimulus-evoked EEG measure is a pain-relevant indicator that is suitable for use as a response biomarker in clinical trials of established and novel analgesics. To achieve this, we will assess its validity, reliability, and interpretability, as defined by the COSMIN Taxonomy of Measurement Properties [ 13 , 14 ] (see Supplementary Information Table S1 Glossary). Our primary research question is as follows: “ Is the specific EEG-recorded brain activity magnitude metric under investigation, that is evoked by acute somatic nociceptive skin-breaking procedures in neonates 34–44 weeks PMA, a reliable, valid, and interpretable pain-relevant indicator? ”. The research question is tabulated in PICO format in the Supplementary Information (Table S2 ). When pre-processing EEG data, an initial quality assessment step involves determining whether to include or exclude an individual noxious stimulus-evoked EEG response (i.e. EEG epoch rejection) from the group-level analysis. This step involves a rater’s judgment, thus allowing us to assess rater reliability [ 16 ]. We will assess both inter-rater and intra-rater reliability based on independent raters’ decisions to include or exclude each stimulus-response epoch. To assess validity [ 17 ], we focus on two types of construct validity: known-groups validity and known-stimuli validity (see Glossary for definitions Table S1 ). For known-groups validity, our “known groups” should differ in the sensory intensity component of acute somatic nociceptive pain and are: (i) noxious stimulation occurring with general anaesthetic or non-topical local anaesthetic (e.g. caudal block) and (ii) noxious stimulation without general anaesthetic or non-topical local anaesthetic. The mechanism of action of these anaesthetics is well-established and should reduce noxious stimulation input to the brain (topical local anaesthetics are not included since preliminary evidence suggests topical anaesthetics might not penetrate the skin deep enough to have an analgesic effect [ 18 ]). For known-stimuli validity, our “known stimuli” that should differ in the sensory intensity component of acute somatic nociceptive pain are: (i) acute somatic nociceptive skin-breaking procedures (e.g. heel lance) and (ii) acute somatic non-nociceptive non-skin-breaking procedures (e.g. sham heel lance). The differences between these two stimulus types should be reflected in the amount of noxious stimulation input reaching the brain. Our research questions address the interpretability of values that can be considered at least moderately important in magnitude. As outlined in the validity assessments above, two key contexts that will be assessed are: (i) a noxious skin-breaking procedure with and without anaesthetic (general anaesthetic or non-topical local anaesthetic) and (ii) a noxious skin-breaking procedure compared to a non-noxious non-skin-breaking procedure. Both scenarios compare relatively extreme cases where the magnitude of the effects will be much larger than a minimally important change (MIC). Assessing an MIC will not be possible, as there is no standard methodology to determine pain MICs for neonates. Whether the magnitude of the effects in the two scenarios is considered moderately or substantially important is open to interpretation. To establish interpretability, an anchor-based approach [ 19 ] will be adopted relying on ‘clinician-based assessments’, which means that the interpretation of the two clinically important scenarios outlined above is deemed reasonable from a clinician’s (or researcher’s or parent’s) perspective e.g. the difference between a noxious skin-breaking procedure and a non-noxious non-skin-breaking procedure is important from the perspective of clinicians, researchers, and parents. The anchor is the subjective assessment of the clinicians, researchers, and parents in our consortium, where the qualitative meaning of the quantitative values derives from the clinical meaning of the populations, stimuli, and interventions. To establish interpretability, typical values of the stimulus-evoked EEG response will be assessed to establish normative values for the neonatal population. The magnitude of the clinically important difference in stimulus-evoked EEG responses between noxious and innocuous stimuli will be determined, as will responses with or without anaesthetic. The effect sizes for these normative values and clinically meaningful differences will be reported to allow sample size planning for future randomised controlled trials. In addition to the anchor-based approach, the interpretability results will be supported by a statistics-based distribution-based approach [ 19 ]. We will also use a distribution-based approach to report small, medium, and large effect sizes using established statistical benchmarks. In addition to the three primary objectives of assessing reliability, validity, and interpretability, there are three exploratory objectives to this study. The first is examining the potential differences in noxious stimulus-evoked EEG responses between biologically distinguishable groups, defined using variables such as age, sex, ethnicity, physiological state, and health status; the second is assessing the impact of proposed analgesic interventions (both pharmacological and non-pharmacological) and topical local anaesthetic interventions on the magnitude of the noxious stimulus-evoked EEG response; and the third objective is assessing the association between the noxious stimulus-evoked EEG response and the noxious stimulus-evoked response assessed using non-EEG methods (e.g. behavioural measures, physiological measures, infant clinical pain scales). Methods Protocol structure and registration: This protocol has been structured to conform to PRISMA-P guidelines [ 20 , 21 ], and a completed PRISMA-P checklist is provided in the Supplementary Information. Our protocol was registered with PROSPERO on 14 July 2023: CRD42023444809. An expanded version of the protocol can also be found on the Open Science Framework (OSF)[ 22 ]. Eligibility criteria: We include primary empirical studies with any study design that have EEG recordings during acute somatic nociceptive skin-breaking procedure for neonates aged 34–44 weeks PMA. There are no exclusion criteria based on language or year of dissemination. The eligibility criteria are tabulated in full in Supplementary Information Table S3. Information sources and search strategies: We use three categories of information sources: bibliographic databases, clinical trial registries, and authors of included studies. For electronic searches, the search strategy was initially developed in MEDLINE, then translated for other databases and registries. The search focused on the combination of three core topics: ‘neonate’, ‘pain’, and ‘EEG’. The search strategies were independently peer-reviewed using the PRESS Checklist [ 23 , 24 ] by an Outreach Librarian at the Bodleian Health Care Libraries, University of Oxford. The bibliographic databases (and their platforms) are MEDLINE (Ovid), Embase (Ovid), CINAHL (EBSCO Industries), Web of Science Core Collection (Clarivate Analytics), Scopus (Elsevier), Google Scholar (Publish or Perish). Clinical trial registrations were identified using: ClinicalTrials.gov [ 25 ] and WHO ICTRP [ 26 ]. All search strategies are presented in full in the Supplementary Information. Study records: For the literature review data to identify sources of IPD, all screening (on title and abstract followed by full text) will be performed independently in duplicate, with disagreements resolved through discussion with a third reviewer. Data from bibliographic databases will be managed (deduplicated and screened) using EPPI-Reviewer Web [ 27 ], while data from clinical trial registries will be managed via spreadsheets and in-house scripts with data files stored safely on a University server (scripts will be publicly available via GitHub). Original investigators who have relevant EEG data will be invited to be co-authors on a scoping review, where the summary results of this literature search will be reported, and the accuracy of literature data extraction verified by the original investigators. After identifying original investigators with relevant data, we will also invite them to contribute their IPD for systematic review and meta-analysis and to be co-authors on the publication. All IPD from a single site will be pooled across studies conducted within the same site to form a single dataset per site (see Supplementary Information – Risk of Bias in Individual Studies). To ensure consistent analysis of all IPD across sites, a single researcher competent in implementing the specific analysis pipeline will visit all participating sites to run the analysis pipeline and extract the relevant outcomes and image files for each neonate’s dataset. Relevant non-EEG data (see ‘Data items’ below) will be compiled locally by the participating sites, with central guidance. The IPD dataset will initially be stored locally at each participating site. The summary deidentified outcome measures per neonate and relevant non-EEG variables will be stored in spreadsheet files, shared with researchers at the University of Oxford, and stored securely on a University of Oxford server for subsequent meta-analysis. Image files for each EEG epoch will be stored with the spreadsheets. Finally, the spreadsheets and image files per site will be stored by C-Path on the Rare Disease Cures Accelerator-Data and Analytics Platform (RDCA-DAP), an FDA-funded initiative that provides a centralized and standardized data analytics infrastructure [ 28 ]. Each original investigator will also be invited to share their full IPD dataset with C-Path for storage on RDCA-DAP, to ensure safe and secure data storage that meets FAIR data principles [ 29 , 30 ]. Data items: For the literature review, the data items extracted from the literature will be outlined and defined in the data extraction form and accompanying data dictionary, which will be made publicly available as part of the scoping review publication. For the IPD systematic review and meta-analysis, there are several variables required per individual EEG epoch for the meta-analysis. To address the validity and interpretability research objectives, a response magnitude (regression coefficient) and its associated measure of variance (the squared standard error of the regression coefficient) will be required. To address the reliability research objective, several epoch quality control (QC) metrics as well as image files of the EEG epochs will be required to guide users of the analysis methodology to make data-informed decisions regarding the inclusion or exclusion of each individual epoch. The specific details of how each of the response metrics and QC metrics are extracted will be provided in a forthcoming publication detailing the analysis pipeline. The analysis pipeline will be made publicly available via GitHub or GitLab. Additionally, non-EEG data will be required. To address the validity and interpretability research objectives, we will require information on the nature of the recorded noxious and innocuous stimuli (e.g. heel lance, sham heel lance) as well as whether the baby received general anaesthetic or non-topical local anaesthetic. For the exploratory analyses, we will require information relating to age, sex, ethnicity, physiological state, health status, analgesic interventions (both pharmacological and non-pharmacological), topical local anaesthetic interventions and noxious stimulus-evoked responses assessed using non-EEG methods (e.g. behavioural measures, physiological measures, infant clinical pain scales). The exact data items required for the exploratory analyses will be based on the data extraction form from the scoping review, and determined after data extraction for the scoping review is complete. This ensures that requests for data include items that are collected by the individual sites. For full storage of the raw EEG data in C-Path’s RDCA-DAP, the exact data items required will be defined once data contributions are made and the full requirements are understood. Importantly, the storage of the raw EEG data in C-Path’s RDCA-DAP will not be a prerequisite for addressing all primary objectives of this IPD systematic review and meta-analysis. Outcomes and prioritization: The three primary objectives (assessing reliability, validity, and interpretability) are of equally high priority. The three exploratory objectives (assessing biological groups, analgesic and topical anaesthetic interventions, and non-EEG measures) are of lower prioritization. Reliability outcomes will be measured as Gwet's AC1 coefficient (a chance-corrected agreement coefficient, and a more stable alternative to Cohen’s Kappa) and 95% confidence intervals (CIs) will be reported. Adequacy will be assessed using the Altman's benchmark scale: poor (-1 to 0.2), fair (0.2 to 0.4), moderate (0.4 to 0.6), good (0.6 to 0.8), very good (0.8 to 1). Validity outcomes will be measured as stimulus-evoked EEG response magnitude differences (e.g. noxious minus innocuous, analgesic-negative minus analgesic-positive). Means and 95% CIs will be reported. Significance will be assessed by whether the 95% CI excludes (significant) or includes (not significant) the null value. Adequacy will be assessed using the COSMIN recommendations for hypothesis testing construct validity: at least 75% of the hypotheses should be confirmed. Interpretability outcomes will be measured as stimulus-evoked EEG response magnitudes and magnitude differences. Means and 95% CIs will be reported. Statistical benchmarks of small, medium, and large effect sizes will supplement these outcomes, using established statistical benchmarks: Cohen’s D = 0.2, 0.5, 0.8. For the exploratory outcomes, biological group comparison outcomes will be measured as stimulus-evoked EEG response magnitudes (means and 95% CIs will be reported). Analgesic and topical local anaesthetic intervention assessment outcomes will be measured as stimulus-evoked EEG response magnitudes (means and 95% CIs will be reported). Associations between noxious stimulus-evoked EEG response and noxious stimulus-evoked response using non-EEG methods will be measured with Pearson correlation coefficients and 95% CIs. For composite or multi-component infant pain scales (e.g. PIPP-R), we will assess associations between the EEG response and the overall infant pain scale score. If data availability allows, we will further assess associations between the EEG response and (i) each individual item in the scale, and (ii) the mean value per dimension (e.g. behavioural dimension, physiological dimension) in the scale. Data synthesis: A two-stage IPD analytic approach will be adopted, because it automatically stratifies parameter estimates and residual variances appropriately by analysing each dataset separately in the first stage, utilises well-known meta-analysis methods that offer familiarity and transparency in the second stage, and enables visual summaries using forest plots [ 31 ]. Additionally, it provides the opportunity to assess the variability in primary outcomes by dataset. Where appropriate, a one-stage approach will be considered if most of the included datasets are small (e.g. <20–30 participants) [ 31 ]. We will adopt a random effects modelling approach, as the assumption of a common effect across datasets is unlikely to be reasonable. For completeness, we will additionally report the fixed-effect (i.e. common-effect) result, with the random-effects being the primary outcome. When fitting the random effects model in the second stage, we will use the REML (restricted maximum likelihood) estimation approach and CIs generated using the HKSJ (Hartung-Knapp-Sidik-Jonkman) method [ 32 ]. This will be implemented in R [ 33 ] using the metafor package [ 34 ]. Summary effects will be generated for measures of reliability (inter- and intra-rater agreement), validity (magnitude differences for both known-groups and known-stimuli scenarios), and interpretability (typical magnitude and magnitude difference values), thus summarising the measurement properties for the specific EEG metric under investigation. From these summary effects, standardised effect sizes (Cohen’s D) will be generated, which can be used in future clinical trial sample size planning. Heterogeneity across datasets will be reported using I 2 , tau 2 , and 95% prediction intervals. Risk of bias in individual studies, meta-bias, and confidence in cumulative evidence: Given the nature of the research objectives (establishing novel outcomes of measurement properties and interpretability) and our level of data grouping (data grouped per site rather than per publication), the standard approaches to assessing bias and meta-bias are not directly applicable to this review. We provide an extended discussion of these topics in the Supplementary Information, where the aim to assess availability bias, data accuracy, and data quality is outlined. Similarly, given the nature of the research objectives, we will not formally assess the confidence in cumulative evidence. Discussion The purpose of this systematic review and IPD meta-analysis is to establish the reliability, validity, and interpretability of a specific EEG-based response biomarker [ 10 ] for assessing acute somatic nociceptive pain in neonates. It will provide an overview of the available data and how EEG is being used globally to assess acute neonatal pain. If sufficient IPD are made available and the outcome is demonstrated to be reliable, valid, and interpretable, this work will support the use of EEG-based outcome measures as primary endpoints in clinical trials assessing analgesic efficacy in neonates. Currently, brain-based metrics are not featured in any of the dozens of clinical pain scales for neonatal pain [ 35 ], and we believe this review will help address this key knowledge gap. This review is limited in its ability to establish certain types of reliability, validity, and interpretability, due to the nature of IPD meta-analysis, the nature of pain, and the non-verbal nature of neonates. The following discussion on reliability and validity subtypes and the interpretability of MICs is based on the COSMIN terminology [ 16 , 17 , 19 ]. Given that the EEG metric is a single-item outcome measurement, the internal consistency subtype of reliability is not relevant. Additionally, the reliability subtypes of measurement error and test-retest reliability are not possible. These reliability subtypes rely on the assumption that the patient conditions remain unchanged across repeated applications of painful stimulation (test-retest reliability) – an assumption that is highly questionable and cannot be established – or require that total measurement variance can be partitioned into true biological variance and noise variance (measurement error) – a requirement that cannot be met due to the inherent inability to disambiguate these variance subtypes. Due to the quantitative and data-driven nature of an IPD meta-analysis, assessment of content validity is outside the review’s remit. Content validity (including face validity) is primarily qualitative and theory-driven and can be established using approaches such as questionnaires and expert panel reviews. Due to the subjective nature of pain and the non-verbal nature of neonates, assessing criterion validity is not possible as this type of validity requires the existence of a ‘gold standard’. Neither brain-based (e.g., EEG), behaviour-based, nor vital signs-based measures constitute a gold standard for pain assessment. As this review aims to establish measurement properties and interpretability of a specific single-item EEG metric, assessing structural validity is not relevant. Cross-cultural validity is not relevant, as there is no aspect of the specific EEG metric analysis approach that is language-based that would require translation and cross-cultural validation. Importantly, cross-cultural validity is unrelated to cultural differences in pain experience and response. Thus, there may be cross-cultural differences in noxious stimulus-evoked EEG signal magnitudes, and these potential differences can be explored in this review. However, exploring these potential cultural differences would not constitute an assessment of cross-cultural validity. We also do not assess convergent validity, as establishing the agreement of the specific EEG metric with validated neonatal clinical pain scales (e.g., PIPP-R) should not be a pre-requisite for validating an EEG metric. Given that no existing neonatal pain scale has reached gold standard status and given the mechanistic differences underpinning behavioural, vital signs, and EEG signals, these differing measures may indicate different pain-relevant signals that need not agree. Instead of assessing convergent validity, we will explore (as outlined in our Methods section) the association between the EEG metric and alternative non-EEG measures to help begin to shed light on the degree of overlap among these different measures. Divergent validity is unlikely to be able to be evaluated, as it would require researchers to have used a validated scale during the painful procedure to measure a concept-of-interest that is not pain. As described in the interpretability objectives outlined in the Background section, there is no existing agreed-upon method to determine pain MICs for neonates. Consequently, it cannot be established by this review. However, the scenarios that we will assess compare relatively extreme cases, where the magnitude of the effects will be much larger than an MIC, thus allowing us to establish an upper limit to the magnitude of an MIC. Lastly, this review is limited to assessing the reliability, validity, and interpretability of a noxious stimulus-evoked EEG response metric in neonates aged 34–44 weeks PMA. Infants below 34 weeks PMA display noxious stimulus-evoked brain activity of a different morphology than the brain activity quantified by the standardized waveform studied here [ 36 , 37 ]. The standardized waveform used in this review has been developed for and used in neonates in this 34–44 week PMA age group [ 10 , 12 , 38 – 40 ]. While this age group does not include the youngest premature neonates to which most painful procedures happen, the methodology assessed in this review will help establish a foundation and roadmap for the future development of novel age-appropriate EEG outcome measures. This review will thus be of importance and interest to those hoping to improve pain assessment and treatment in the youngest most vulnerable neonatal age groups. The dissemination plan is to publish results in a peer-reviewed academic journal. Any amendments made to this protocol when conducting the study will be outlined and reported in the final manuscript as well as an updated version of our extended protocol document on OSF. Abbreviations CI = Confidence interval COSMIN = COnsensus-based Standards for the selection of health Measurement Instruments C-Path = Critical Path Institute EEG = Electroencephalography FAIR = Findability, accessibility, interoperability, and reusability FDA = U.S. Food and Drug Administration FLACC = Face, Legs, Activity, Cry, Consolability scale HKSJ = Hartung-Knapp-Sidik-Jonkman IPD = Individual participant data LNPS = Leuven Neonatal Pain Score MIC = Minimally important change OSF = Open Science Framework PICO = Population, intervention, comparator, outcome PIPP-R = Premature infant pain profile – revised PMA = Postmenstrual age PRESS = Peer Review of Electronic Search Strategies PRISMA-P = Preferred Reporting Items for Systematic Reviews and Meta-Analyses – for Protocols PRO = Patient-reported outcome PROSPERO = International Prospective Register of Systematic Reviews QC = Quality control RDCA-DAP = Rare Disease Cures Accelerator-Data and Analytics Platform REML = Restricted maximum likelihood WHO ICTRP = World Health Organization International Clinical Trials Registry Platform Declarations Ethics approval and consent to participate Not applicable. Consent for publication Not applicable. Availability of data and materials Not applicable. Competing interests The authors declare that they have no competing interests. Funding This review is financially supported by International Neonatal Consortium (INC) of Critical Path Institute (C-Path) (Tucson, Az). JP Sheppard receives funding from the Wellcome Trust/Royal Society via a Sir Henry Dale Fellowship (ref: 211182/Z/18/Z). This research was funded in part, by the Wellcome Trust [ref: 211182/Z/18/Z]. For the purpose of open access, the author has applied a CC-BY public copyright licence to any Author Accepted Manuscript version arising from this submission. Authors' contributions LB wrote the original protocol draft. All authors contributed to the conceptualisation and development of the protocol. Disclaimer: The opinions expressed in this article are those of the authors and should not be interpreted as the position of the U.S. Food and Drug Administration. Acknowledgements INC Pain working party authorship group: Dina Apele-Freimane (Pauls Stradins Clinical University Hospital, Riga, Latvia), Takeshi Arimistu (Department of Pediatrics, Keio University School of Medicine, Japan), Christine Barry (Critical Path Institute, INC, C-Path, Tucson, Arizona, USA), Deb Discenza (Parent Representative, Preemieworld, USA), Olivia Giola (Critical Path Institute, INC, C-Path, Tucson, Arizona, USA), Collin Hovinga (Critical Path Institute, INC, C-Path, Tucson, Arizona, USA), Yamile Jackson (Parent Representative, USA), Danielle Matthews (Parent Representative, USA), Varsha Mehta† (Food and Drug Administration FDA, Silver Spring, Maryland, USA), Katie Reginato Cascamo (Parent Representative, USA), Norma Vivas (Sanofi Pharmaceuticals, Argentina). † Deceased. References Carbajal R, Rousset A, Danan C, Coquery S, Nolent P, Ducrocq S, et al. Epidemiology and treatment of painful procedures in neonates in intensive care units. JAMA. 2008;300:60–70. Flint RB, van Beek F, Andriessen P, Zimmermann LJ, Liem KD, Reiss IKM, et al. Large differences in neonatal drug use between NICUs are common practice: time for consensus? Br J Clin Pharmacol. 2018;84:1313–23. Slater R, Moultrie F, Bax R, Anker J, van den, Bhatt A. Preterm health: time to bridge the evidence gap. Lancet. 2020;396:872–3. Cong X, McGrath JM, Cusson RM, Zhang D. Pain assessment and measurement in neonates: an updated review. Adv Neonatal Care. 2013;13:379–95. Boyd WM. NDA 205388/S-006, OMIDRIA (phenylephrine and ketorolac intraocular solution) 1% / 0.3%. FDA. 2017. https://www.fda.gov/media/110357/download . Fang C. Efficacy Review of NDA 22–450 N000 (IV Acetaminophen). FDA. 2017. https://www.fda.gov/media/103514/download . Hertz S. Analgesic Development for Pediatric Patients. FDA. 2016. https://www.fda.gov/files/advisory%20committees/published/Analgesic-Development-for-Pediatric-Patients.pdf . Slater R, Worley A, Fabrizi L, Roberts S, Meek J, Boyd S, et al. Evoked potentials generated by noxious stimulation in the human infant brain. Eur J Pain. 2010;14:321–6. Verriotis M, Fabrizi L, Lee A, Ledwidge S, Meek J, Fitzgerald M. Cortical activity evoked by inoculation needle prick in infants up to one-year old. Pain. 2015;156:222–30. Hartley C, Duff EP, Green G, Mellado GS, Worley A, Rogers R et al. Nociceptive brain activity as a measure of analgesic efficacy in infants. Sci Transl Med. 2017;9. Cobo MM, Moultrie F, Hauck AGV, Crankshaw D, Monk V, Hartley C, et al. Multicentre, randomised controlled trial to investigate the effects of parental touch on relieving acute procedural pain in neonates (Petal). BMJ Open. 2022;12:e061841. Hartley C, Moultrie F, Hoskin A, Green G, Monk V, Bell JL, et al. Analgesic efficacy and safety of morphine in the Procedural Pain in Premature Infants (Poppi) study: randomised placebo-controlled trial. Lancet. 2018;392:2595–605. De Vet HCW, Terwee CB, Mokkink LB, Knol DL. Measurement in Medicine: A Practical Guide. Cambridge: Cambridge University Press; 2011. Mokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63:737–45. FDA. Guidance for Industry Patient-Reported Outcome Measures. : Use in Medical Product Development to Support Labeling Claims. 2009. https://www.fda.gov/media/77832/download . De Vet HCW, Terwee CB, Mokkink LB, Knol DL. Reliability. Measurement in Medicine: A Practical Guide. Cambridge University Press; 2011. pp. 96–149. De Vet HCW, Terwee CB, Mokkink LB, Knol DL. Validity. Measurement in Medicine: A Practical Guide. Cambridge University Press; 2011. pp. 150–201. Larsson BA, Jylli L, Lagercrantz H, Olsson GL. Does a local anaesthetic cream (EMLA) alleviate pain from heel-lancing in neonates? Acta Anaesthesiol Scand. 1995;39:1028–31. De Vet HCW, Terwee CB, Mokkink LB, Knol DL. Interpretability. Measurement in Medicine: A Practical Guide. Cambridge University Press; 2011. pp. 227–74. Moher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst Reviews. 2015;4:1. Shamseer L, Moher D, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015: elaboration and explanation. BMJ. 2015;349:g7647. Baxter L, Slater R, Allegaert K, Davis J, Turner M, Ward RM et al. Is the measurement of noxious stimulus-evoked EEG activity a reliable, valid, and interpretable tool to assess analgesic efficacy in neonates? 2023. https://doi.org/10.17605/OSF.IO/6UHZE . McGowan J, Sampson M, Salzwedel DM, Cogo E, Foerster V, Lefebvre C. PRESS Peer Review of Electronic Search Strategies: 2015 Guideline Statement. J Clin Epidemiol. 2016;75:40–6. McGowan J, Sampson M, Salzwedel DM, Cogo E, Lefebvre C. PRESS Peer Review of Electronic Search Strategies: 2015 Guideline Explanation and Elaboration (PRESS E&E). CADTH Methods and Guidelines. 2016. ClinicalTrials.gov. https://clinicaltrials.gov. Accessed 30 Jan 2024. ICTRP Search Portal. https://trialsearch.who.int. Accessed 30 Jan 2024. Thomas J, Graziosi S, Brunton J, Ghouze Z, O’Driscoll P, Bond M et al. EPPI-Reviewer: advanced software for systematic reviews, maps and evidence synthesis. 2022. C-Path RDCA-DAP Portal. https://portal.rdca.c-path.org . Accessed 30 Jan 2024. Jacobsen A, de Miranda Azevedo R, Juty N, Batista D, Coles S, Cornet R, et al. FAIR Principles: Interpretations and Implementation Considerations. Data Intell. 2020;2:10–29. Wilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:160018. Riley RD, Burke DL, Morris T. One-stage versus Two-stage Approach to IPD Meta-Analysis: Differences and Recommendations. In: Riley RD, Tierney JF, Stewart LA, editors. Individual Participant Data Meta-Analysis: A Handbook for Healthcare Research. Newark, UNITED KINGDOM: John Wiley & Sons, Incorporated; 2021. pp. 199–217. Riley RD, Debray TPA, Morris TP, Jackson D. The Two-stage Approach to IPD Meta-Analysis. In: Riley RD, Tierney JF, Stewart LA, editors. Individual Participant Data Meta-Analysis: A Handbook for Healthcare Research. Newark, UNITED KINGDOM: John Wiley & Sons, Incorporated; 2021. pp. 89–125. Schwarzer G, Carpenter JR, Rücker G. Meta-Analysis with R. 1st edition. Springer Cham; 2015. Viechtbauer W. Conducting Meta-Analyses in R with the metafor Package. J Stat Softw. 2010;36:1–48. Bueno M, Eriksson M, Stevens BJ. Neonatal and infant pain assessment. In: Stevens BJ, Hathway G, Zempsky WT, editors. Oxford Textbook of Pediatric Pain. Second Edition. Oxford, New York: Oxford University Press; 2021. p. 375–90. Fabrizi L, Slater R, Worley A, Meek J, Boyd S, Olhede S, et al. A shift in sensory processing that enables the developing human brain to discriminate touch from pain. Curr Biol. 2011;21:1552–8. van der Vaart M, Hartley C, Baxter L, Mellado GS, Andritsou F, Cobo MM, et al. Premature infants display discriminable behavioral, physiological, and brain responses to noxious and nonnoxious stimuli. Cereb Cortex. 2022;32:3799–815. Cobo MM, Green G, Andritsou F, Baxter L, Evans Fry R, Grabbe A, et al. Early life inflammation is associated with spinal cord excitability and nociceptive sensitivity in human infants. Nat Commun. 2022;13:3943. Cobo MM, Hartley C, Gursul D, Andritsou F, van der Vaart M, Schmidt Mellado G, et al. Quantifying noxious-evoked baseline sensitivity in neonates to optimise analgesic trials. eLife. 2021;10:e65266. Gursul D, Goksan S, Hartley C, Mellado GS, Moultrie F, Hoskin A, et al. Stroking modulates noxious-evoked brain activity in human infants. Curr Biol. 2018;28:R1380–1. Supplementary Files PRISMAPChecklist.pdf Supplement.pdf Cite Share Download PDF Status: Published Journal Publication published 26 Jul, 2025 Read the published version in Systematic Reviews → Version 1 posted Editorial decision: Accept 20 Jun, 2025 Reviewers agreed at journal 02 Aug, 2024 Reviewers invited by journal 24 May, 2024 Editor assigned by journal 23 May, 2024 First submitted to journal 31 Jan, 2024 You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-4464161","acceptedTermsAndConditions":true,"allowDirectSubmit":false,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":306413858,"identity":"1d5a08ac-598f-4d46-ac89-fde10026a859","order_by":0,"name":"Luke Baxter","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAABPklEQVRIie2RMUvDQBTH33FwWZ5mvVBav0JKoCCWfpYcgXQtuAgtMdNliXWtk1+jm5WDdAm4Cl0CgcwRQQIFtUkVbCLiKJjfcPfg7sf/3j2AlpY/yDEwAOJXNUnKRZal/eWKWVPYp4IA1Nwr9PcK41DV9PBKQ+GuSMhyJO60q2w6mQ27c+0hypOZBzq3SV6AspqKMknsWIjrwWYRuZZEhy7sSIERrqgRgho0lLHkRNIecpdtkCkhwaEg/BWYax86AGrYVIKCyEvEk4yd4+ubkHpaKh6YCuj2W8WNdr+kdimM0SO5EpJXKbRMYWVK42GYOVzEawtjl3Zu5o4leWrBrhc0wnt5Gprjevu65vbzp+VUBEFEnicvo+7ttUhJMfN6OnfUY3Fx1vehif2xHwwE9/OqT6UG/fm4paWl5b/yDnl/YqgoqlwUAAAAAElFTkSuQmCC","orcid":"https://orcid.org/0000-0001-9548-7162","institution":"University of Oxford","correspondingAuthor":true,"prefix":"","firstName":"Luke","middleName":"","lastName":"Baxter","suffix":""},{"id":306413859,"identity":"a64b75f3-8aa8-4461-a015-1e61a1ccbbf8","order_by":1,"name":"Marianne van der Vaart","email":"","orcid":"","institution":"University of Oxford","correspondingAuthor":false,"prefix":"","firstName":"Marianne","middleName":"van der","lastName":"Vaart","suffix":""},{"id":306413860,"identity":"e1f1f93a-88e1-49e8-9b7c-6edcfe9a9dac","order_by":2,"name":"Maria M Cobo","email":"","orcid":"","institution":"University of Oxford","correspondingAuthor":false,"prefix":"","firstName":"Maria","middleName":"M","lastName":"Cobo","suffix":""},{"id":306413861,"identity":"9c0ca879-940f-47ea-8afe-4d918808d7cd","order_by":3,"name":"Patricia Gunawan","email":"","orcid":"","institution":"University of Oxford","correspondingAuthor":false,"prefix":"","firstName":"Patricia","middleName":"","lastName":"Gunawan","suffix":""},{"id":306413862,"identity":"7b34afe4-748d-4810-a3c0-4e17af12e608","order_by":4,"name":"Karel Allegaert","email":"","orcid":"","institution":"KU Leuven: Katholieke Universiteit Leuven","correspondingAuthor":false,"prefix":"","firstName":"Karel","middleName":"","lastName":"Allegaert","suffix":""},{"id":306413863,"identity":"f6cfcabd-1455-4c1e-ab7a-dee4cf75c99b","order_by":5,"name":"Jonathan M Davis","email":"","orcid":"","institution":"Tufts Medical Center","correspondingAuthor":false,"prefix":"","firstName":"Jonathan","middleName":"M","lastName":"Davis","suffix":""},{"id":306413864,"identity":"d9cb90cf-18bd-4357-b7ca-e12b553f06f4","order_by":6,"name":"Mark Turner","email":"","orcid":"","institution":"University of Liverpool","correspondingAuthor":false,"prefix":"","firstName":"Mark","middleName":"","lastName":"Turner","suffix":""},{"id":306413865,"identity":"ff31808f-5ce4-425f-aa35-62372f25497c","order_by":7,"name":"Robert Ward","email":"","orcid":"","institution":"The University of Utah School of Medicine","correspondingAuthor":false,"prefix":"","firstName":"Robert","middleName":"","lastName":"Ward","suffix":""},{"id":306413866,"identity":"edb86639-d794-458d-8639-08571138b77c","order_by":8,"name":"Edress Darsey","email":"","orcid":"","institution":"Pfizer Inc","correspondingAuthor":false,"prefix":"","firstName":"Edress","middleName":"","lastName":"Darsey","suffix":""},{"id":306413867,"identity":"3970ac09-cebe-4fb6-b034-6df344472a59","order_by":9,"name":"James P Sheppard","email":"","orcid":"","institution":"University of Oxford","correspondingAuthor":false,"prefix":"","firstName":"James","middleName":"P","lastName":"Sheppard","suffix":""},{"id":306413868,"identity":"1e60077b-fe0c-422d-904b-50e7279f7ff7","order_by":10,"name":"Aomesh Bhatt","email":"","orcid":"","institution":"University of Oxford","correspondingAuthor":false,"prefix":"","firstName":"Aomesh","middleName":"","lastName":"Bhatt","suffix":""},{"id":306413869,"identity":"858e8ffa-6d58-4676-819a-1354ec3a9781","order_by":11,"name":"John van den Anker","email":"","orcid":"","institution":"University of Oxford","correspondingAuthor":false,"prefix":"","firstName":"John","middleName":"van den","lastName":"Anker","suffix":""},{"id":306413870,"identity":"77d98e80-77db-43a4-96da-912f01734fec","order_by":12,"name":"An N Massaro","email":"","orcid":"","institution":"FDA: US Food and Drug Administration","correspondingAuthor":false,"prefix":"","firstName":"An","middleName":"N","lastName":"Massaro","suffix":""},{"id":306413871,"identity":"9b86f8c1-82a1-494e-ba1a-3ac4a1ea4997","order_by":13,"name":"Ramona L Walls","email":"","orcid":"","institution":"Critical Path Institute","correspondingAuthor":false,"prefix":"","firstName":"Ramona","middleName":"L","lastName":"Walls","suffix":""},{"id":306413872,"identity":"58edd304-2a70-4f4f-9040-fe346d1f5852","order_by":14,"name":"Laura S Song","email":"","orcid":"","institution":"Critical Path Institute","correspondingAuthor":false,"prefix":"","firstName":"Laura","middleName":"S","lastName":"Song","suffix":""},{"id":306413873,"identity":"a00faf91-d9b4-4fa6-905c-02359fd67bf1","order_by":15,"name":"Kanwaljit Singh","email":"","orcid":"","institution":"Critical Path Institute","correspondingAuthor":false,"prefix":"","firstName":"Kanwaljit","middleName":"","lastName":"Singh","suffix":""},{"id":306413874,"identity":"23efa27d-d6b5-4691-8b38-8fbea2ae81f6","order_by":16,"name":"INC Pain working party authorship group N/A","email":"","orcid":"","institution":"International Neonatal Consortium","correspondingAuthor":false,"prefix":"","firstName":"INC","middleName":"Pain working party authorship group","lastName":"N/A","suffix":""},{"id":306413875,"identity":"8db98d14-678e-4316-a6ae-9d2248fa28a5","order_by":17,"name":"Rebeccah Slater","email":"","orcid":"","institution":"University of Oxford","correspondingAuthor":false,"prefix":"","firstName":"Rebeccah","middleName":"","lastName":"Slater","suffix":""}],"badges":[],"createdAt":"2024-05-23 04:38:00","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-4464161/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-4464161/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1186/s13643-025-02890-4","type":"published","date":"2025-07-26T15:57:03+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":87756654,"identity":"a32cb000-e2ed-4634-98e1-3df3682fa7e8","added_by":"auto","created_at":"2025-07-28 16:05:56","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":752079,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4464161/v1/f1c448d9-87f9-4c69-8c22-15af83f45cb0.pdf"},{"id":57813133,"identity":"2e4a5455-0616-4d52-9b92-aab3c9dbb0ae","added_by":"auto","created_at":"2024-06-06 03:37:37","extension":"pdf","order_by":5,"title":"","display":"","copyAsset":false,"role":"supplement","size":140325,"visible":true,"origin":"","legend":"","description":"","filename":"PRISMAPChecklist.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4464161/v1/cc32babb6f9d01b71a06286c.pdf"},{"id":57813134,"identity":"4d6f2482-0879-463d-974f-58a2a829699b","added_by":"auto","created_at":"2024-06-06 03:37:38","extension":"pdf","order_by":6,"title":"","display":"","copyAsset":false,"role":"supplement","size":189614,"visible":true,"origin":"","legend":"","description":"","filename":"Supplement.pdf","url":"https://assets-eu.researchsquare.com/files/rs-4464161/v1/9851c81b23480aa7b812575d.pdf"}],"financialInterests":"","formattedTitle":"Is noxious stimulus-evoked electroencephalography response a reliable, valid, and interpretable outcome measure to assess analgesic efficacy in neonates? A systematic review and individual participant data (IPD) meta-analysis protocol","fulltext":[{"header":"Background","content":"\u003cdiv id=\"Sec2\" class=\"Section2\"\u003e \u003ch2\u003eRationale:\u003c/h2\u003e \u003cp\u003eHospitalised neonates undergo numerous painful procedures each day as part of routine clinical care, including heel lances, cannulation, intubation, and peripheral arterial/venous line insertions [\u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e1\u003c/span\u003e]. Unfortunately, analgesic provision for neonates is ad-hoc, off-label, un-standardized, and suboptimal [\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2\u003c/span\u003e]. This is primarily due to difficulties in assessing acute pain in neonates due to the subjective nature of pain and the non-verbal nature of the neonatal population, with current pain outcome measures designed for older populations (children as well as adults) not being suitable. These challenges have limited our ability to test the efficacy and safety of analgesics for use in acute pain in this vulnerable population. As such, the progress in analgesic drug studies in neonates has stalled [\u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e3\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eThere is a need for pain assessment tools in neonates that can accurately evaluate pain and that are specifically intended for use in this vulnerable patient population. More than 40 different multicomponent pain scales have been developed to assess neonatal pain [\u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e4\u003c/span\u003e]. This includes some that have been accepted by the U.S. Food and Drug Administration (FDA) as trial endpoints (e.g. LNPS [Leuven Neonatal Pain Score], FLACC [Face, Legs, Activity, Cry, and Consolability], Alder Hey Triage Pain Score) [\u003cspan citationid=\"CR5\" class=\"CitationRef\"\u003e5\u003c/span\u003e, \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e6\u003c/span\u003e]. However, there is a clear acknowledgement that these measures are only acceptable in the absence of better alternatives [\u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e7\u003c/span\u003e]. To advance the development and licensing of analgesics and improve the success of trials assessing analgesic efficacy, there is an urgent need for better (i.e. greater validity, reliability, and interpretability) primary endpoints to be developed for neonates.\u003c/p\u003e \u003cp\u003eIn this review, we will assess the potential value of a brain-based pain-relevant response recorded using electroencephalography (EEG). A well-characterized EEG pattern of noxious stimulus-evoked brain activity exhibiting a positive deflection at approximately 560 ms post-stimulation has been recorded in peri-term-aged neonates in response to acute somatic nociceptive skin-breaking procedures such as heel lancing and intramuscular injections [\u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e8\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e9\u003c/span\u003e]. The magnitude of this positive deflection can be quantified using a standardized waveform that is scaled to the amplitude of the evoked signal [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e] using linear regression. This can be used to study the influence of analgesic interventions on the magnitude of noxious stimulus-evoked brain activity in randomised clinical trials [\u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e11\u003c/span\u003e, \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e]. We suggest that this specific EEG metric of noxious stimulus-evoked brain activity can provide a reliable, valid, and interpretable proxy measure of neonatal pain that is not behaviour-, autonomic-, or haemodynamic-based. This could serve as a useful endpoint to evaluate the efficacy of analgesics in neonatal clinical trials.\u003c/p\u003e \u003cp\u003eTo assess the reliability, validity, and interpretability of noxious stimulus-evoked EEG activity in neonates for use as an analgesic efficacy endpoint in clinical trials, it is highly desirable to assess these properties across multiple geographic sites and independent research centres. Aggregating the data in the literature using meta-analysis is the gold standard for this research aim. While there are several research centres around the world that are working with neonatal noxious stimulus-evoked EEG data, there are a multitude of EEG pre-processing and analytic methods adopted in the literature. Only one research centre is currently using the EEG methodology outlined above [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e]. Thus, a traditional aggregate data meta-analysis approach is not feasible. Instead, we will adopt the individual participant data (IPD) meta-analysis approach to request relevant raw EEG data that can subsequently be pre-processed and analysed using a standardised analysis pipeline. This will allow the derivation of a single standardised metric across all retrieved data.\u003c/p\u003e \u003cp\u003eTo assess the reliability, validity, and interpretability of the EEG metric of neonatal acute pain assessment, we will adopt the COSMIN (COnsensus-based Standards for the selection of health Measurement INstruments) terminology framework [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e]. This terminology is closely related to that of the FDA [\u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e15\u003c/span\u003e]; the measurement properties of reliability and validity map directly, while the FDA measurement property \u0026lsquo;ability to detect change\u0026rsquo; maps onto COSMIN\u0026rsquo;s measurement property \u0026lsquo;responsiveness\u0026rsquo;, which COSMIN classify as a subcategory of validity [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e]. However, the FDA framework has been developed with patient-reported outcome (PRO) measures in mind, while the specific EEG metric under investigation is a pharmacodynamic or response biomarker.\u003c/p\u003e \u003cp\u003eWe have adopted the COSMIN framework due to its more expansive literature base, its broader coverage of both validity and reliability subtypes, as well as its greater clarity around the concept of measurement interpretability. We also provide a Glossary in the Supplementary Information (Table \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e) to clarify relevant terminology.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003eObjectives:\u003c/h2\u003e \u003cp\u003eWe aim to establish the extent to which the noxious stimulus-evoked EEG measure is a pain-relevant indicator that is suitable for use as a response biomarker in clinical trials of established and novel analgesics. To achieve this, we will assess its validity, reliability, and interpretability, as defined by the COSMIN Taxonomy of Measurement Properties [\u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e13\u003c/span\u003e, \u003cspan citationid=\"CR14\" class=\"CitationRef\"\u003e14\u003c/span\u003e] (see Supplementary Information Table \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e Glossary). Our primary research question is as follows: \u0026ldquo;\u003cem\u003eIs the specific EEG-recorded brain activity magnitude metric under investigation, that is evoked by acute somatic nociceptive skin-breaking procedures in neonates 34\u0026ndash;44 weeks PMA, a reliable, valid, and interpretable pain-relevant indicator?\u003c/em\u003e\u0026rdquo;. The research question is tabulated in PICO format in the Supplementary Information (Table \u003cspan refid=\"MOESM2\" class=\"InternalRef\"\u003eS2\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eWhen pre-processing EEG data, an initial quality assessment step involves determining whether to include or exclude an individual noxious stimulus-evoked EEG response (i.e. EEG epoch rejection) from the group-level analysis. This step involves a rater\u0026rsquo;s judgment, thus allowing us to assess rater reliability [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e]. We will assess both inter-rater and intra-rater reliability based on independent raters\u0026rsquo; decisions to include or exclude each stimulus-response epoch.\u003c/p\u003e \u003cp\u003eTo assess validity [\u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e], we focus on two types of construct validity: known-groups validity and known-stimuli validity (see Glossary for definitions Table \u003cspan refid=\"MOESM1\" class=\"InternalRef\"\u003eS1\u003c/span\u003e). For known-groups validity, our \u0026ldquo;known groups\u0026rdquo; should differ in the sensory intensity component of acute somatic nociceptive pain and are: (i) noxious stimulation occurring with general anaesthetic or non-topical local anaesthetic (e.g. caudal block) and (ii) noxious stimulation without general anaesthetic or non-topical local anaesthetic. The mechanism of action of these anaesthetics is well-established and should reduce noxious stimulation input to the brain (topical local anaesthetics are not included since preliminary evidence suggests topical anaesthetics might not penetrate the skin deep enough to have an analgesic effect [\u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e18\u003c/span\u003e]). For known-stimuli validity, our \u0026ldquo;known stimuli\u0026rdquo; that should differ in the sensory intensity component of acute somatic nociceptive pain are: (i) acute somatic nociceptive skin-breaking procedures (e.g. heel lance) and (ii) acute somatic non-nociceptive non-skin-breaking procedures (e.g. sham heel lance). The differences between these two stimulus types should be reflected in the amount of noxious stimulation input reaching the brain.\u003c/p\u003e \u003cp\u003eOur research questions address the interpretability of values that can be considered at least moderately important in magnitude. As outlined in the validity assessments above, two key contexts that will be assessed are: (i) a noxious skin-breaking procedure with and without anaesthetic (general anaesthetic or non-topical local anaesthetic) and (ii) a noxious skin-breaking procedure compared to a non-noxious non-skin-breaking procedure. Both scenarios compare relatively extreme cases where the magnitude of the effects will be much larger than a minimally important change (MIC). Assessing an MIC will not be possible, as there is no standard methodology to determine pain MICs for neonates. Whether the magnitude of the effects in the two scenarios is considered moderately or substantially important is open to interpretation.\u003c/p\u003e \u003cp\u003eTo establish interpretability, an anchor-based approach [\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e] will be adopted relying on \u0026lsquo;clinician-based assessments\u0026rsquo;, which means that the interpretation of the two clinically important scenarios outlined above is deemed reasonable from a clinician\u0026rsquo;s (or researcher\u0026rsquo;s or parent\u0026rsquo;s) perspective e.g. the difference between a noxious skin-breaking procedure and a non-noxious non-skin-breaking procedure is important from the perspective of clinicians, researchers, and parents. The anchor is the subjective assessment of the clinicians, researchers, and parents in our consortium, where the qualitative meaning of the quantitative values derives from the clinical meaning of the populations, stimuli, and interventions. To establish interpretability, typical values of the stimulus-evoked EEG response will be assessed to establish normative values for the neonatal population. The magnitude of the clinically important difference in stimulus-evoked EEG responses between noxious and innocuous stimuli will be determined, as will responses with or without anaesthetic. The effect sizes for these normative values and clinically meaningful differences will be reported to allow sample size planning for future randomised controlled trials. In addition to the anchor-based approach, the interpretability results will be supported by a statistics-based distribution-based approach [\u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e]. We will also use a distribution-based approach to report small, medium, and large effect sizes using established statistical benchmarks.\u003c/p\u003e \u003cp\u003eIn addition to the three primary objectives of assessing reliability, validity, and interpretability, there are three exploratory objectives to this study. The first is examining the potential differences in noxious stimulus-evoked EEG responses between biologically distinguishable groups, defined using variables such as age, sex, ethnicity, physiological state, and health status; the second is assessing the impact of proposed analgesic interventions (both pharmacological and non-pharmacological) and topical local anaesthetic interventions on the magnitude of the noxious stimulus-evoked EEG response; and the third objective is assessing the association between the noxious stimulus-evoked EEG response and the noxious stimulus-evoked response assessed using non-EEG methods (e.g. behavioural measures, physiological measures, infant clinical pain scales).\u003c/p\u003e \u003c/div\u003e"},{"header":"Methods","content":"\u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003eProtocol structure and registration:\u003c/h2\u003e \u003cp\u003eThis protocol has been structured to conform to PRISMA-P guidelines [\u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e20\u003c/span\u003e, \u003cspan citationid=\"CR21\" class=\"CitationRef\"\u003e21\u003c/span\u003e], and a completed PRISMA-P checklist is provided in the Supplementary Information. Our protocol was registered with PROSPERO on 14 July 2023: CRD42023444809. An expanded version of the protocol can also be found on the Open Science Framework (OSF)[\u003cspan citationid=\"CR22\" class=\"CitationRef\"\u003e22\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003eEligibility criteria:\u003c/h2\u003e \u003cp\u003eWe include primary empirical studies with any study design that have EEG recordings during acute somatic nociceptive skin-breaking procedure for neonates aged 34\u0026ndash;44 weeks PMA. There are no exclusion criteria based on language or year of dissemination. The eligibility criteria are tabulated in full in Supplementary Information Table S3.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec7\" class=\"Section2\"\u003e \u003ch2\u003eInformation sources and search strategies:\u003c/h2\u003e \u003cp\u003eWe use three categories of information sources: bibliographic databases, clinical trial registries, and authors of included studies. For electronic searches, the search strategy was initially developed in MEDLINE, then translated for other databases and registries. The search focused on the combination of three core topics: \u0026lsquo;neonate\u0026rsquo;, \u0026lsquo;pain\u0026rsquo;, and \u0026lsquo;EEG\u0026rsquo;. The search strategies were independently peer-reviewed using the PRESS Checklist [\u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e23\u003c/span\u003e, \u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e24\u003c/span\u003e] by an Outreach Librarian at the Bodleian Health Care Libraries, University of Oxford. The bibliographic databases (and their platforms) are MEDLINE (Ovid), Embase (Ovid), CINAHL (EBSCO Industries), Web of Science Core Collection (Clarivate Analytics), Scopus (Elsevier), Google Scholar (Publish or Perish). Clinical trial registrations were identified using: ClinicalTrials.gov [\u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e25\u003c/span\u003e] and WHO ICTRP [\u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e26\u003c/span\u003e]. All search strategies are presented in full in the Supplementary Information.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003eStudy records:\u003c/h2\u003e \u003cp\u003eFor the literature review data to identify sources of IPD, all screening (on title and abstract followed by full text) will be performed independently in duplicate, with disagreements resolved through discussion with a third reviewer. Data from bibliographic databases will be managed (deduplicated and screened) using EPPI-Reviewer Web [\u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e27\u003c/span\u003e], while data from clinical trial registries will be managed via spreadsheets and in-house scripts with data files stored safely on a University server (scripts will be publicly available via GitHub). Original investigators who have relevant EEG data will be invited to be co-authors on a scoping review, where the summary results of this literature search will be reported, and the accuracy of literature data extraction verified by the original investigators.\u003c/p\u003e \u003cp\u003eAfter identifying original investigators with relevant data, we will also invite them to contribute their IPD for systematic review and meta-analysis and to be co-authors on the publication. All IPD from a single site will be pooled across studies conducted within the same site to form a single dataset per site (see Supplementary Information \u0026ndash; Risk of Bias in Individual Studies). To ensure consistent analysis of all IPD across sites, a single researcher competent in implementing the specific analysis pipeline will visit all participating sites to run the analysis pipeline and extract the relevant outcomes and image files for each neonate\u0026rsquo;s dataset. Relevant non-EEG data (see \u0026lsquo;Data items\u0026rsquo; below) will be compiled locally by the participating sites, with central guidance. The IPD dataset will initially be stored locally at each participating site. The summary deidentified outcome measures per neonate and relevant non-EEG variables will be stored in spreadsheet files, shared with researchers at the University of Oxford, and stored securely on a University of Oxford server for subsequent meta-analysis. Image files for each EEG epoch will be stored with the spreadsheets. Finally, the spreadsheets and image files per site will be stored by C-Path on the Rare Disease Cures Accelerator-Data and Analytics Platform (RDCA-DAP), an FDA-funded initiative that provides a centralized and standardized data analytics infrastructure [\u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e28\u003c/span\u003e]. Each original investigator will also be invited to share their full IPD dataset with C-Path for storage on RDCA-DAP, to ensure safe and secure data storage that meets FAIR data principles [\u003cspan citationid=\"CR29\" class=\"CitationRef\"\u003e29\u003c/span\u003e, \u003cspan citationid=\"CR30\" class=\"CitationRef\"\u003e30\u003c/span\u003e].\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec9\" class=\"Section2\"\u003e \u003ch2\u003eData items:\u003c/h2\u003e \u003cp\u003eFor the literature review, the data items extracted from the literature will be outlined and defined in the data extraction form and accompanying data dictionary, which will be made publicly available as part of the scoping review publication.\u003c/p\u003e \u003cp\u003eFor the IPD systematic review and meta-analysis, there are several variables required per individual EEG epoch for the meta-analysis. To address the validity and interpretability research objectives, a response magnitude (regression coefficient) and its associated measure of variance (the squared standard error of the regression coefficient) will be required. To address the reliability research objective, several epoch quality control (QC) metrics as well as image files of the EEG epochs will be required to guide users of the analysis methodology to make data-informed decisions regarding the inclusion or exclusion of each individual epoch. The specific details of how each of the response metrics and QC metrics are extracted will be provided in a forthcoming publication detailing the analysis pipeline. The analysis pipeline will be made publicly available via GitHub or GitLab.\u003c/p\u003e \u003cp\u003eAdditionally, non-EEG data will be required. To address the validity and interpretability research objectives, we will require information on the nature of the recorded noxious and innocuous stimuli (e.g. heel lance, sham heel lance) as well as whether the baby received general anaesthetic or non-topical local anaesthetic. For the exploratory analyses, we will require information relating to age, sex, ethnicity, physiological state, health status, analgesic interventions (both pharmacological and non-pharmacological), topical local anaesthetic interventions and noxious stimulus-evoked responses assessed using non-EEG methods (e.g. behavioural measures, physiological measures, infant clinical pain scales). The exact data items required for the exploratory analyses will be based on the data extraction form from the scoping review, and determined after data extraction for the scoping review is complete. This ensures that requests for data include items that are collected by the individual sites.\u003c/p\u003e \u003cp\u003eFor full storage of the raw EEG data in C-Path\u0026rsquo;s RDCA-DAP, the exact data items required will be defined once data contributions are made and the full requirements are understood. Importantly, the storage of the raw EEG data in C-Path\u0026rsquo;s RDCA-DAP will not be a prerequisite for addressing all primary objectives of this IPD systematic review and meta-analysis.\u003c/p\u003e \u003c/div\u003e\n\u003ch3\u003eOutcomes and prioritization:\u003c/h3\u003e\n\u003cp\u003eThe three primary objectives (assessing reliability, validity, and interpretability) are of equally high priority. The three exploratory objectives (assessing biological groups, analgesic and topical anaesthetic interventions, and non-EEG measures) are of lower prioritization.\u003c/p\u003e \u003cp\u003eReliability outcomes will be measured as Gwet's AC1 coefficient (a chance-corrected agreement coefficient, and a more stable alternative to Cohen\u0026rsquo;s Kappa) and 95% confidence intervals (CIs) will be reported. Adequacy will be assessed using the Altman's benchmark scale: poor (-1 to 0.2), fair (0.2 to 0.4), moderate (0.4 to 0.6), good (0.6 to 0.8), very good (0.8 to 1).\u003c/p\u003e \u003cp\u003eValidity outcomes will be measured as stimulus-evoked EEG response magnitude differences (e.g. noxious minus innocuous, analgesic-negative minus analgesic-positive). Means and 95% CIs will be reported. Significance will be assessed by whether the 95% CI excludes (significant) or includes (not significant) the null value. Adequacy will be assessed using the COSMIN recommendations for hypothesis testing construct validity: at least 75% of the hypotheses should be confirmed.\u003c/p\u003e \u003cp\u003eInterpretability outcomes will be measured as stimulus-evoked EEG response magnitudes and magnitude differences. Means and 95% CIs will be reported. Statistical benchmarks of small, medium, and large effect sizes will supplement these outcomes, using established statistical benchmarks: Cohen\u0026rsquo;s D\u0026thinsp;=\u0026thinsp;0.2, 0.5, 0.8.\u003c/p\u003e \u003cp\u003eFor the exploratory outcomes, biological group comparison outcomes will be measured as stimulus-evoked EEG response magnitudes (means and 95% CIs will be reported). Analgesic and topical local anaesthetic intervention assessment outcomes will be measured as stimulus-evoked EEG response magnitudes (means and 95% CIs will be reported). Associations between noxious stimulus-evoked EEG response and noxious stimulus-evoked response using non-EEG methods will be measured with Pearson correlation coefficients and 95% CIs. For composite or multi-component infant pain scales (e.g. PIPP-R), we will assess associations between the EEG response and the overall infant pain scale score. If data availability allows, we will further assess associations between the EEG response and (i) each individual item in the scale, and (ii) the mean value per dimension (e.g. behavioural dimension, physiological dimension) in the scale.\u003c/p\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003eData synthesis:\u003c/h2\u003e \u003cp\u003eA two-stage IPD analytic approach will be adopted, because it automatically stratifies parameter estimates and residual variances appropriately by analysing each dataset separately in the first stage, utilises well-known meta-analysis methods that offer familiarity and transparency in the second stage, and enables visual summaries using forest plots [\u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e]. Additionally, it provides the opportunity to assess the variability in primary outcomes by dataset. Where appropriate, a one-stage approach will be considered if most of the included datasets are small (e.g. \u0026lt;20\u0026ndash;30 participants) [\u003cspan citationid=\"CR31\" class=\"CitationRef\"\u003e31\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eWe will adopt a random effects modelling approach, as the assumption of a common effect across datasets is unlikely to be reasonable. For completeness, we will additionally report the fixed-effect (i.e. common-effect) result, with the random-effects being the primary outcome.\u003c/p\u003e \u003cp\u003eWhen fitting the random effects model in the second stage, we will use the REML (restricted maximum likelihood) estimation approach and CIs generated using the HKSJ (Hartung-Knapp-Sidik-Jonkman) method [\u003cspan citationid=\"CR32\" class=\"CitationRef\"\u003e32\u003c/span\u003e]. This will be implemented in R [\u003cspan citationid=\"CR33\" class=\"CitationRef\"\u003e33\u003c/span\u003e] using the metafor package [\u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e34\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eSummary effects will be generated for measures of reliability (inter- and intra-rater agreement), validity (magnitude differences for both known-groups and known-stimuli scenarios), and interpretability (typical magnitude and magnitude difference values), thus summarising the measurement properties for the specific EEG metric under investigation. From these summary effects, standardised effect sizes (Cohen\u0026rsquo;s D) will be generated, which can be used in future clinical trial sample size planning. Heterogeneity across datasets will be reported using I\u003csup\u003e2\u003c/sup\u003e, tau\u003csup\u003e2\u003c/sup\u003e, and 95% prediction intervals.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003eRisk of bias in individual studies, meta-bias, and confidence in cumulative evidence:\u003c/h2\u003e \u003cp\u003eGiven the nature of the research objectives (establishing novel outcomes of measurement properties and interpretability) and our level of data grouping (data grouped per site rather than per publication), the standard approaches to assessing bias and meta-bias are not directly applicable to this review. We provide an extended discussion of these topics in the Supplementary Information, where the aim to assess availability bias, data accuracy, and data quality is outlined. Similarly, given the nature of the research objectives, we will not formally assess the confidence in cumulative evidence.\u003c/p\u003e \u003c/div\u003e"},{"header":"Discussion","content":"\u003cp\u003eThe purpose of this systematic review and IPD meta-analysis is to establish the reliability, validity, and interpretability of a specific EEG-based response biomarker [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e] for assessing acute somatic nociceptive pain in neonates. It will provide an overview of the available data and how EEG is being used globally to assess acute neonatal pain. If sufficient IPD are made available and the outcome is demonstrated to be reliable, valid, and interpretable, this work will support the use of EEG-based outcome measures as primary endpoints in clinical trials assessing analgesic efficacy in neonates. Currently, brain-based metrics are not featured in any of the dozens of clinical pain scales for neonatal pain [\u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e35\u003c/span\u003e], and we believe this review will help address this key knowledge gap.\u003c/p\u003e \u003cp\u003eThis review is limited in its ability to establish certain types of reliability, validity, and interpretability, due to the nature of IPD meta-analysis, the nature of pain, and the non-verbal nature of neonates. The following discussion on reliability and validity subtypes and the interpretability of MICs is based on the COSMIN terminology [\u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e16\u003c/span\u003e, \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e17\u003c/span\u003e, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e19\u003c/span\u003e].\u003c/p\u003e \u003cp\u003eGiven that the EEG metric is a single-item outcome measurement, the internal consistency subtype of reliability is not relevant. Additionally, the reliability subtypes of measurement error and test-retest reliability are not possible. These reliability subtypes rely on the assumption that the patient conditions remain unchanged across repeated applications of painful stimulation (test-retest reliability) \u0026ndash; an assumption that is highly questionable and cannot be established \u0026ndash; or require that total measurement variance can be partitioned into true biological variance and noise variance (measurement error) \u0026ndash; a requirement that cannot be met due to the inherent inability to disambiguate these variance subtypes.\u003c/p\u003e \u003cp\u003eDue to the quantitative and data-driven nature of an IPD meta-analysis, assessment of content validity is outside the review\u0026rsquo;s remit. Content validity (including face validity) is primarily qualitative and theory-driven and can be established using approaches such as questionnaires and expert panel reviews. Due to the subjective nature of pain and the non-verbal nature of neonates, assessing criterion validity is not possible as this type of validity requires the existence of a \u0026lsquo;gold standard\u0026rsquo;. Neither brain-based (e.g., EEG), behaviour-based, nor vital signs-based measures constitute a gold standard for pain assessment. As this review aims to establish measurement properties and interpretability of a specific single-item EEG metric, assessing structural validity is not relevant. Cross-cultural validity is not relevant, as there is no aspect of the specific EEG metric analysis approach that is language-based that would require translation and cross-cultural validation. Importantly, cross-cultural validity is unrelated to cultural differences in pain experience and response. Thus, there may be cross-cultural differences in noxious stimulus-evoked EEG signal magnitudes, and these potential differences can be explored in this review. However, exploring these potential cultural differences would not constitute an assessment of cross-cultural validity.\u003c/p\u003e \u003cp\u003eWe also do not assess convergent validity, as establishing the agreement of the specific EEG metric with validated neonatal clinical pain scales (e.g., PIPP-R) should not be a pre-requisite for validating an EEG metric. Given that no existing neonatal pain scale has reached gold standard status and given the mechanistic differences underpinning behavioural, vital signs, and EEG signals, these differing measures may indicate different pain-relevant signals that need not agree. Instead of assessing convergent validity, we will explore (as outlined in our \u003cspan refid=\"Sec4\" class=\"InternalRef\"\u003eMethods\u003c/span\u003e section) the association between the EEG metric and alternative non-EEG measures to help begin to shed light on the degree of overlap among these different measures. Divergent validity is unlikely to be able to be evaluated, as it would require researchers to have used a validated scale during the painful procedure to measure a concept-of-interest that is not pain.\u003c/p\u003e \u003cp\u003eAs described in the interpretability objectives outlined in the \u003cspan refid=\"Sec1\" class=\"InternalRef\"\u003eBackground\u003c/span\u003e section, there is no existing agreed-upon method to determine pain MICs for neonates. Consequently, it cannot be established by this review. However, the scenarios that we will assess compare relatively extreme cases, where the magnitude of the effects will be much larger than an MIC, thus allowing us to establish an upper limit to the magnitude of an MIC.\u003c/p\u003e \u003cp\u003eLastly, this review is limited to assessing the reliability, validity, and interpretability of a noxious stimulus-evoked EEG response metric in neonates aged 34\u0026ndash;44 weeks PMA. Infants below 34 weeks PMA display noxious stimulus-evoked brain activity of a different morphology than the brain activity quantified by the standardized waveform studied here [\u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e36\u003c/span\u003e, \u003cspan citationid=\"CR37\" class=\"CitationRef\"\u003e37\u003c/span\u003e]. The standardized waveform used in this review has been developed for and used in neonates in this 34\u0026ndash;44 week PMA age group [\u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e10\u003c/span\u003e, \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e12\u003c/span\u003e, \u003cspan additionalcitationids=\"CR39\" citationid=\"CR38\" class=\"CitationRef\"\u003e38\u003c/span\u003e\u0026ndash;\u003cspan citationid=\"CR40\" class=\"CitationRef\"\u003e40\u003c/span\u003e]. While this age group does not include the youngest premature neonates to which most painful procedures happen, the methodology assessed in this review will help establish a foundation and roadmap for the future development of novel age-appropriate EEG outcome measures. This review will thus be of importance and interest to those hoping to improve pain assessment and treatment in the youngest most vulnerable neonatal age groups.\u003c/p\u003e \u003cp\u003eThe dissemination plan is to publish results in a peer-reviewed academic journal. Any amendments made to this protocol when conducting the study will be outlined and reported in the final manuscript as well as an updated version of our extended protocol document on OSF.\u003c/p\u003e"},{"header":"Abbreviations","content":"\u003cp\u003eCI = Confidence interval\u003c/p\u003e\n\u003cp\u003eCOSMIN = COnsensus-based Standards for the selection of health Measurement Instruments\u003c/p\u003e\n\u003cp\u003eC-Path = Critical Path Institute\u003c/p\u003e\n\u003cp\u003eEEG = Electroencephalography\u003c/p\u003e\n\u003cp\u003eFAIR = Findability, accessibility, interoperability, and reusability\u003c/p\u003e\n\u003cp\u003eFDA = U.S. Food and Drug Administration\u003c/p\u003e\n\u003cp\u003eFLACC = Face, Legs, Activity, Cry, Consolability scale\u003c/p\u003e\n\u003cp\u003eHKSJ = Hartung-Knapp-Sidik-Jonkman\u003c/p\u003e\n\u003cp\u003eIPD = Individual participant data\u003c/p\u003e\n\u003cp\u003eLNPS = Leuven Neonatal Pain Score\u003c/p\u003e\n\u003cp\u003eMIC = Minimally important change\u003c/p\u003e\n\u003cp\u003eOSF = Open Science Framework\u003c/p\u003e\n\u003cp\u003ePICO = Population, intervention, comparator, outcome\u003c/p\u003e\n\u003cp\u003ePIPP-R = Premature infant pain profile\u0026nbsp;\u0026ndash;\u0026nbsp;revised\u003c/p\u003e\n\u003cp\u003ePMA = Postmenstrual age\u003c/p\u003e\n\u003cp\u003ePRESS = Peer Review of Electronic Search Strategies\u003c/p\u003e\n\u003cp\u003ePRISMA-P = Preferred Reporting Items for Systematic Reviews and Meta-Analyses \u0026ndash; for Protocols\u003c/p\u003e\n\u003cp\u003ePRO = Patient-reported outcome\u003c/p\u003e\n\u003cp\u003ePROSPERO = International Prospective Register of Systematic Reviews\u003c/p\u003e\n\u003cp\u003eQC = Quality control\u003c/p\u003e\n\u003cp\u003eRDCA-DAP = Rare Disease Cures Accelerator-Data and Analytics Platform\u003c/p\u003e\n\u003cp\u003eREML = Restricted maximum likelihood\u003c/p\u003e\n\u003cp\u003eWHO ICTRP = World Health Organization International Clinical Trials Registry Platform\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eEthics approval and consent to participate\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eConsent for publication\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eAvailability of data and materials\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eNot applicable.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eCompeting interests\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThe authors declare that they have no competing interests.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eFunding\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eThis review is financially supported by International Neonatal Consortium (INC) of Critical Path Institute (C-Path) (Tucson, Az). JP Sheppard receives funding from the Wellcome Trust/Royal Society via a Sir Henry Dale Fellowship (ref: 211182/Z/18/Z). This research was funded in part, by the Wellcome Trust [ref: 211182/Z/18/Z]. For the purpose of open access, the author has applied a CC-BY public copyright licence to any Author Accepted Manuscript version arising from this submission.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eAuthors\u0026apos; contributions\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eLB wrote the original protocol draft. All authors contributed to the conceptualisation and development of the protocol. Disclaimer: The opinions expressed in this article are those of the authors and should not be interpreted as the position of the U.S. Food and Drug Administration.\u003c/p\u003e\n\n\u003cp\u003e\u003cstrong\u003eAcknowledgements\u003c/strong\u003e\u003c/p\u003e\n\u003cp\u003eINC Pain working party authorship group: Dina Apele-Freimane (Pauls Stradins Clinical University Hospital, Riga, Latvia), Takeshi Arimistu (Department of Pediatrics, Keio University School of Medicine, Japan), Christine Barry (Critical Path Institute, INC, C-Path, Tucson, Arizona, USA), Deb Discenza (Parent Representative, Preemieworld, USA), Olivia Giola (Critical Path Institute, INC, C-Path, Tucson, Arizona, USA), Collin Hovinga (Critical Path Institute, INC, C-Path, Tucson, Arizona, USA), Yamile Jackson (Parent Representative, USA), Danielle Matthews (Parent Representative, USA), Varsha Mehta\u0026dagger; (Food and Drug Administration FDA, Silver Spring, Maryland, USA), Katie Reginato Cascamo (Parent Representative, USA), Norma Vivas (Sanofi Pharmaceuticals, Argentina). \u0026dagger; Deceased.\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\u003cli\u003e\u003cspan\u003eCarbajal R, Rousset A, Danan C, Coquery S, Nolent P, Ducrocq S, et al. Epidemiology and treatment of painful procedures in neonates in intensive care units. JAMA. 2008;300:60\u0026ndash;70.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFlint RB, van Beek F, Andriessen P, Zimmermann LJ, Liem KD, Reiss IKM, et al. Large differences in neonatal drug use between NICUs are common practice: time for consensus? Br J Clin Pharmacol. 2018;84:1313\u0026ndash;23.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSlater R, Moultrie F, Bax R, Anker J, van den, Bhatt A. Preterm health: time to bridge the evidence gap. Lancet. 2020;396:872\u0026ndash;3.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCong X, McGrath JM, Cusson RM, Zhang D. Pain assessment and measurement in neonates: an updated review. Adv Neonatal Care. 2013;13:379\u0026ndash;95.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBoyd WM. NDA 205388/S-006, OMIDRIA (phenylephrine and ketorolac intraocular solution) 1% / 0.3%. FDA. 2017. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.fda.gov/media/110357/download\u003c/span\u003e\u003cspan address=\"https://www.fda.gov/media/110357/download\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFang C. Efficacy Review of NDA 22\u0026ndash;450 N000 (IV Acetaminophen). FDA. 2017. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.fda.gov/media/103514/download\u003c/span\u003e\u003cspan address=\"https://www.fda.gov/media/103514/download\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHertz S. Analgesic Development for Pediatric Patients. FDA. 2016. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.fda.gov/files/advisory%20committees/published/Analgesic-Development-for-Pediatric-Patients.pdf\u003c/span\u003e\u003cspan address=\"https://www.fda.gov/files/advisory%20committees/published/Analgesic-Development-for-Pediatric-Patients.pdf\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSlater R, Worley A, Fabrizi L, Roberts S, Meek J, Boyd S, et al. Evoked potentials generated by noxious stimulation in the human infant brain. Eur J Pain. 2010;14:321\u0026ndash;6.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eVerriotis M, Fabrizi L, Lee A, Ledwidge S, Meek J, Fitzgerald M. Cortical activity evoked by inoculation needle prick in infants up to one-year old. Pain. 2015;156:222\u0026ndash;30.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHartley C, Duff EP, Green G, Mellado GS, Worley A, Rogers R et al. Nociceptive brain activity as a measure of analgesic efficacy in infants. Sci Transl Med. 2017;9.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCobo MM, Moultrie F, Hauck AGV, Crankshaw D, Monk V, Hartley C, et al. Multicentre, randomised controlled trial to investigate the effects of parental touch on relieving acute procedural pain in neonates (Petal). BMJ Open. 2022;12:e061841.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eHartley C, Moultrie F, Hoskin A, Green G, Monk V, Bell JL, et al. Analgesic efficacy and safety of morphine in the Procedural Pain in Premature Infants (Poppi) study: randomised placebo-controlled trial. Lancet. 2018;392:2595\u0026ndash;605.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDe Vet HCW, Terwee CB, Mokkink LB, Knol DL. Measurement in Medicine: A Practical Guide. Cambridge: Cambridge University Press; 2011.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMokkink LB, Terwee CB, Patrick DL, Alonso J, Stratford PW, Knol DL, et al. The COSMIN study reached international consensus on taxonomy, terminology, and definitions of measurement properties for health-related patient-reported outcomes. J Clin Epidemiol. 2010;63:737\u0026ndash;45.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFDA. Guidance for Industry Patient-Reported Outcome Measures. : Use in Medical Product Development to Support Labeling Claims. 2009. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://www.fda.gov/media/77832/download\u003c/span\u003e\u003cspan address=\"https://www.fda.gov/media/77832/download\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDe Vet HCW, Terwee CB, Mokkink LB, Knol DL. Reliability. Measurement in Medicine: A Practical Guide. Cambridge University Press; 2011. pp. 96\u0026ndash;149.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDe Vet HCW, Terwee CB, Mokkink LB, Knol DL. Validity. Measurement in Medicine: A Practical Guide. Cambridge University Press; 2011. pp. 150\u0026ndash;201.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eLarsson BA, Jylli L, Lagercrantz H, Olsson GL. Does a local anaesthetic cream (EMLA) alleviate pain from heel-lancing in neonates? Acta Anaesthesiol Scand. 1995;39:1028\u0026ndash;31.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eDe Vet HCW, Terwee CB, Mokkink LB, Knol DL. Interpretability. Measurement in Medicine: A Practical Guide. Cambridge University Press; 2011. pp. 227\u0026ndash;74.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMoher D, Shamseer L, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015 statement. Syst Reviews. 2015;4:1.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eShamseer L, Moher D, Clarke M, Ghersi D, Liberati A, Petticrew M, et al. Preferred reporting items for systematic review and meta-analysis protocols (PRISMA-P) 2015: elaboration and explanation. BMJ. 2015;349:g7647.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBaxter L, Slater R, Allegaert K, Davis J, Turner M, Ward RM et al. Is the measurement of noxious stimulus-evoked EEG activity a reliable, valid, and interpretable tool to assess analgesic efficacy in neonates? 2023. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://doi.org/10.17605/OSF.IO/6UHZE\u003c/span\u003e\u003cspan address=\"10.17605/OSF.IO/6UHZE\" targettype=\"DOI\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMcGowan J, Sampson M, Salzwedel DM, Cogo E, Foerster V, Lefebvre C. PRESS Peer Review of Electronic Search Strategies: 2015 Guideline Statement. J Clin Epidemiol. 2016;75:40\u0026ndash;6.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eMcGowan J, Sampson M, Salzwedel DM, Cogo E, Lefebvre C. PRESS Peer Review of Electronic Search Strategies: 2015 Guideline Explanation and Elaboration (PRESS E\u0026amp;E). CADTH Methods and Guidelines. 2016.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eClinicalTrials.gov. https://clinicaltrials.gov. Accessed 30 Jan 2024.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eICTRP Search Portal. https://trialsearch.who.int. Accessed 30 Jan 2024.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eThomas J, Graziosi S, Brunton J, Ghouze Z, O\u0026rsquo;Driscoll P, Bond M et al. EPPI-Reviewer: advanced software for systematic reviews, maps and evidence synthesis. 2022.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eC-Path RDCA-DAP Portal. \u003cspan class=\"ExternalRef\"\u003e\u003cspan class=\"RefSource\"\u003ehttps://portal.rdca.c-path.org\u003c/span\u003e\u003cspan address=\"https://portal.rdca.c-path.org\" targettype=\"URL\" class=\"RefTarget\"\u003e\u003c/span\u003e\u003c/span\u003e. Accessed 30 Jan 2024.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eJacobsen A, de Miranda Azevedo R, Juty N, Batista D, Coles S, Cornet R, et al. FAIR Principles: Interpretations and Implementation Considerations. Data Intell. 2020;2:10\u0026ndash;29.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eWilkinson MD, Dumontier M, Aalbersberg IJ, Appleton G, Axton M, Baak A, et al. The FAIR Guiding Principles for scientific data management and stewardship. Sci Data. 2016;3:160018.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRiley RD, Burke DL, Morris T. One-stage versus Two-stage Approach to IPD Meta-Analysis: Differences and Recommendations. In: Riley RD, Tierney JF, Stewart LA, editors. Individual Participant Data Meta-Analysis: A Handbook for Healthcare Research. Newark, UNITED KINGDOM: John Wiley \u0026amp; Sons, Incorporated; 2021. pp. 199\u0026ndash;217.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eRiley RD, Debray TPA, Morris TP, Jackson D. The Two-stage Approach to IPD Meta-Analysis. In: Riley RD, Tierney JF, Stewart LA, editors. Individual Participant Data Meta-Analysis: A Handbook for Healthcare Research. Newark, UNITED KINGDOM: John Wiley \u0026amp; Sons, Incorporated; 2021. pp. 89\u0026ndash;125.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eSchwarzer G, Carpenter JR, R\u0026uuml;cker G. Meta-Analysis with R. 1st edition. Springer Cham; 2015.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eViechtbauer W. Conducting Meta-Analyses in R with the metafor Package. J Stat Softw. 2010;36:1\u0026ndash;48.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eBueno M, Eriksson M, Stevens BJ. Neonatal and infant pain assessment. In: Stevens BJ, Hathway G, Zempsky WT, editors. Oxford Textbook of Pediatric Pain. Second Edition. Oxford, New York: Oxford University Press; 2021. p. 375\u0026ndash;90.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eFabrizi L, Slater R, Worley A, Meek J, Boyd S, Olhede S, et al. A shift in sensory processing that enables the developing human brain to discriminate touch from pain. Curr Biol. 2011;21:1552\u0026ndash;8.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003evan der Vaart M, Hartley C, Baxter L, Mellado GS, Andritsou F, Cobo MM, et al. Premature infants display discriminable behavioral, physiological, and brain responses to noxious and nonnoxious stimuli. Cereb Cortex. 2022;32:3799\u0026ndash;815.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCobo MM, Green G, Andritsou F, Baxter L, Evans Fry R, Grabbe A, et al. Early life inflammation is associated with spinal cord excitability and nociceptive sensitivity in human infants. Nat Commun. 2022;13:3943.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eCobo MM, Hartley C, Gursul D, Andritsou F, van der Vaart M, Schmidt Mellado G, et al. Quantifying noxious-evoked baseline sensitivity in neonates to optimise analgesic trials. eLife. 2021;10:e65266.\u003c/span\u003e\u003c/li\u003e \u003cli\u003e\u003cspan\u003eGursul D, Goksan S, Hartley C, Mellado GS, Moultrie F, Hoskin A, et al. Stroking modulates noxious-evoked brain activity in human infants. Curr Biol. 2018;28:R1380\u0026ndash;1.\u003c/span\u003e\u003c/li\u003e\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"
[email protected]","identity":"systematic-reviews","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"sysr","sideBox":"Learn more about [Systematic Reviews](http://systematicreviewsjournal.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/sysr/default.aspx","title":"Systematic Reviews","twitterHandle":"@MedicalEvidence","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"BMC/SO AJ","inReviewEnabled":true,"inReviewRevisionsEnabled":true},"keywords":"Neonate, pain, brain, electroencephalography, biomarkers, meta-analysis, individual participant data, validity, reliability, interpretability","lastPublishedDoi":"10.21203/rs.3.rs-4464161/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-4464161/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003e\u003cstrong\u003eBackground\u003c/strong\u003e: There are several major challenges limiting our ability to test analgesic efficacy for treatment of neonatal pain, and progress in analgesic drug studies in neonates has stalled. One significant issue is the reliance of clinical pain assessments on traditional behavioural and vital signs-based measures and the exclusion of novel brain-based biomarkers. In this review protocol, we outline our strategy to assess the reliability, validity, and interpretability of an electroencephalography (EEG)-based response biomarker for assessment of acute somatic nociceptive pain in neonates.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eMethods\u003c/strong\u003e: To standardise EEG analysis and generate the outcome of interest, we will perform an individual participant data (IPD) meta-analysis using data from neonates aged 34-44 weeks postmenstrual age that have had EEG recorded during acute somatic nociceptive skin-breaking procedures. Relevant data from both published and grey literature will be identified by searching six databases (MEDLINE, Embase, CINAHL, Web of Science, Scopus, Google Scholar), two clinical trial registry platforms (ClinicalTrials.gov, WHO ICTRP), and by consulting expert opinion. We will assess availability bias, data accuracy, and data quality by cross-referencing provided data with data descriptions in the literature, identifying duplicates and non-sensical values, and extracting quality control metrics. Data will be synthesised via a two-stage IPD meta-analysis using a random effects modelling approach grouped by site. Reliability (inter- and intra-rater) outcomes will be measured as Gwet's AC1 coefficient. Validity (known-groups and known-stimuli) outcomes will be measured as EEG response magnitude differences between clinically meaningfully different stimuli. Interpretability will be addressed by providing normative values, in both original and standardised units.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eDiscussion\u003c/strong\u003e: The purpose of this study is to establish the reliability, validity, and interpretability of a specific EEG-based response biomarker for assessing acute somatic nociceptive pain in neonates. It will provide an overview of available data and how EEG is being used globally to assess acute neonatal pain. If sufficient IPD are made available and the outcome is reliable, valid, and interpretable, this work will support the use of EEG-based outcome measures as primary endpoints in clinical trials assessing analgesic efficacy in neonates.\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eSystematic review registration\u003c/strong\u003e: The protocol was registered with PROSPERO on 14-July-2023: CRD42023444809.\u003c/p\u003e","manuscriptTitle":"Is noxious stimulus-evoked electroencephalography response a reliable, valid, and interpretable outcome measure to assess analgesic efficacy in neonates? A systematic review and individual participant data (IPD) meta-analysis protocol","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-06-06 03:37:33","doi":"10.21203/rs.3.rs-4464161/v1","editorialEvents":[{"type":"communityComments","content":0},{"type":"decision","content":"Accept","date":"2025-06-20T23:24:43+00:00","index":"","fulltext":""},{"type":"reviewerAgreed","content":"","date":"2024-08-02T13:30:09+00:00","index":0,"fulltext":""},{"type":"reviewersInvited","content":"","date":"2024-05-24T14:07:06+00:00","index":"","fulltext":""},{"type":"editorAssigned","content":"","date":"2024-05-23T04:37:24+00:00","index":"","fulltext":""},{"type":"submitted","content":"Systematic Reviews","date":"2024-01-31T13:18:05+00:00","index":"","fulltext":""}],"status":"published","journal":{"display":true,"email":"
[email protected]","identity":"systematic-reviews","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":false,"externalIdentity":"sysr","sideBox":"Learn more about [Systematic Reviews](http://systematicreviewsjournal.biomedcentral.com/)","snPcode":"","submissionUrl":"https://www.editorialmanager.com/sysr/default.aspx","title":"Systematic Reviews","twitterHandle":"@MedicalEvidence","acdcEnabled":true,"dfaEnabled":true,"editorialSystem":"em","reportingPortfolio":"BMC/SO AJ","inReviewEnabled":true,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"9d11aa4a-8a2e-4883-a8e1-4276d8735aed","owner":[],"postedDate":"June 6th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[],"tags":[],"updatedAt":"2025-07-28T15:59:20+00:00","versionOfRecord":{"articleIdentity":"rs-4464161","link":"https://doi.org/10.1186/s13643-025-02890-4","journal":{"identity":"systematic-reviews","isVorOnly":false,"title":"Systematic Reviews"},"publishedOn":"2025-07-26 15:57:03","publishedOnDateReadable":"July 26th, 2025"},"versionCreatedAt":"2024-06-06 03:37:33","video":"","vorDoi":"10.1186/s13643-025-02890-4","vorDoiUrl":"https://doi.org/10.1186/s13643-025-02890-4","workflowStages":[]},"version":"v1","identity":"rs-4464161","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-4464161","identity":"rs-4464161","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}
Text is read by the "Ask this paper" AI Q&A widget below.
Extraction quality varies by source — PMC NXML preserves structure
cleanly, OA-HTML may include some navigation residue, and OA-PDF can
have broken hyphenation. The publisher copy
(via DOI)
is the canonical version.