Prediction of attention deficit hyperactivity disorder based on explainable artificial intelligence

preprint OA: closed
Full text JSON View at publisher
Full text 180,218 characters · extracted from preprint-html · click to expand
Prediction of attention deficit hyperactivity disorder based on explainable artificial intelligence | Research Square window.SnipcartSettings = { analytics: { enabled: false } }; (function() { var accessVector = localStorage.getItem('access_vector') || ''; window.dataLayer = window.dataLayer || []; if (accessVector) { window.dataLayer.push({ user: { profile: { profileInfo: { snid: accessVector } } } }); } })(); (function(w,d,s,l,i){w[l]=w[l]||[];w[l].push({'gtm.start':new Date().getTime(),event:'gtm.js'});var f=d.getElementsByTagName(s)[0],j=d.createElement(s),dl=l!='dataLayer'?'&l='+l:'';j.async=true;j.src='https://www.googletagmanager.com/gtm.js?id='+i+dl;f.parentNode.insertBefore(j,f);})(window,document,'script','dataLayer','GTM-K279D39R'); Browse Preprints In Review Journals COVID-19 Preprints AJE Video Bytes Research Tools Research Promotion AJE Professional Editing AJE Rubriq About Preprint Platform In Review Editorial Policies Our Team Advisory Board Help Center Sign In Submit a Preprint Cite Share Download PDF Research Article Prediction of attention deficit hyperactivity disorder based on explainable artificial intelligence Ignasi Navarro-Soria, Juan Ramón Rico-Juan, Rocío Juárez-Ruiz de Mier, and 1 more This is a preprint; it has not been peer reviewed by a journal. https://doi.org/ 10.21203/rs.3.rs-3992393/v1 This work is licensed under a CC BY 4.0 License Status: Published Journal Publication published 08 Apr, 2024 Read the published version in Applied Neuropsychology: Child → Version 1 posted You are reading this latest preprint version Abstract The aim of this study is to predict the probability being diagnosed with ADHD using ML algorithms and to explain the behavior of the model to support decision making. The dataset studied included 694 cases. Information was obtained on age, sex and WISC-IV scores. Algorithms belonging to different ML learning styles were tested. A stratified 10-fold-cross-validation was applied to evaluate the models. The metrics were used: accuracy, area under the receiver operating characteristic, sensitivity and specificity. We compared models using all initial features and a suitable wrapper-type feature selection algorithm. After, we calculated Shapley additive values to assign weights to each predictor based on its additive contribution to the outcome and explain the predictions. The Random Forest algorithm performed best on most metrics. The main predictors included, GAI-CPI, WMI, CPI, PSI, VCI, WMI - PSI, PRI and LN. The ML model adequately predicted ADHD diagnosis in 90% of cases. Figures Figure 1 Figure 2 Figure 3 Figure 4 Figure 5 Figure 6 Figure 7 Figure 8 Figure 9 Figure 10 1. Introduction Attention deficit hyperactivity disorder (ADHD) is a neurodevelopmental disorder that, according to the DSM-V, begins in childhood (APA, 2013). It affects a considerable number of children between 3% and 7%, being more frequent among boys. The diagnosis is usually made around 5 to 8 years of age, coinciding with the onset of schooling (Alda et al., 2012 ). Symptoms emerge in childhood, but persist into adolescence and adulthood (Simon et al., 2009 ). According to the American Psychiatric Association ( 2013 ) ADHD comprises a persistent pattern of behaviors characterized by three types of symptoms: inattention, hyperactivity and impulsivity. Primarily, the diagnosis is based on behavioural symptoms and is made before the age of 12. For a diagnosis to take place, at least two of the following areas must be affected: home life, school or social situations (Maciá, 2012). ADHD is a neurodevelopmental disorder (NDD) with a strong hypothesis of genetic variation (Thapar, 2018 ) which seems to cause alterations in certain neurotransmitters (Mehta et al., 2019 ), producing shortfalls in basic psychological processes such as Working Memory (WM) (Yang, Allen, Holmes, & Chan, 2017 ; Kubo et al., 2018 ; Kibby et al., 2019 ; Moura, Costa, & Simoes, 2019) and Processing Speed (PS) (Cheung, Fazier-Wood, Asherson, & Kuntsi, 2014; Fosco et al., 2020 ), and directly impairing executive functioning (Barkley, 1997 , 2005 ; Lavigne Cerván & Romero Pérez, 2010; Kim et al., 2020 ). Consequently, ADHD is fundamentally distinguished by a persistent pattern of inattention, hyperactivity, and impulsivity that significantly interferes with individual, social, academic, or occupational functioning (APA, 2013; Bélanger et al., 2018 ). ADHD is the most common NDD in children and adolescents (Polanczyk et al., 2014 ; Thomas et al., 2015 ). Its characteristic symptomatology usually appears between 6–12 years of age (APA, 2013; Asherson & Agnew-Blais, 2019 ), a stage of evolutionary development highly sensitive to psychoeducational intervention (Baloh, 2022 ). Therefore, early diagnosis and, consequently, early care are essential to ensure a favorable prognosis. (Rappaport et al., 1998 ; Sonuga-Barke et al., 2011 ; Hamed et al., 2015 ; Miller et al., 2018 ; Hare et al., 2021 ; Kajka & Kulik, 2021 ). However, ADHD is a NDD that entails a complex assessment (Navarro-Soria et al., 2020 ), with a diagnostic evaluation that generally combines the implementation of direct observation in natural environments, semi-structured interviews and parent and teacher observation scales with the use of psychometric tests (Reid & Maag, 1994 ; Isquith et al., 2013 ; Krieger & Amador-Campos, 2018 ; Marshall et al., 2021 ). The Wechsler Intelligence Scale for Children (WISC) proposes the most widely used psychometric test in the world (Oakland & Hu, 1992 ; Oakland et al., 2013 ; Oakland et al., 2016 ; Benson et al., 2019 ; Muñiz et al., 2020 ). It continues to be one of the most widely used instruments within the common batteries for the clinical assessment of ADHD (Bustillo & Servera, 2015 ; Calub et al., 2019 ; Fenollar-Cortés et al., 2015; Fenollar-Cortés et al., 2019 ; Fernández-Jaén et al., 2012 ; Gomez et al., 2016 ; Jiang et al., 2015 ; Kim & Song, 2020 ; Krane & Tannock, 2001 ; Matsuura et al., 2014 ; Mayes & Calhoun, 2004 ; Mayes & Calhoun, 2006 ; Navarro-Soria et al., 2020 ; Snow & Sapp, 2000 ; Styck & Watkins, 2017 ; Parke et al., 2020 ; Thaler et al., 2015 ; Ulla et al., 2007 ; Walg et al., 2017 ; Wanderer et al., 2021 ). Specifically, its fourth edition (WISC-IV; Wechsler, 2003) has been a notable help in the diagnostic evaluation of ADHD, due to the particular configuration of its different indexes. The first two subscales of the WISC-IV, Verbal Comprehension (VC) and Perceptual Reasoning (PR), do not seem to correspond to variables affected in ADHD, however, WM and PS do relate to altered basic psychological processes (Yang, Allen, Homes, & Chan, 2017; Kubo et al., 2018 ; Kibby et al., 2019 ; Moura, Costa, & Simoes, 2019; Cheung, Fazier-Wood, Asherson, & Kuntsi, 2014; Fosco et al., 2020 ). The combination of the WISC-IV and ADHD seems to reveal a response pattern characterized by higher scores in VC and PR, and lower scores in WM and PS (Barkley, Murphy, & Bush, 2001 ; Mayes & Calhoun, 2006 ; Arribas, Santamaría, Sueiro, & Pereña, 2011; Bustillo & Servera, 2015 ; Corral, Yang et al., 2013 ; Toffalini et al., 2022 ). Thus, the very structure of the WISC-IV allows to observe in ADHD what some authors call a "cognitive step", i.e., stable scores in VC and PR, with sharp score drops in WM and PS. (Fenollar-Cortés et al., 2015; Navarro-Soria et al., 2020 ). On the other hand, there are algorithms based on machine learning (ML), which is an area of Artificial Intelligence (AI) used to detect patterns in data and make predictions (supervised learning) automatically. These systems require two steps: the first is that they use the algorithms to train models from data; and the second is that the trained model is used to make predictions on unknown data. More and more ML is being applied to problems in different domains such as education (Rico-Juan et al., 2019 ; Cachero, et al., 2023 ), industries (Dalzochio et al., 2020 ; Liu, 2020 ; Dhini et al., 2021 ) and of course healthcare (Javaid et al., 2022 ; Lui et al., 2022). The main reason why ML algorithms were not used so far in data analysis is due to their difficult interpretation in most cases. A colloquial term often used to refer to these systems is black boxes (Koh and Liang, 2017 ). The algorithms have different learning styles as neighborhood-based, decision trees, support vector machines or neural networks with algorithms capable of inferring relationships between variables, both linear and nonlinear relationships. This feature makes the resulting models better suited to data distributions than purely linear models (Liang et al., 2022 ). The usual practice is to try different algorithms for a problem and choose the one or ones that obtain the best results to make the predictions. In addition, recent advances in the area known as eXplicable Artificial Intelligence (XAI) (Barredo Arrieta et al., 2020 ) make it easier for us to extract some useful and understandable information from ML models. Specifically, post-hoc techniques offer a unified approach to explicate the predictions made by an ML model. SHAP (SHapley Additive exPlanations) (Lundberg, 2019 ) is an example of a tool whose calculations are based on game theory and which allows us to know the importance of variables at a general level, by groups or at an individual level. Taking into account the above, the contributions of this research are: (1) to apply ML-based techniques to the test results of 694 cases to improve the detection of cognitive patterns of ADHD, (2) to offer new relationships between the variables obtained in the diagnostic process for use in an automated diagnostic aid system that includes feedback based on the explainability of ML systems. The intention is that this system will be part of a web application in the future to help psychologists not specialized in the detection of ADHD, as well as to cross-check the diagnoses of specialists. 2. Research method 2.1. Objectives and context definition Cognitive patterns associated with different neurobiological disorders such as ADHD -among others-, have been intensively studied during the last two decades with the help of traditional statistical tools (Asherson & Agnew-Blais, 2019 ; Fenollar et al., 2015; Fosco et al., 2020 ; Navarro et al., 2020). These methodologies mainly analyze and contrast linear relationships between variables whose behaviors are usually assumed to be normal, which is a limitation both for making predictions and for explaining their relationships. The current proposal is oriented to the application of ML algorithms to alleviate these limitations. These algorithms have the ability to learn or adjust value structures in a first training phase to create a predictive model capable of relating the different input variables, whether these are linear or not. Moreover, this learning can be performed on any type of data distribution, which can help us to better explain the results, obtaining even greater reliability than those achieved so far with traditional statistical tools. In addition, recent advances in ML explainability (Barredo Arrieta et al., 2020 ) now allow predictions to be explained with post-hoc agnostic models. This, given the complexity of the diagnostic processes, especially in children and adolescents, would provide a useful tool for professionals to obtain valid information to aid diagnostic decisions. For this reason, we propose (see Fig. 1 ): (1) Apply ML-based techniques as a strategy to improve the detection of cognitive patterns associated with ADHD in children aged 6 to 16 years, (2) To have an automated diagnostic aid system that includes feedback based on the explainability of the systems. This system will be based on the significant relationships between the scores obtained in the WISC-IV of boys and girls ages 6 to 16 years of age. Figure 1 represents in a visual way the application of our methodology where the group of experts would be in charge of applying the psychometric tests (WISC IV) to the subjects, obtaining the verified results and updating the database. In addition, updating the diagnostic aid system. 2.1. Participants The recruited sample consisted of 406 subjects aged 6–16 years, male and female, derived from child and adolescent mental health units in the province of Alicante (Valencian Community), the Region of Murcia and the province of Malaga (Andalusia). As shown in Table 1 , of the subjects in the clinical group (ADHD), 326 were male (80.3%) and the mean age was 9.4 ( SD = 2.2). The inclusion criteria were as follows: having been previously diagnosed with ADHD by a clinician, having an IQ equal to or higher than 80 and not observing significant symptoms of severe mental disorders or severe medical illnesses. Subjects who did not meet these criteria were excluded from the sample. In all cases, the legal guardians of the minors signed the informed consent to participate in the current research. Table 1 Basic statistics on the data collected. N, M and SD are the number of cases, the mean and the standard deviation, respectively. ADHD ‘No’, refers to the control group. ADHD Gender N M SD No Male 195 9.7 2.7 Female 93 9.4 2.5 Yes Male 326 8.7 2.2 Female 80 9.0 2.0 On the other hand, the control group consisted of 288 subjects collected from public and private schools in the urban areas of Alicante, Murcia, Marbella and Malaga, made up of children of both sexes and ages between 6 and 16, of whom 195 were boys (67.7%) and the average age of the control group was 9.5 years old. ( SD = 2.51). Furthermore, all participants in the control group had to meet the same inclusion criteria as the clinical group (with the exception of a prior diagnosis of ADHD) and obtain scores on the Strengths and Difficulties Questionnaires (SDQ) for Parents and Teachers (Goodman, 1997 ), in both informants, below the clinically significant lower limit on any of its subindices. Finally, the total sample amounted to 694 subjects, including clinical and neurotypical population. 2.2. Instruments During the process of sample selection and data collection, the following psychometric tools were used: Strengths and Difficulties Questionnaire for Parents and Teachers (SDQ; Goodman, 1997 ): a 25-item Likert-type short behavioural screening questionnaire. For the investigation the “hyperactivity” subscale was used with its stipulated clinical limits, except in subjects in whom the inattention dimension predominated, in which case the limit was set at a score equal to or higher than 4 (Ullebo, Posserud, Heiervang, & Gillberg, 2011). Subjects who did not obtain a clinically significant score, from family and the school, were excluded from the clinical sample. ADHD-Rating Scale-IV (For Children and Adolescents) (RS; DuPaul, Power, & Anastopoulos, 1998): An 18-item Likert-type scale based on the DSM-IV diagnostic criteria for ADHD, which measures the dimensions "inattention" and "impulsivity/hyperactivity", both in the home and school environment. The Spanish version was used (Servera & Cardó, 2007), considering a score of ≥ P90 in each dimension as clinically significant. Conners 3rd edition for parents and teachers (spanish version) (Conners, 2008 ): Likert type scale of 45 (parents version) and 41 items (teachers version) that generates 5 indices: "Inattention", "Hyperactivity/Impulsivity", "Learning Difficulties/Executive Functions", "Aggressiveness/Defiance" and "Relations with peers". For this research, the score cut-off point was set as T ≥ 65 for the index to be considered clinically significant. The Parental Account of Childhood Symptoms (PACS) (Taylor, Schachar, Thorley, & Wieselberg, 1986 ): a member of the research team conducted a semi-structured interview with the family to verify that the subject met the ADHD criteria of the DSM-V (APA, 2013) as well as to review the clinical history and corroborate that the inclusion criteria of the sample design were met.. Wechsler Intelligence Scale for Children-Fourth Edition (WISC-IV; Wechsler, 2005 ): scale composed of 15 tests for children between 6–16 years of age. The Spanish version was used for this study. The result of each of the tests is related to the level of cognitive development in a series of intellectual competencies the results from which 5 indexes are obtained: Verbal Comprehension (VC), Perceptual Reasoning (PR), Working Memory (WM), Processing Speed (PS), Total Intellectual Quotient (FSIQ). In addition, we found 2 composite indices: the Index of Cognitive Competence (ICC) resulting from the grouping of WM-PS and the Index of General Ability (GAI) resulting from the VC-PR grouping. This scale has demonstrated the adequacy of reliability and validity coefficients in both the American and Spanish versions. Therefore, the reliability of the WISC-IV in its Spanish adaptation, presents coefficients in the range of 0.86 (IVP) to 0.95 (FSIQ) for the split-half method, which are similar to the coefficients found in the American version, demonstrating the high consistency and stability of the test. 2.3. Experimental planning In order to carry out the current research, child and adolescent mental health units with neuropediatrics or psychiatry and psychology services were contacted in the provinces of Malaga, Alicante and Murcia. Researchers from the task force presented them with the aims of the research and the request to recruit data on boys and girls aged 6 to 16 years with a diagnosis of ADHD having previously been made by a clinician, having an IQ equal to or higher than 80 and not observing significant symptoms of severe mental disorders or severe medical illnesses. Previously, the legal guardians of each of the participants had to sign an informed consent form. The data collection was carried out by three administrators (psychologists trained specifically for this purpose), one from each province, who used the same assessment protocol that included the tools described above. Regarding the participants of the control group, contact was made with various educational guidance departments of public and private schools in the urban areas of Alicante, Murcia, Marbella and Malaga. The objectives of the study were presented to them and authorization was requested to obtain data on boys and girls between 6 and 16 years of age, with normotypical development, after the signing of the informed consent by the legal guardians of each minor. This research was carried out in accordance with the 1964 Declaration of Helsinki and its subsequent modifications. Likewise, approval was requested from the Ethics Committee of the University of Alicante, which provides and approves the methodology used, assigning the approval the file number UA-2018-03-08. The collection of the control group data was carried out by two researchers of the team, collecting data in parallel in the areas of Alicante and Murcia, on the one hand, and in Marbella and Malaga, on the other. 3. Data analysis For the experimentation we have considered a validation scheme for the training of ML algorithms, we have selected a series of metrics applied to this type of studies to evaluate the results, the selection of a subset of representative variables with respect to the total to concentrate the study on these variables, and finally, we have added an explanatory part on the predictions. 3.1. Machine learning approach Various learning styles of ML algorithms were tested to study which algorithms perform better in their predictions in order to explain more accurately the relationships between independent and dependent variables. The chosen styles can be grouped into neighborhood-based, decision trees, support vector machines, Bayesian models and neural networks. In the following we will briefly detail what each of the selected algorithms consists of: Baseline: This is the baseline model and follows a uniform distribution of the data whose prediction is around 50% for a binary classification (our study). Its behavior is like that of a random system. Decision tree (Breiman et al., 2017): This model follows a hierarchical structure. This structure is built from a data set by selecting the most relevant variable and applying a cutoff value creating two subsets of samples. This same scheme is repeated for the subsets generated until a finish condition is satisfied. Random forest (Breiman, 2001 ): This algorithm builds several decision trees in order to combine their predictions into one. The combination of several predictions leads to a more robust behavior. AdaBoost (Adaptive Boosting) (Freund & Schapire, 1997 ): This algorithm is based on the use of different linear regressors. The final prediction is calculated from the regressors weighted according to the confidence learned by the algorithm in the training phase. eXtreme Gradient Boost [XGBoost] (T. Chen & Guestrin, 2016 ), gradient boosting with categorical features support [CatBoost] (Dorogush et al., 2018 ) and Light Gradient Boosting Machine [LightGBM] (Ke et al., 2017 ): These algorithms use the boosting technique combined with the creation of several decision trees, where the optimization processes use derivable cost functions along with weight updating with gradient descent (as in neural networks). Each algorithm applies different types of optimizers and all of them have performed well in open challenges. Support vector machines [SVM] (Cortes & Vapnik, 1995 ): These algorithms are divided into two stages. In the first phase, the original data space is transformed to another space, usually of higher dimension. In a second phase, a linear hyperplane is sought that separates and distances as much as possible the classes in the resulting space. Multilayer perceptron [MLP] (Hinton, 1989): This type of neural network is basic but at the same time widely used its basic feature is that all neurons of consecutive layers are connected to each other. Gaussian Naive Bayes [Gaussian NB] (Chan et al., 1983 ): This model is based on the Bayes model, uses conditional probabilities and assumes independence among the variables in the model. Nearest neighbors [NN] (Cover & Hart, 1967 ): The class of a new sample is calculated based on the k (parameter) nearest training samples and their associated classes. The proximity function used is the Euclidean distance, and the parameter k was set to 1, 3, 5 and 7. 3.2. Validation and metrics In order to know the goodness of prediction of a model, a validation technique is applied. In our case we will use the well-known k-fold cross-validation (Fig. 2 ) which consists of creating k disjoint partitions of similar size, leaving one as a test and using the rest as a training set, iterate k times obtaining k different models and calculating their mean as the final result. The usual value to set k is 10. Let's start by defining the type of hits and misses that occur when we classify samples in a binary or dichotomous problem such as ours (has or does not have ADHD) (see Table 2 ). Table 2 Binary classification with the four possible cases. Predicted condition Positive Negative Actual condition Positive True positive (TP) False negative (FN) Negative False positive (FP) True negative (TN) The metrics we will use for this study are those commonly used. Accuracy [1] is the number of hits divided by the total number of predictions. Sensitivity [2] is the probability of being right when a sample is truly positive. Specificity [3] is the probability of being right when a sample is truly negative. Area Under the Receiver Operating Characteristic (AUC) [5] can be interpreted as the probability that a classifier will order a randomly chosen positive instance higher than a negative one. \(Acc =\frac{TP + TN}{TP +FN + FP + TN}\) [1] \(Sensitivity= \frac{TP}{TP + FN}\) [2] \(Specificity= \frac{TN}{TN + FP}\) [3] \(FPR= \frac{ FP}{FP + TN}\) [4] \(AUC= \frac{1 + Sensitivity - FPR}{2}\) [5] 3.3. Feature selection When conducting a study of this type, it is interesting to explore the possibility of reducing the number of input variables to the model (independent variables) while maintaining its predictive capacity. In this way, variables that are not relevant to the problem are removed, the learning of the models is facilitated since they focus on the most important information, and it is necessary to know fewer variables that the specialist must provide when predicting a new sample with the consequent saving of time and effort. There are three types of feature selection approaches: the filter, the wrapper and the embedding approach (Guyon and Elisseeff, 2003). The filter approach is based on analyzing, evaluating and assessing the importance of each characteristic separately and assessing their importance to remove those below a threshold (Chandrashekar and Sahin, 2014 ). However, it ignores the dependencies between features, it is applied independently of the classifier so it lacks classifier verification (Ang at al., 2015). The wrapper approach trains a predictive model with a subset of features, and determines its validity (Gnana et al., 2016 ). This approach solves the problems of the filter method although it increases the computational cost of the process. Finally, the embedding approach focuses on searching for the ideal feature subset for a particular classification algorithm (Khaire & Dhanalakshmi, 2022 ) but it is influenced by the classifier hypothesis and is not compatible with other classifiers. Therefore, our study uses an advanced wrapper-type method called Boruta (Kursa and Rudnicki, 2010 ) to extract the most relevant features for use with different classifiers. It is a statistical algorithm and consists of duplicating each variable (shadow) and shuffling its values, and applying some classifier based on decision trees (e.g. Random Forest) to perform statistical tests on the results between the original variables and the shadows (artificially created and representing noise). When the results of the original variables are significantly better than a threshold computed with the shadow variables, they are identified as important. This process is repeated several times until it is determined which variables are important in most tests. As we can see in Fig. 3 , eight variables are selected (colored in green): GAI - CPI, WMI, VCI, WMI - PSI, PSI, CPI, LN and PRI. Hereafter we will refer to All when using all features and FS when using feature selection with these eight variables. 3.4. Results To avoid the presence of outliers, while normalizing important differences in scales between the different variables, we applied a robust scaling method (RS) (see Eq. [6]) before proceeding to train the models. \(RS\left(v\right)= \frac{i - Q2\left(v\right)}{Q3\left(v\right) - Q1\left(v\right)} :i∊v\) [6] Where v the de variable, i is a value of the variable and Q1 , Q2 and Q3 of v are the respective quantiles. Figure 4 shows the average results of the cross-validation according to the two groups of characteristics and metrics proposed. We observe the similarity of results between the best All and FS algorithms for the different metrics, since FS has a less number of features we will consider only FS for the following experiments. It is also desirable to contrast the results to know if the difference between the algorithms is significant or not. For this we use the Wilcoxon paired test (Wilcoxon, 1945 ) with 95% confidence which is the commonly used value. Figure 5 shows a comparison of statistical significance between algorithmic pairs according to ACC as it is one of the main metrics. It can be observed that: i) Baseline, Gaussian NB and NN get the worst results; ii) The best algorithms are based on decision trees (Random Forest, CatBoost and LGBM); Random Forest is significantly better than the rest in terms of accuracy. In addition, we have also performed preliminary tests on some partitions with hyperparameter optimization algorithms (Jamieson & Talwalkar, 2016 ; Li et al., 2017 ) on the ML models that have obtained the best results such as Random Forest, CatBoost and LightGBM, since despite having a high computational cost with respect to the simple learning of a single model, sometimes this parameter adjustment is justified by the improvement in the results in the predictions. On this occasion, the differences obtained were not significant with respect to the default parameters, so we ruled out performing the complete tests with cross-validation. Therefore, since Random Forest is significantly better than the rest in accuracy, and in addition, it is one of the best in all other metrics (Fig. 3 ), we selected it as the algorithm with feature selection variables (FS) for explainability. 3.5. Machine learning and explainability The best predictions are usually made by the most complex models and to try to explain why they make consistent predictions there is a so-called post-hoc approach that can be applied to any ML model once trained. The first post-hoc explainability approach is based on making permutations (Breiman, 2001 ) on the values of each individual input variable (predictor) and comparing the variability directly on its predictions. This form allows us to estimate the importance of the predictors of a model that is already trained; the second approach is based on the construction of a new linear model that explains the complex model already trained. Basically, this type of explainability can be tackled in two ways. One is based on using permutations on the independent variables in a controlled way (Breiman, 2001 ) and measuring how they affect the dependent variable. In this way the importance of each variable can be estimated. The second approach is based on game theory and makes use of Shapley values (Roth, 1988 ). The very definition of these values coincides with the objective of ML to explain the models. These values calculate the marginal contribution of each independent variable considering all possible combinations with the rest of the variables to explain the final outcome (dependent variable). In addition, this method ensures local accuracy, missingness and consistency. Figure 6 shows the general post-hoc scheme when combined with ML. As we can see, the main prediction system is trained with a machine learning model to issue a diagnosis (decision), and in parallel the explainability system estimates the importance of the values of the input variables with respect to the decision taken. This information is used to provide feedback on the prediction process. Shapley values are useful when we want to know the overall impact of a variable but cannot explain the impact on a group of samples or on a specific sample. Lundberg and Lee (Lundberg & Lee, 2017a , Lundberg & Lee, 2017b ) have made these advances by solving the above problems. One tool that follows this approach is SHAP (SHapley Additive exPlanations) (Lundberg, 2019 ) which allows scaling explainability at the global, group or individual level. This explainability of the ML model may be of great interest to both highly skilled and unskilled ADHD screening personnel. In the first case, it can confirm or not a diagnosis and the professional himself/herself could help to improve the model, and in the second case our system, in addition to issuing a diagnosis, guides the staff by explaining which factors are relevant to a particular diagnosis and the certainty he has in it. 3.6. Importance of the model's features The importance of the features of the model depends on the data used in the training. In our case, we use the Random Forest algorithm, according to the results obtained in the Experimentation section. In this trained model, Shapley values are calculated for each variable and their cumulative absolute values determine their importance. Figure 7 shows the predictors of the model in order of importance. Figure 8 shows the individual SHAP values by coloring in red those cases diagnosed as ADHD and in blue those not diagnosed as ADHD. As we can see in most of the subfigures the relationships are not linear with the input values which makes this ML model a better fit than other linear ones. On the other hand, from this graph we can also extract which value or range of values are the most neutral (the y-axis value is zero and is represented by a dashed line) for each variable. For example, in GAI - CPI a value around 10 is neutral the higher the system tends to predict having ADHD, while if it is lower it tends to predict the opposite; in the variable WMI its neutral value is around 90 and the opposite happens with the previous variable the lower it is the more tendency to diagnose ADHD; As a special case, the variable WMI - PSI where the neutral values are around − 10 or + 10, between this range the tendency is to be a negative case and outside this range to be a positive case. 4. Discussion A large group of international research teams, with the aim of facilitating a complex clinical diagnosis, center their works around the detection of a cognitive pattern in infant-juvenile population presenting ADHD (Cheung, Fazier-Wood, Asherson, & Kuntsi, 2014; Fenollar-Cortés et al., 2015; Fosco et al., 2020 ; Kubo et al., 2018 ; Kibby et al., 2019 ; Moura, Costa, & Simoes, 2019; Navarro-Soria et al., 2020 ; Yang, Allen, Homes, & Chan, 2017). Therefore, the purpose of this work is not so much to corroborate the existence of this cognitive pattern, but to propose an effective system to support the clinician in making diagnostic decisions. In the current research, ML-based algorithms are used to determine which variables measured in the cognitive assessment by WISC-IV are important and their impact on a diagnosis of ADHD. From the results shown in the previous section, specifically those developed in Fig. 3 which represents feature selection, it follows that the difference between the GAI and CPI indices, and the classic WMI index, as demonstrated in previous studies, (Fenollar-Cortés et al., 2015; Mayes & Calhoun, 2006 ; Thaler et al., 2015 ), remain highly reliable predictor indices in the diagnosis of ADHD. Continuing with this figure, it is also significant the importance detected by the algorithms used in indexes such as VCI or PRI which, despite measuring cognitive skills associated with brain areas less affected by ADHD, stand out as adequate predictors. This may be due to its stability in relation to the FSIQ and to the fact that when a cognitive step is detected in the VCI and PRI indices, the pattern coincides with disorders other than ADHD, providing information to the algorithms used to discard the results as typical of the target pattern in this study. Also, the difference between the WMI and PSI indices is presented as relevant in the detection of the cognitive pattern, as stated in previous research (Navarro-Soria et al., 2020 ;). Naturally, the PSI index continues to be important in detecting the cognitive pattern of ADHD, with a somewhat lower weight than WMI, as expected, since depending on whether the disorder presents as combined or inattentive, the impairment to the skills related to the PSI index will be higher or lower. The CPI composite index, stands out in third position of importance, which is calculated from the WMI and PSI indices and their component subtests that are associated with the cognitive skills most affected by ADHD. Finally, the LN subtest, which in the practical field of clinical neuropsychological assessment has always been an indicator widely taken into account to support the diagnosis of ADHD, the algorithms used highlight it as significant, but below the level of predictive ability of the other previously mentioned indexes, the other data being more robust and effective. To explain how this system would work as a diagnostic support system, the four possible cases are presented, knowing that the accuracy of the model is around 90%. In the following cases we will assume that probabilities higher than 0.5 would be diagnosed as ADHD. In practice, these graphs with the breakdown of the odds and the final probability would have to be interpreted by a specialist. The first two cases are for a true positive (TP) and a true negative (TN). On the other hand, as observed in Fig. 9 , the algorithms chosen by the implemented IA use the 8 most effective variables to ensure a high percentage of success in classifying the subjects from the sample. In these examples, in which the IA correctly classifies, associating a specific pattern according to the importance of the profile values of a subject diagnosed with ADHD (figure b, with an estimated probability of 0.97) and discarding another with a negative diagnosis (figure a, with an estimated probability of 0.024), because it fits a pattern of normal subjects (control group) and thus allows us to check the way in which the algorithm is based to calculate the probability of diagnosis. In the first graph presented in this figure, it can be seen that the IA gives great relevance to the difference between the GAI and CPI indices, which as stated previously, is very relevant in these cases. Also considered of relevance is the CPI, the composite index that is most affected by ADHD and, of course, the WMI and PSI indexes. On the other hand, the graph in Fig. 9 b shows that the coincidence with the expected scores for the cognitive pattern associated with ADHD, in results such as the difference between GAI and CPI, and the higher than usual scores for ADHD in the CPI and WMI indices are sufficient criteria for IA to classify the cognitive profile as non-ADHD. As we can see in Fig. 9 a, the majority of values indicate a negative diagnosis. While in Fig. 9 b the values indicate the opposite. The final probabilities of the diagnoses are 0.024 and 0.97 out of 1. In these cases presented as examples, it can be seen in Fig. 10 a that, despite being a case diagnosed with ADHD with a final probability estimated by the model at 0.281 (false negative example), the cognitive step affects the scores in VCI and PRI more, which means that the difference between GAI and CPI is inverted, this being incongruent with what, as a general rule, has been observed by the IA in the database from which it learns.. This unusual profile would be a false negative and to perform a proper classification of profiles like these, the number of cases with these characteristics need to be expanded. On the other hand, in the second graph, Fig. 10 b (false positive example), a singular case is presented which, among the professionals who evaluate using WISC, is not surprising, but it is a case of false positive, since its estimated probability is 0.628. Giftedness, which affects more or less regularly all cognitive skills, in the case of presenting with a diagnosis of ADHD may represent a significant cognitive step between the skills affected by the disorder and those not affected, obtaining scores within the normal range in the affected areas, if compared with subjects with average scores. While in the indexes measuring cognitive skills not affected by ADHD, their results are those of giftedness. The cognitive step exists, but it is only observed in comparison of the scores with the same subject, in the score resulting from the difference between GAI and CPI, the rest of the scores are within normality, despite being altered by ADHD. 5. Conclusions and future works The results obtained in the presented study support the idea that IA can facilitate and improve the support to neuropsychological diagnosis made by medical doctors, psychologists and psychopedagogues, applying the knowledge we already have about cognitive patterns and the detailed observation of the data it obtains. We have selected 8 of the 20 initial variables as the most relevant, which in order of importance are GAI - CPI, WMI, CPI, PSI, VCI, WMI - PSI, PRI and LN, obtaining very promising results such as 0.9 hits, 0.94 of area under the ROC curve, 0.91 sensitivity and 0.92 specificity. In addition, XAI techniques allow detailing the most relevant factors of each case to facilitate the work of professionals new to this type of diagnosis or to help contrast them with those of experts. All these results have been obtained from a sample of around 700 subjects between the clinical and control groups, which from a clinical point of view is a large sample that is costly to obtain, although we consider this to be a limitation. Increasing the database with new records in both clinical and control groups will also help to increase the representativeness of the cases and thus allow the ML algorithms to improve their training, with this improvement impacting on reducing the number of errors in the predictions, as well as helping to reduce the biases committed by the model itself. Furthermore, once the efficacy in determining cognitive patterns and correctly classifying subjects diagnosed with ADHD from those who do not meet diagnostic criteria has been confirmed, it's considered if the algorithms used can be sensitive to differences in the WMI and PSI indices. We consider it would be interesting to investigate whether this difference can help experts to discriminate between the two main presentations of the disorder (combined and inattentive), as previous studies, carried out with traditional statistical algorithms with a smaller number of cases, have shown. Therefore, a future line of work would be the integration of our ML-based system into a web application for professionals with a double benefit. On the one hand, the professionals themselves would obtain a prediction and its corresponding feedback from our ADHD diagnostic support system, and on the other hand, these professionals could also incorporate anonymized data from new cases so that we could update the support system and make new predictions with greater accuracy. This application should be carried out both for WISC-IV, still widespread among clinical and educational assessment teams, and for WISC-V, the current version of the Wechsler scales. The purpose would be to provide the medical community with another way to support its diagnostic process and, just as the medical practitioner requests the completion of protocols from family members/teachers or collects direct information in consultation, he/she could also enter the direct scores from applying WISC to a subject being studied for a diagnosis of ADHD and receive feedback on the probability that this cognitive profile is associated with ADHD. Declarations Funding: The research work carried out is part of the StepTDAH-UA project (GRE21-15A) from the Vice-Rectorate of Research, Development, and Innovation of the University of Alicante.. Institutional Review Board Statement: The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee and Vice-Rectorate for Research and Knowledge Transfer of the University of Alicante (UA-2018-03-08). Informed Consent Statement: Informed consent was obtained from all subjects involved in the study. Conflicts of Interest: The authors declare no conflict of interest Availability of data and materials: The data that support the findings of this study are not openly available due to reasons of sensitivity and are available from the corresponding author upon reasonable request. The data is stored in controlled access storage at the University of Alicante Repository. References Alda, J. A., Ubiñana, M., Ferreira, E., Serrano, E., Fusté, A., & San, L. (2012). Psychic distress in parents of children newly diagnosed with ADHD. Neuropsychiatrie de l'enfance et de l'adolescence , 5 (60), S274-S275. https://doi.org/10.1016/j.neurenf.2012.04.739 American Psychiatric Association. (2013). Guía de Consulta de los Criterios Diagnósticos del DSM-5 .APP. https://doi.org/10.1176/appi.books.9780890425657 Ang, J. C., Mirzal, A., Haron, H., & Hamed, H. N. A. (2015). Supervised, unsupervised, and semi-supervised feature selection: a review on gene selection. IEEE/ACM transactions on computational biology and bioinformatics , 13(5), 971-989. https://doi.org/10.1109/TCBB.2015.2478454 Asherson, P., & Agnew‐Blais, J. (2019). Annual Research Review: Does late‐onset attention‐deficit/hyperactivity disorder exist?. Journal of Child Psychology and Psychiatry , 60 (4), 333-352. https://doi.org/10.1111/jcpp.13020 Atkeson, C., Moore, A., & Schaal, S. (1996). Locally weighted learning. AI Review . https://doi.org/10.1007/978-94-017-2053-3_2 Baloh, R. W. (2022). The Developing Brain. In Exercise and the Brain: Why Physical Exercise is Essential to Peak Cognitive Health (pp. 67-87). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-031-13924-6_4 Barkley, R. A. (1997). Behavioral inhibition, sustained attention, and executive functions: constructing a unifying theory of ADHD. Psychological bulletin , 121 (1), 65. https://doi.org/10.1037/0033-2909.121.1.65 Barkley, R. A., Murphy, K. R., & Bush, T. (2001). Time perception and reproduction in young adults with attention deficit hyperactivity disorder. Neuropsychology , 15 (3), 351.https://doi.org/10.1037/0894-4105.15.3.351 Barkley, R.A. (2005). ADHD and the Nature of Self-Control. Guilford Press Barredo Arrieta, A., Díaz-Rodríguez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., García, S., Gil-López, S., Molina, D., Benjamins, R., et al. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible ai. Information Fusion , 58:82–115. https://doi.org/10.1016/j.inffus.2019.12.012 Bélanger, S. A., Andrews, D., Gray, C., & Korczak, D. (2018). ADHD in children and youth: Part 1—Etiology, diagnosis, and comorbidity. Pediatrics & child health , 23 (7), 447-453. https://doi.org/10.1093/pch/pxy109 Benson, N. F., Floyd, R. G., Kranzler, J. H., Eckert, T. L., Fefer, S. A., & Morgan, G. B. (2019). Test use and assessment practices of school psychologists in the United States: Findings from the 2017 National Survey. Journal of School Psychology , 72 , 29-48. https://doi.org/10.1016/j.jsp.2018.12.004 Breiman, L. (2001). Random forests. Machine learning , 45(1):5–32. https://doi.org/10.1023/A:1010933404324 Breiman, L. (2017). Classification and regression trees . Routledge. https://doi.org/10.1201/9781315139470 Bustillo, M., & Servera, M. (2015). Analysis of the performance pattern of a sample of children with ADHD in the WISC-IV. Revista de Psicología Clínica con Niños y Adolescentes , 2 (2), 121-128. Cachero, C., Rico-Juan, J. R., & Macià, H. (2023). Influence of personality and modality on peer assessment evaluation perceptions using Machine Learning techniques. Expert Systems with Applications , 119150. https://doi.org/10.1016/j.eswa.2022.119150 Calub, C. A., Rapport, M. D., Friedman, L. M., & Eckrich, S. J. (2019). IQ and academic achievement in children with ADHD: The differential effects of specific cognitive functions. Journal of Psychopathology and Behavioral Assessment , 41 , 639-651. https://doi.org/10.1007/s10862-019-09728-z Chan, T. F., Golub, G. H., & LeVeque, R. J. (1983). Algorithms for computing the sample variance: Analysis and recommendations. The American Statistician , 37(3):242–247. https://doi.org/10.1080/00031305.1983.10483115 Chandrashekar, G., & Sahin, F. (2014). A survey on feature selection methods. Computers & Electrical Engineering , 40(1), 16-28. https://doi.org/10.1016/j.compeleceng.2013.11.024 Chen, T. & Guestrin, C. (2016). XGBoost: A Scalable Tree Boosting System. CoRR , abs/1603.02754. https://doi.org/10.1145/2939672.2939785 Chen, T.-H. & Chang, R.-C. (2020). Using machine learning to evaluate the influence of fintech patents: The case of Taiwan's financial industry. Journal of Computational and Applied Mathematics , page 113215. https://doi.org/10.1016/j.cam.2020.113215 Cheung, C. H., Fazier-Wood, A. C., Asherson, P., Rijsdijk, F., & Kuntsi, J. (2014). Shared cognitive impairments and aetiology in ADHD symptoms and reading difficulties. PloS one , 9 (6), e98590. https://doi.org/10.1371/journal.pone.0098590 Conners, C. K. (2008). Conners 3. North Tonawanda, NJ: MHS. Cortes, C., & Vapnik, V. (1995). Support-vector networks. Machine learning , 20 , 273-297. https://doi.org/10.1007/BF00994018 Cover, T. M. & Hart, P. E. (1967). Nearest neighbor pattern classification. Information Theory, IEEE Transactions on , 13(1):21–27. https://doi.org/10.1109/TIT.1967.1053964 Dalzochio, J., Kunst, R., Pignaton, E., Binotto, A., Sanyal, S., Favilla, J., and Barbosa, J. (2020). Machine learning and reasoning for predictive maintenance in industry 4.0: Current status and challenges. Computers in Industry , 123:103298. https://doi.org/10.1016/j.compind.2020.103298 Dhini, A., Surjandari, I., Kusumoputro, B., & Kusiak, A. (2021). Extreme learning machine–radial basis function (elm-rbf) networks for diagnosing faults in a steam turbine. Journal of Industrial and Production Engineering, (pp. 1–9). https://doi.org/10.1080/21681015.2021.1887948 Dorogush, A. V., Ershov, V., & Gulin, A. (2018). Catboost: gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363 . DuPaul, G. J., Power, T. J., Anastopoulos, A. D., & Reid, R. (1998). ADHD Rating Scale—IV: Checklists, norms, and clinical interpretation . Guilford press. https://doi.org/10.1037/t00680-000 Drucker, H., Burges, C. J., Kaufman, L., Smola, A. J., and Vapnik, V. (1997). Support vector regression machines. In Advances in neural information processing systems , pages 155–161. Duda, R. O., Hart, P. E., and Stork, D. G. (2001). Pattern Classification . Wiley, 2 edition. Dumont, M., Mar´ee, R., Wehenkel, L., and Geurts, P. (2009). Fast multi-class image annotation with random subwindows and multiple output random- ized trees. In Proc. International Conference on Computer Vision Theory and Applications (VISAPP) , volume 2, pages 196–203. Fenollar Cortés, J., González Gómez, C., García Sevilla, J., & Navarro Soria, IJ (2015). Detección de perfiles cognitivos mediante WISC-IV en niños diagnosticados de ADHD:¿Existen diferencias entre subtipos?. Revista de Psicodidáctica . https://doi.org/10.1387/RevPsicodidact.12531 Fenollar-Cortés, J., López-Pinar, C., & Watkins, M. W. (2019). Structural validity of the Spanish Wechsler Intelligence Scale for Children–Fourth Edition in a large sample of Spanish children with attention-deficit hyperactivity disorder. International Journal of School & Educational Psychology , 7 (sup1), 2-14. https://doi.org/10.1080/21683603.2018.1474820 Fernández-Jaén, A., Fernández-Mayoralas, D. M., López-Arribas, S., Pardos-Véglia, A., Muñiz-Borrega, B., García-Savaté, C., ... & Fernández-Perrone, A. L. (2012). Habilidades sociales y de liderazgo en el trastorno por déficit de atención/hiperactividad: relación con las capacidades cognitivo-atencionales. Actas Españolas de Psiquiatría , 40 (3). Fosco, W. D., Kofler, M. J., Groves, N. B., Chan, E. S., & Raiker, J. S. (2020). Which ‘working’ components of working memory aren’t working in youth with ADHD?. Journal of Abnormal Child Psychology , 48 , 647-660. https://doi.org/10.1007/s10802-020-00621-y Frank, E., Hall, M., and Pfahringer, B. (2003). Locally weighted naive bayes. In 19th Conference in Uncertainty in Artificial Intelligence , pages 249–256. Morgan Kaufmann. Frank, E., Mayo, M., and Kramer, S. (2015). Alternating model trees. In Proceedings of the 30th Annual ACM Symposium on Applied Computing , pages 871–878. ACM. https://doi.org/10.1145/2695664.2695848 Freund, Y. and Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. Journal of computer and system sciences , 55(1):119–139. https://doi.org/10.1006/jcss.1997.1504 Friedman, J. (1999). Stochastic gradient boosting. Technical report, Stanford University. Gnana, D. A. A., Balamurugan, S. A. A., & Leavline, E. J. (2016). Literature review on feature selection methods for high-dimensional data. International Journal of Computer Applications, 136(1), 9-17. https://doi.org/10.5120/ijca2016908317 Gomez, R., Vance, A., & Watson, S. D. (2016). Structure of the Wechsler Intelligence Scale for Children–Fourth Edition in a group of children with ADHD. Frontiers in Psychology , 7 , 737. https://doi.org/10.3389/fpsyg.2016.00737 Goodman, R. (1997). The Strengths and Difficulties Questionnaire: a research note. Journal of child psychology and psychiatry , 38 (5), 581-586. https://doi.org/10.1111/j.1469-7610.1997.tb01545.x Guyon, I. and Elisseeff, A. An introduction to variable and feature selection. Hinton, G. E. (1990). Connectionist learning procedures. In Machine Learning, Volume III , pages 555–610. Elsevier. Hamed, A. M., Kauer, A. J., & Stevens, H. E. (2015). Why the diagnosis of attention deficit hyperactivity disorder matters. Frontiers in psychiatry , 6 , 168. https://doi.org/10.3389/fpsyt.2015.00168 Hare, M. M., Garcia, A. M., Hart, K. C., & Graziano, P. A. (2021). Intervention response among preschoolers with ADHD: The role of emotion understanding. Journal of school psychology , 84 , 19-31. https://doi.org/10.1016/j.jsp.2020.11.001 Ho, T. K. (1998). The random subspace method for constructing decision forests. IEEE Transactions on Pattern Analysis and Machine Intelligence , 20(8):832–844. https://doi.org/10.1109/34.709601 Hoerl, A. E. and Kennard, R. W. (1970). Ridge regression: Biased estimation for nonorthogonal problems. Technometrics , 12(1):55–67. https://doi.org/10.1080/00401706.1970.10488634 Isquith, P. K., Roth, R. M., & Gioia, G. (2013). Contribution of rating scales to the assessment of executive functions. Applied Neuropsychology: Child , 2 (2), 125-132. https://doi.org/10.1080/21622965.2013.748389 Jamieson, K. and Talwalkar, A. (2016). Non-stochastic best arm identification and hyperparameter optimization. In Artificial intelligence and statistics , pages 240–248. PMLR. Javaid, A., Zghyer, F., Kim, C., Spaulding, E. M., Isakadze, N., Ding, J., Kargillis, D., Gao, Y., Rahman, F., Brown, D. E., Saria, S., Martin, S. S., Kramer, C. M., Blumenthal, R. S., and Marvel, F. A. (2022). Medicine 2032: The future of cardiovascular disease prevention with machine learning and digital health technology. American Journal of Preventive Cardiology , 12:100379. https://doi.org/10.1016/j.ajpc.2022.100379 Jiang, W., Li, Y., Du, Y., & Fan, J. (2015). Cognitive deficits feature of male with attention deficit hyperactivity disorder-based on the study of WISC-IV. J. Psychiatry , 18 , 1-6. https://doi.org/10.4172/Psychiatry.1000252 Kajka, N., & Kulik, A. (2021). The influence of metacognitive strategies on the improvement of reaction inhibition processes in children with ADHD. International Journal of Environmental Research and Public Health , 18 (3), 878. https://doi.org/10.1016/j.jksuci.2019.06.012 Ke, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., and Liu, T.-Y. (2017). lightgbm: A highly efficient gradient boosting decision tree. In Advances in Neural Information Processing Systems , pages 3149–3157. Khaire, U. M., & Dhanalakshmi, R. (2022). Stability of feature selection algorithm: A review. Journal of King Saud University-Computer and Information Sciences, 34(4), 1060-1073. https://doi.org/10.1016/j.jksuci.2019.06.012 Kibby, M. Y., Vadnais, S. A., & Jagger-Rickels, A. C. (2019). Which components of processing speed are affected in ADHD subtypes?. Child Neuropsychology , 25 (7), 964-979. https://doi.org/10.1080/09297049.2018.1556625 Kim, J. H., Kim, J. Y., Lee, J., Jeong, G. H., Lee, E., Lee, S., ... & Fusar-Poli, P. (2020). Environmental risk factors, protective factors, and peripheral biomarkers for ADHD: an umbrella review. The Lancet Psychiatry , 7 (11), 955-970. https://doi.org/10.1016/S2215-0366(20)30312-6 Kim, H., & Song, D. H. (2020). Comparison of the K-WISC-IV profiles of boys with autism spectrum disorder and attention-deficit/hyperactivity disorder. Research in Developmental Disabilities , 97 , 103539.https://doi.org/10.1016/j.ridd.2019.103539 Krane, E., & Tannock, R. (2001). WISC-III third factor indexes learning problems but not attention deficit/hyperactivity disorder. Journal of Attention Disorders , 5 (2), 69-78. https://doi.org/10.1177/108705470100500201 Krieger, V., & Amador-Campos, J. A. (2018). Assessment of executive function in ADHD adolescents: contribution of performance tests and rating scales. Child Neuropsychology , 24 (8), 1063-1087. https://doi.org/10.1080/09297049.2017.1386781 Koh, P. W. and Liang, P. (2017). Understanding black-box predictions via influence functions. In Proceedings of the 34th International Conference on Machine Learning - Volume 70 , ICML’17, pages 1885–1894. JMLR.org. Kubo, Y., Kanazawa, T., Kawabata, Y., Tanaka, K., Okayama, T., Fujta, M., ... & Yoneda, H. (2018). Comparative analysis of the WISC between two ADHD subgroups. Psychiatry investigation , 15 (2), 172. https://doi.org/10.30773/pi.2017.07.12 Kursa, M. B. and Rudnicki, W. R. (2010). Feature selection with the Boruta package. Journal of Statistical Software , 36(11):1–13. https://doi.org/10.18637/jss.v036.i11 Lavigne- Cerván, R., & Romero- Pérez, JF (2010). El ADHD:¿qué es?,¿qué lo causa?,¿cómo evaluarlo y tratarlo?. (1ª ed.) Pirámide. Li, L., Jamieson, K., DeSalvo, G., Rostamizadeh, A., and Talwalkar, A. (2017). Hyperband: A novel bandit-based approach to hyperparameter optimiza- tion. The Journal of Machine Learning Research , 18(1):6765–6816. Liang, D., Frederick, D. A., Lledo, E. E., Rosenfield, N., Berardi, V., Linstead, E., and Maoz, U. (2022). Examining the utility of nonlinear machine learn- ing approaches versus linear regression for predicting body image outcomes: The u.s. body project i. Body Image , 41:32–45. https://doi.org/10.1016/j.bodyim.2022.01.013 Liu, B. (2020). New technology application in logistics industry based on machine learning and embedded network. Microprocessors and Microsystems , page 103596. https://doi.org/10.1016/j.micpro.2020.103484 Liu, D., Fox, K., Weber, G., and Miller, T. (2022). Confederated learning in healthcare: Training machine learning models using disconnected data separated by individual, data type and identity for large-scale health system intelligence. Journal of Biomedical Informatics , 134:104151. https://doi.org/10.1016/j.jbi.2022.104151 Lundberg, S. (2019). Shap (Shapley additive explanations). https://github.com/slundberg/shap. Lundberg, S. M. and Lee, S.-I. (2017a). Consistent feature attribution for tree ensembles. arXiv preprint arXiv:1706.06060 . Lundberg, S. M. and Lee, S.-I. (2017b). A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems , pages 4765–4774. Macià Antón, D. (2012). ADHD en la infancia y la adolescencia: concepto, evaluación y tratamiento. (1ª ed.). Pirámide. MacKay, D. J. (1992). Bayesian interpolation. Neural computation , 4(3):415– 447. https://doi.org/10.1162/neco.1992.4.3.415 Marshall, P., Hoelzle, J., & Nikolas, M. (2021). Diagnosing Attention-Deficit/Hyperactivity Disorder (ADHD) in young adults: A qualitative review of the utility of assessment measures and recommendations for improving the diagnostic process. The Clinical Neuropsychologist , 35 (1), 165-198. https://doi.org/10.1080/13854046.2019.1696409 Matsuura, N., Ishitobi, M., Arai, S., Kawamura, K., Asano, M., Inohara, K., ... & Kosaka, H. (2014). Distinguishing between autism spectrum disorder and attention deficit hyperactivity disorder by using behavioral checklists, cognitive assessments, and neuropsychological test battery. Asian journal of psychiatry , 12 , 50-57. https://doi.org/10.1016/j.ajp.2014.06.011 Mayes, S. D., & Calhoun, S. L. (2004). Similarities and differences in Wechsler Intelligence Scale for Children—Third Edition (WISC-III) profiles: Support for subtest analysis in clinical referrals. The Clinical Neuropsychologist , 18 (4), 559-572. https://doi.org/10.1080/13854040490888530 Mayes, S. D., & Calhoun, S. L. (2006). WISC-IV and WISC-III profiles in children with ADHD. Journal of Attention Disorders , 9 (3), 486-493. https://doi.org/10.1177/1087054705283616 Mehta, T. R., Monegro, A., Nene, Y., Fayyaz, M., & Bollu, P. C. (2019). Neurobiology of ADHD: a review. Current Developmental Disorders Reports , 6 , 235-240. https://doi.org/10.1007/s40474-019-00182-w Miller, M., Iosif, A. M., Young, G. S., Hill, M. M., & Ozonoff, S. (2018). Early detection of ADHD: Insights from infant siblings of children with autism. Journal of Clinical Child & Adolescent Psychology , 47 (5), 737-744. https://doi.org/10.1080/15374416.2016.1220314 Moura, O., Costa, P., & Simões, M. R. (2019). WISC-III cognitive profiles in children with ADHD: specific cognitive impairments and diagnostic utility. The Journal of general psychology , 146 (3), 258-282. https://doi.org/10.1080/00221309.2018.1561410 Muñiz, J., Hernández, A., & Fernández-Hermida, J. R. (2020). Utilización de los test en España: El punto de vista de los psicólogos. Papeles del Psicólogo , 41 (1), 1-15. https://doi.org/10.23923/pap.psicol2020.2921 Navarro-Soria, I., Fenollar, J., Carbonell, J., & Real, M. (2020). Memoria de trabajo y velocidad de procesamiento evaluado mediante WISC-IV como claves en la evaluación del ADHD. https://doi.org/10.21134/rpcna.2020.07.1.3 Narciso, D. A. and Martins, F. (2020). Application of machine learning tools for energy efficiency in industry: A review. Energy Reports , 6:1181–1199. https://doi.org/10.1016/j.egyr.2020.04.035 Oakland, T., & Hu, S. (1992). The top 10 tests used with children and youth worldwide. Bulletin of the International Test Commission , 19 (1), 99-120. Oakland, T., Iliescu, D., Chen, H. Y., & Chen, J. H. (2013). Cross-national assessment of adaptive behavior in three countries. Journal of Psychoeducational Assessment , 31 (5), 435-447. https://doi.org/10.1177/0734282912469492 Oakland, T., Douglas, S., & Kane, H. (2016). Top ten standardized tests used internationally with children and youth by school psychologists in 64 countries: A 24-year follow-up study. Journal of Psychoeducational Assessment , 34 (2), 166-176. https://doi.org/10.1177/0734282915595303 Parke, E. M., Thaler, N. S., Etcoff, L. M., & Allen, D. N. (2020). Intellectual profiles in children with ADHD and comorbid learning and motor disorders. Journal of attention disorders , 24 (9), 1227-1236. https://doi.org/10.1177/1087054715576343 Pedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. Journal of Machine Learning Research , 12:2825–2830. Polanczyk, G. V., Willcutt, E. G., Salum, G. A., Kieling, C., & Rohde, L. A. (2014). ADHD prevalence estimates across three decades: an updated systematic review and meta-regression analysis. International journal of epidemiology , 43 (2), 434-442. https://doi.org/10.1093/ije/dyt261 Quinlan, R. J. (1992). Learning with continuous classes. In 5th Australian Joint Conference on Artificial Intelligence , pages 343–348, Singapore. World Scientific. Rappaport, G. C., Ornoy, A., & Tenenbaum, A. (1998). Is early intervention effective in preventing ADHD?. Israel Journal of Psychiatry , 35 (4), 271. Reid, R., & Maag, J. W. (1994). How many fidgets in a pretty much: A critique of behavior rating scales for identifying students with ADHD. Journal of School Psychology , 32 (4), 339-354. https://doi.org/10.1016/0022-4405(94)90032-9 Rico-Juan, J. R., Gallego, A.-J., & Calvo-Zaragoza, J. (2019). Automatic detection of inconsistencies between numerical scores and textual feedback in peer-assessment processes with machine learning. Computers & Education,140 , 103609. https://doi.org/10.1016/j.compedu.2019.103609 Roth, A. E. (1988). The Shapley value: essays in honor of Lloyd S. Shapley . Cambridge University Press. https://doi.org/10.1017/CBO9780511528446 Servera, M., & Cardo, E. (2007). ADHD Rating Scale-IV en una muestra escolar española: datos normativos y consistencia interna para maestros, padres y madres. Rev Neurol , 45 (7), 393-9. https://doi.org/10.33588/rn.4507.2007301 Sharma, A. (2018). Guided stochastic gradient descent algorithm for inconsistent datasets. Applied Soft Computing , 73:1068–1080. https://doi.org/10.1016/j.asoc.2018.09.038 Simon, V., Czobor, P., Bálint, S., Mészáros, A., & Bitter, I. (2009). Prevalence and correlates of adult attention-deficit hyperactivity disorder: meta-analysis. The British Journal of Psychiatry , 194 (3), 204-211. https://doi.org/10.1192/bjp.bp.107.048827 Sonuga-Barke, E. J., Koerting, J., Smith, E., McCann, D. C., & Thompson, M. (2011). Early detection and intervention for attention-deficit/hyperactivity disorder. Expert Review of Neurotherapeutics , 11 (4), 557-563. https://doi.org/10.1586/ern.11.39 Snow, J. B., & Sapp, G. L. (2000). WISC—III subtest patterns of ADHD and normal samples. Psychological Reports , 87 (3), 759-765.https://doi.org/10.2466/pr0.2000.87.3.759 Styck, K. M., & Watkins, M. W. (2017). Structural validity of the WISC-IV for students with ADHD. Journal of Attention Disorders , 21 (11), 921-928. https://doi.org/10.1177/1087054714553052 Taylor, E., Schachar, R., Thorley, G., & Wieselberg, M. (1986). Parental account of children's symptoms. British Journal of Psychiatry , 149 , 760-767. https://doi.org/10.1192/bjp.149.6.760 Toffalini, E., Buono, S., & Cornoldi, C. (2022). The structure, profile, and diagnostic significance of intelligence in children with ADHD are impressively similar to those of children with a specific learning disorder. Research in Developmental Disabilities , 129 , 104306. https://doi.org/10.1016/j.ridd.2022.104306 Thaler, N. S., Barchard, K. A., Parke, E., Jones, W. P., Etcoff, L. M., & Allen, D. N. (2015). Factor structure of the Wechsler Intelligence Scale for Children: in children with ADHD. Journal of attention disorders , 19 (12), 1013-1021. https://doi.org/10.1177/1087054712459952 Thapar, A. (2018). Discoveries on the genetics of ADHD in the 21st century: new findings and their implications. American Journal of Psychiatry , 175 (10), 943-950. https://doi.org/10.1176/appi.ajp.2018.18040383 Thomas, R., Sanders, S., Doust, J., Beller, E., & Glasziou, P. (2015). Prevalence of attention-deficit/hyperactivity disorder: a systematic review and meta-analysis. Pediatrics , 135 (4), e994-e1001. https://doi.org/10.1542/peds.2014-3482 Tibshirani, R. (1996). Regression shrinkage and selection via the lasso. Journal of the Royal Statistical Society: Series B (Methodological) , 58(1):267–288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x Ulla, Ek. Fernell, E., Westerlund, J., Holmberg, K., Olsson, P. O., & Gillberg, C. (2007). Cognitive strengths and deficits in schoolchildren with ADHD. Acta Paediatrica , 96 (5), 756-761. https://doi.org/10.1111/j.1651-2227.2007.00297.x Ullebø, A. K., Posserud, M. B., Heiervang, E., Gillberg, C., & Obel, C. (2011). Screening for the attention deficit hyperactivity disorder phenotype using the strength and difficulties questionnaire. European child & adolescent psychiatry , 20 , 451-458. https://doi.org/10.1007/s00787-011-0198-9 Walg, M., Hapfelmeier, G., El-Wahsch, D., & Prior, H. (2017). The faster internal clock in ADHD is related to lower processing speed: WISC-IV profile analyses and time estimation tasks facilitate the distinction between real ADHD and pseudo-ADHD. European child & adolescent psychiatry , 26 , 1177-1186. https://doi.org/10.1007/s00787-017-0971-5 Wanderer, S., Roessner, V., Strobel, A., & Martini, J. (2021). WISC-IV performance of children with chronic tic disorder, obsessive–compulsive disorder and attention-deficit/hyperactivity disorder: results from a German clinical study. Child and Adolescent Psychiatry and Mental Health , 15 (1), 1-13. https://doi.org/10.1186/s13034-021-00392-4 Wang, Y. & Witten, I. H. (1997). Induction of model trees for predicting continuous classes. In Poster papers of the 9th European Conference on Machine Learning . Springer. Weisberg, S. (2005). Applied linear regression , volume 528. John Wiley & Sons. https://doi.org/10.1002/0471704091 Weschler, D. (2003). Wechsler intelligence scale for children-forth edition (WISC-IV) administration and scoring manual. The Psychological Association, San Antonio . Wechsler, D. (2005). WISC-IV . TEA Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics bulletin , 1(6):80–83. https://doi.org/10.2307/3001968 Yang, H. N., Tai, Y. M., Yang, L. K., & Gau, S. S. F. (2013). Prediction of childhood ADHD symptoms to quality of life in young adults: adult ADHD and anxiety/depression as mediators. Research in developmental disabilities , 34 (10), 3168-3181. https://doi.org/10.1016/j.ridd.2013.06.011 Yang, T. X., Allen, R. J., Holmes, J., & Chan, R. C. (2017). Impaired memory for instructions in children with attention-deficit hyperactivity disorder is improved by action at presentation and recall. Frontiers in Psychology , 39. https://doi.org/10.3389/fpsyg.2017.00039 Additional Declarations No competing interests reported. Cite Share Download PDF Status: Published Journal Publication published 08 Apr, 2024 Read the published version in Applied Neuropsychology: Child → Version 1 posted You are reading this latest preprint version Research Square lets you share your work early, gain feedback from the community, and start making changes to your manuscript prior to peer review in a journal. As a division of Research Square Company, we’re committed to making research communication faster, fairer, and more useful. We do this by developing innovative software and high quality services for the global research community. Our growing team is made up of researchers and industry professionals working together to solve the most critical problems facing scientific publishing. Also discoverable on Platform About Our Team In Review Editorial Policies Advisory Board Help Center Resources Author Services Accessibility API Access RSS feed Manage Cookie Preferences © Research Square 2026 | ISSN 2693-5015 (online) Privacy Policy Terms of Service Do Not Sell My Personal Information {"props":{"pageProps":{"initialData":{"identity":"rs-3992393","acceptedTermsAndConditions":true,"allowDirectSubmit":true,"archivedVersions":[],"articleType":"Research Article","associatedPublications":[],"authors":[{"id":277347018,"identity":"1bd953fc-8d6f-49f9-821a-ad49568a7fa6","order_by":0,"name":"Ignasi Navarro-Soria","email":"","orcid":"","institution":"University of Alicante","correspondingAuthor":false,"prefix":"","firstName":"Ignasi","middleName":"","lastName":"Navarro-Soria","suffix":""},{"id":277347020,"identity":"6bc4a1c7-20e9-4d69-8bd4-07c9c684d1f2","order_by":1,"name":"Juan Ramón Rico-Juan","email":"","orcid":"","institution":"University of Alicante","correspondingAuthor":false,"prefix":"","firstName":"Juan","middleName":"Ramón","lastName":"Rico-Juan","suffix":""},{"id":277347021,"identity":"6aa09212-4b34-4c72-8eac-361153fda8f6","order_by":2,"name":"Rocío Juárez-Ruiz de Mier","email":"","orcid":"","institution":"University of Malaga","correspondingAuthor":false,"prefix":"","firstName":"Rocío","middleName":"Juárez-Ruiz","lastName":"de Mier","suffix":""},{"id":277347022,"identity":"dc1fa350-5e08-4578-b535-0cdc8726a907","order_by":3,"name":"Rocío Lavigne-Cerván","email":"data:image/png;base64,iVBORw0KGgoAAAANSUhEUgAAAZAAAAAyAQMAAABI0h/eAAAABlBMVEX///8AAABVwtN+AAAACXBIWXMAAA7EAAAOxAGVKw4bAAAAiklEQVRIiWNgGAWjYDACdgbGByRqYWZgNiBZC5sEaTrMm5mfVfPm2DHw8x8gUovMYTaz27zbkhkkZyQQqUWCmYcNqIWZweAGsQ4DaSnm3VbPYH+eWIeBtDDzbjvMYMBAvMPYjCXnbjvOI3GDaC3szQ8/vN1WLcffT6zDYICHRPWjYBSMglEwCvACAK97G+hbKj84AAAAAElFTkSuQmCC","orcid":"","institution":"University of Malaga","correspondingAuthor":true,"prefix":"","firstName":"Rocío","middleName":"","lastName":"Lavigne-Cerván","suffix":""}],"badges":[],"createdAt":"2024-02-27 01:39:28","currentVersionCode":1,"declarations":"","doi":"10.21203/rs.3.rs-3992393/v1","doiUrl":"https://doi.org/10.21203/rs.3.rs-3992393/v1","draftVersion":[],"editorialEvents":[{"content":"https://doi.org/10.1080/21622965.2024.2336019","type":"published","date":"2024-04-09T01:44:59+00:00"}],"editorialNote":"","failedWorkflow":false,"files":[{"id":52530486,"identity":"8f219bc1-3ff2-450e-b116-2304ebc04185","added_by":"auto","created_at":"2024-03-12 16:20:56","extension":"png","order_by":1,"title":"Figure 1","display":"","copyAsset":false,"role":"figure","size":40606,"visible":true,"origin":"","legend":"\u003cp\u003eGraphical representation of the scheme proposed.\u003c/p\u003e","description":"","filename":"1.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/d9fc5f715f4d8ebeeb395dae.png"},{"id":52527494,"identity":"12834aae-ef30-4411-9e20-3956e6ca688a","added_by":"auto","created_at":"2024-03-12 16:12:56","extension":"png","order_by":2,"title":"Figure 2","display":"","copyAsset":false,"role":"figure","size":11825,"visible":true,"origin":"","legend":"\u003cp\u003efold cross-validation scheme.\u003c/p\u003e","description":"","filename":"2.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/92379b83c74553d4de923bc0.png"},{"id":52527496,"identity":"7d713db2-c3a0-41b8-8483-eb1cdd792bd8","added_by":"auto","created_at":"2024-03-12 16:12:56","extension":"png","order_by":3,"title":"Figure 3","display":"","copyAsset":false,"role":"figure","size":41680,"visible":true,"origin":"","legend":"\u003cp\u003eResults of feature selection algorithm Boruta on dataset.\u003c/p\u003e","description":"","filename":"3.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/b7a07bd0e3abdd4657ada321.png"},{"id":52530489,"identity":"66292400-438d-4e12-965d-8f1ea7175aac","added_by":"auto","created_at":"2024-03-12 16:20:56","extension":"png","order_by":4,"title":"Figure 4","display":"","copyAsset":false,"role":"figure","size":175775,"visible":true,"origin":"","legend":"\u003cp\u003eAverage metrics results of 10-fold cross-validation. The number in parentheses represents the improvement ratio with respect to the baseline (current result / baseline result). Columns show groups of features (All and FS) and rows show metrics. The higher value is better. Baseline is marked in blue.\u003c/p\u003e","description":"","filename":"4.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/5ea4532dff6fb4d857ac58f1.png"},{"id":52527497,"identity":"397d0dc7-7c20-48b0-aaa7-db745815a411","added_by":"auto","created_at":"2024-03-12 16:12:56","extension":"png","order_by":5,"title":"Figure 5","display":"","copyAsset":false,"role":"figure","size":66009,"visible":true,"origin":"","legend":"\u003cp\u003eAverage results of Wilcoxon test of 10-fold cross validation of FS sorted by the ACC significance. Green bullets represent that the row algorithm is significantly better at 95% than the column algorithm.\u003c/p\u003e","description":"","filename":"5.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/f840813c16fe051f25758945.png"},{"id":52527501,"identity":"bc20a306-d03b-4e26-a65e-98dc824bcbdc","added_by":"auto","created_at":"2024-03-12 16:12:56","extension":"png","order_by":6,"title":"Figure 6","display":"","copyAsset":false,"role":"figure","size":51400,"visible":true,"origin":"","legend":"\u003cp\u003eGeneral scheme of the post-hoc explicability of the machine learning models\u003c/p\u003e","description":"","filename":"6.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/72d64c07e66e55c667a2a3d9.png"},{"id":52527499,"identity":"8b064e81-2697-4ed6-8c42-33531c8b6b0e","added_by":"auto","created_at":"2024-03-12 16:12:56","extension":"png","order_by":7,"title":"Figure 7","display":"","copyAsset":false,"role":"figure","size":43955,"visible":true,"origin":"","legend":"\u003cp\u003eImportance of predictors and their impact on the outcome variable (ADHD) using the eight selected features (FS). Left: general importance; right: positive or negative impact broken down by sample. Red indicates higher values and blue lower values\u003c/p\u003e","description":"","filename":"7.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/5e62293399a908d52540854f.png"},{"id":52527500,"identity":"61f0aa49-9c1c-4d48-ab91-531f7d8dc2ba","added_by":"auto","created_at":"2024-03-12 16:12:56","extension":"png","order_by":8,"title":"Figure 8","display":"","copyAsset":false,"role":"figure","size":97474,"visible":true,"origin":"","legend":"\u003cp\u003eScatter plot per predictor. Each plot indicates the individual impact sample value on the outcome. The variables are ordered by importance from left to right from top to bottom. The overall impact of each predictor is indicated in brackets. The density of the values is shown in gray at the background of each plot.\u003c/p\u003e","description":"","filename":"8.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/2bf40d891cd6ac84cb75445d.png"},{"id":52530490,"identity":"b8e2cce0-9a5d-4b62-a9ed-a5489f28e9d5","added_by":"auto","created_at":"2024-03-12 16:20:56","extension":"png","order_by":9,"title":"Figure 9","display":"","copyAsset":false,"role":"figure","size":27564,"visible":true,"origin":"","legend":"\u003cp\u003eHorizontal bar chart showing the importance of the value of each variable with respect to the prediction. The graph (a) is a true negative and (b) is a true positive.\u003c/p\u003e","description":"","filename":"9.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/31dc1fef9156486ccc4927c4.png"},{"id":52527503,"identity":"adb9110f-a583-4938-8108-10cd2d417f68","added_by":"auto","created_at":"2024-03-12 16:12:57","extension":"png","order_by":10,"title":"Figure 10","display":"","copyAsset":false,"role":"figure","size":29008,"visible":true,"origin":"","legend":"\u003cp\u003eHorizontal bar chart showing the importance of the value of each variable with respect to the prediction. The graph (a) is a false negative and (b) is a false positive.\u003c/p\u003e","description":"","filename":"10.png","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/3d5a59b1f90485df671983b4.png"},{"id":54400333,"identity":"43d517f0-0e06-4547-b4f7-bd418a9bdc8d","added_by":"auto","created_at":"2024-04-10 01:45:06","extension":"pdf","order_by":0,"title":"","display":"","copyAsset":false,"role":"manuscript-pdf","size":920970,"visible":true,"origin":"","legend":"","description":"","filename":"manuscript.pdf","url":"https://assets-eu.researchsquare.com/files/rs-3992393/v1/68a04fd1-3b7f-4cf8-8921-f6cb8f001d8f.pdf"}],"financialInterests":"No competing interests reported.","formattedTitle":"Prediction of attention deficit hyperactivity disorder based on explainable artificial intelligence","fulltext":[{"header":"1. Introduction","content":"\u003cp\u003eAttention deficit hyperactivity disorder (ADHD) is a neurodevelopmental disorder that, according to the DSM-V, begins in childhood (APA, 2013). It affects a considerable number of children between 3% and 7%, being more frequent among boys. The diagnosis is usually made around 5 to 8 years of age, coinciding with the onset of schooling (Alda et al., \u003cspan citationid=\"CR1\" class=\"CitationRef\"\u003e2012\u003c/span\u003e). Symptoms emerge in childhood, but persist into adolescence and adulthood (Simon et al., \u003cspan citationid=\"CR97\" class=\"CitationRef\"\u003e2009\u003c/span\u003e). According to the American Psychiatric Association (\u003cspan citationid=\"CR2\" class=\"CitationRef\"\u003e2013\u003c/span\u003e) ADHD comprises a persistent pattern of behaviors characterized by three types of symptoms: inattention, hyperactivity and impulsivity. Primarily, the diagnosis is based on behavioural symptoms and is made before the age of 12. For a diagnosis to take place, at least two of the following areas must be affected: home life, school or social situations (Maci\u0026aacute;, 2012). ADHD is a neurodevelopmental disorder (NDD) with a strong hypothesis of genetic variation (Thapar, \u003cspan citationid=\"CR104\" class=\"CitationRef\"\u003e2018\u003c/span\u003e) which seems to cause alterations in certain neurotransmitters (Mehta et al., \u003cspan citationid=\"CR78\" class=\"CitationRef\"\u003e2019\u003c/span\u003e), producing shortfalls in basic psychological processes such as Working Memory (WM) (Yang, Allen, Holmes, \u0026amp; Chan, \u003cspan citationid=\"CR117\" class=\"CitationRef\"\u003e2017\u003c/span\u003e; Kubo et al., \u003cspan citationid=\"CR62\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; Kibby et al., \u003cspan citationid=\"CR56\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Moura, Costa, \u0026amp; Simoes, 2019) and Processing Speed (PS) (Cheung, Fazier-Wood, Asherson, \u0026amp; Kuntsi, 2014; Fosco et al., \u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e2020\u003c/span\u003e), and directly impairing executive functioning (Barkley, \u003cspan citationid=\"CR7\" class=\"CitationRef\"\u003e1997\u003c/span\u003e, \u003cspan citationid=\"CR9\" class=\"CitationRef\"\u003e2005\u003c/span\u003e; Lavigne Cerv\u0026aacute;n \u0026amp; Romero P\u0026eacute;rez, 2010; Kim et al., \u003cspan citationid=\"CR57\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). Consequently, ADHD is fundamentally distinguished by a persistent pattern of inattention, hyperactivity, and impulsivity that significantly interferes with individual, social, academic, or occupational functioning (APA, 2013; B\u0026eacute;langer et al., \u003cspan citationid=\"CR11\" class=\"CitationRef\"\u003e2018\u003c/span\u003e). ADHD is the most common NDD in children and adolescents (Polanczyk et al., \u003cspan citationid=\"CR89\" class=\"CitationRef\"\u003e2014\u003c/span\u003e; Thomas et al., \u003cspan citationid=\"CR105\" class=\"CitationRef\"\u003e2015\u003c/span\u003e). Its characteristic symptomatology usually appears between 6\u0026ndash;12 years of age (APA, 2013; Asherson \u0026amp; Agnew-Blais, \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2019\u003c/span\u003e), a stage of evolutionary development highly sensitive to psychoeducational intervention (Baloh, \u003cspan citationid=\"CR6\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Therefore, early diagnosis and, consequently, early care are essential to ensure a favorable prognosis. (Rappaport et al., \u003cspan citationid=\"CR91\" class=\"CitationRef\"\u003e1998\u003c/span\u003e; Sonuga-Barke et al., \u003cspan citationid=\"CR98\" class=\"CitationRef\"\u003e2011\u003c/span\u003e; Hamed et al., \u003cspan citationid=\"CR45\" class=\"CitationRef\"\u003e2015\u003c/span\u003e; Miller et al., \u003cspan citationid=\"CR79\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; Hare et al., \u003cspan citationid=\"CR46\" class=\"CitationRef\"\u003e2021\u003c/span\u003e; Kajka \u0026amp; Kulik, \u003cspan citationid=\"CR53\" class=\"CitationRef\"\u003e2021\u003c/span\u003e). However, ADHD is a NDD that entails a complex assessment (Navarro-Soria et al., \u003cspan citationid=\"CR82\" class=\"CitationRef\"\u003e2020\u003c/span\u003e), with a diagnostic evaluation that generally combines the implementation of direct observation in natural environments, semi-structured interviews and parent and teacher observation scales with the use of psychometric tests (Reid \u0026amp; Maag, \u003cspan citationid=\"CR92\" class=\"CitationRef\"\u003e1994\u003c/span\u003e; Isquith et al., \u003cspan citationid=\"CR49\" class=\"CitationRef\"\u003e2013\u003c/span\u003e; Krieger \u0026amp; Amador-Campos, \u003cspan citationid=\"CR60\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; Marshall et al., \u003cspan citationid=\"CR74\" class=\"CitationRef\"\u003e2021\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eThe \u003cem\u003eWechsler Intelligence Scale for Children\u003c/em\u003e (WISC) proposes the most widely used psychometric test in the world (Oakland \u0026amp; Hu, \u003cspan citationid=\"CR84\" class=\"CitationRef\"\u003e1992\u003c/span\u003e; Oakland et al., \u003cspan citationid=\"CR85\" class=\"CitationRef\"\u003e2013\u003c/span\u003e; Oakland et al., \u003cspan citationid=\"CR86\" class=\"CitationRef\"\u003e2016\u003c/span\u003e; Benson et al., \u003cspan citationid=\"CR12\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Mu\u0026ntilde;iz et al., \u003cspan citationid=\"CR81\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). It continues to be one of the most widely used instruments within the common batteries for the clinical assessment of ADHD (Bustillo \u0026amp; Servera, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2015\u003c/span\u003e; Calub et al., \u003cspan citationid=\"CR17\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Fenollar-Cort\u0026eacute;s et al., 2015; Fenollar-Cort\u0026eacute;s et al., \u003cspan citationid=\"CR34\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Fern\u0026aacute;ndez-Ja\u0026eacute;n et al., \u003cspan citationid=\"CR35\" class=\"CitationRef\"\u003e2012\u003c/span\u003e; Gomez et al., \u003cspan citationid=\"CR42\" class=\"CitationRef\"\u003e2016\u003c/span\u003e; Jiang et al., \u003cspan citationid=\"CR3\" class=\"CitationRef\"\u003e2015\u003c/span\u003e; Kim \u0026amp; Song, \u003cspan citationid=\"CR58\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Krane \u0026amp; Tannock, \u003cspan citationid=\"CR59\" class=\"CitationRef\"\u003e2001\u003c/span\u003e; Matsuura et al., \u003cspan citationid=\"CR75\" class=\"CitationRef\"\u003e2014\u003c/span\u003e; Mayes \u0026amp; Calhoun, \u003cspan citationid=\"CR76\" class=\"CitationRef\"\u003e2004\u003c/span\u003e; Mayes \u0026amp; Calhoun, \u003cspan citationid=\"CR77\" class=\"CitationRef\"\u003e2006\u003c/span\u003e; Navarro-Soria et al., \u003cspan citationid=\"CR82\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Snow \u0026amp; Sapp, \u003cspan citationid=\"CR99\" class=\"CitationRef\"\u003e2000\u003c/span\u003e; Styck \u0026amp; Watkins, \u003cspan citationid=\"CR100\" class=\"CitationRef\"\u003e2017\u003c/span\u003e; Parke et al., \u003cspan citationid=\"CR87\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Thaler et al., \u003cspan citationid=\"CR103\" class=\"CitationRef\"\u003e2015\u003c/span\u003e; Ulla et al., \u003cspan citationid=\"CR107\" class=\"CitationRef\"\u003e2007\u003c/span\u003e; Walg et al., \u003cspan citationid=\"CR109\" class=\"CitationRef\"\u003e2017\u003c/span\u003e; Wanderer et al., \u003cspan citationid=\"CR110\" class=\"CitationRef\"\u003e2021\u003c/span\u003e). Specifically, its fourth edition (WISC-IV; Wechsler, 2003) has been a notable help in the diagnostic evaluation of ADHD, due to the particular configuration of its different indexes. The first two subscales of the WISC-IV, Verbal Comprehension (VC) and Perceptual Reasoning (PR), do not seem to correspond to variables affected in ADHD, however, WM and PS do relate to altered basic psychological processes (Yang, Allen, Homes, \u0026amp; Chan, 2017; Kubo et al., \u003cspan citationid=\"CR62\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; Kibby et al., \u003cspan citationid=\"CR56\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Moura, Costa, \u0026amp; Simoes, 2019; Cheung, Fazier-Wood, Asherson, \u0026amp; Kuntsi, 2014; Fosco et al., \u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e2020\u003c/span\u003e). The combination of the WISC-IV and ADHD seems to reveal a response pattern characterized by higher scores in VC and PR, and lower scores in WM and PS (Barkley, Murphy, \u0026amp; Bush, \u003cspan citationid=\"CR8\" class=\"CitationRef\"\u003e2001\u003c/span\u003e; Mayes \u0026amp; Calhoun, \u003cspan citationid=\"CR77\" class=\"CitationRef\"\u003e2006\u003c/span\u003e; Arribas, Santamar\u0026iacute;a, Sueiro, \u0026amp; Pere\u0026ntilde;a, 2011; Bustillo \u0026amp; Servera, \u003cspan citationid=\"CR15\" class=\"CitationRef\"\u003e2015\u003c/span\u003e; Corral, Yang et al., \u003cspan citationid=\"CR116\" class=\"CitationRef\"\u003e2013\u003c/span\u003e; Toffalini et al., \u003cspan citationid=\"CR102\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). Thus, the very structure of the WISC-IV allows to observe in ADHD what some authors call a \"cognitive step\", i.e., stable scores in VC and PR, with sharp score drops in WM and PS. (Fenollar-Cort\u0026eacute;s et al., 2015; Navarro-Soria et al., \u003cspan citationid=\"CR82\" class=\"CitationRef\"\u003e2020\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eOn the other hand, there are algorithms based on machine learning (ML), which is an area of Artificial Intelligence (AI) used to detect patterns in data and make predictions (supervised learning) automatically. These systems require two steps: the first is that they use the algorithms to train models from data; and the second is that the trained model is used to make predictions on unknown data. More and more ML is being applied to problems in different domains such as education (Rico-Juan et al., \u003cspan citationid=\"CR93\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Cachero, et al., \u003cspan citationid=\"CR16\" class=\"CitationRef\"\u003e2023\u003c/span\u003e), industries (Dalzochio et al., \u003cspan citationid=\"CR26\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Liu, \u003cspan citationid=\"CR67\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Dhini et al., \u003cspan citationid=\"CR27\" class=\"CitationRef\"\u003e2021\u003c/span\u003e) and of course healthcare (Javaid et al., \u003cspan citationid=\"CR51\" class=\"CitationRef\"\u003e2022\u003c/span\u003e; Lui et al., 2022). The main reason why ML algorithms were not used so far in data analysis is due to their difficult interpretation in most cases. A colloquial term often used to refer to these systems is black boxes (Koh and Liang, \u003cspan citationid=\"CR61\" class=\"CitationRef\"\u003e2017\u003c/span\u003e).\u003c/p\u003e \u003cp\u003eThe algorithms have different learning styles as neighborhood-based, decision trees, support vector machines or neural networks with algorithms capable of inferring relationships between variables, both linear and nonlinear relationships. This feature makes the resulting models better suited to data distributions than purely linear models (Liang et al., \u003cspan citationid=\"CR66\" class=\"CitationRef\"\u003e2022\u003c/span\u003e). The usual practice is to try different algorithms for a problem and choose the one or ones that obtain the best results to make the predictions.\u003c/p\u003e \u003cp\u003eIn addition, recent advances in the area known as eXplicable Artificial Intelligence (XAI) (Barredo Arrieta et al., \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2020\u003c/span\u003e) make it easier for us to extract some useful and understandable information from ML models. Specifically, post-hoc techniques offer a unified approach to explicate the predictions made by an ML model. SHAP (SHapley Additive exPlanations) (Lundberg, \u003cspan citationid=\"CR69\" class=\"CitationRef\"\u003e2019\u003c/span\u003e) is an example of a tool whose calculations are based on game theory and which allows us to know the importance of variables at a general level, by groups or at an individual level.\u003c/p\u003e \u003cp\u003eTaking into account the above, the contributions of this research are: (1) to apply ML-based techniques to the test results of 694 cases to improve the detection of cognitive patterns of ADHD, (2) to offer new relationships between the variables obtained in the diagnostic process for use in an automated diagnostic aid system that includes feedback based on the explainability of ML systems. The intention is that this system will be part of a web application in the future to help psychologists not specialized in the detection of ADHD, as well as to cross-check the diagnoses of specialists.\u003c/p\u003e"},{"header":"2. Research method","content":"\u003cdiv id=\"Sec3\" class=\"Section2\"\u003e \u003ch2\u003e2.1. Objectives and context definition\u003c/h2\u003e \u003cp\u003eCognitive patterns associated with different neurobiological disorders such as ADHD -among others-, have been intensively studied during the last two decades with the help of traditional statistical tools (Asherson \u0026amp; Agnew-Blais, \u003cspan citationid=\"CR4\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Fenollar et al., 2015; Fosco et al., \u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Navarro et al., 2020). These methodologies mainly analyze and contrast linear relationships between variables whose behaviors are usually assumed to be normal, which is a limitation both for making predictions and for explaining their relationships. The current proposal is oriented to the application of ML algorithms to alleviate these limitations. These algorithms have the ability to learn or adjust value structures in a first training phase to create a predictive model capable of relating the different input variables, whether these are linear or not. Moreover, this learning can be performed on any type of data distribution, which can help us to better explain the results, obtaining even greater reliability than those achieved so far with traditional statistical tools. In addition, recent advances in ML explainability (Barredo Arrieta et al., \u003cspan citationid=\"CR10\" class=\"CitationRef\"\u003e2020\u003c/span\u003e) now allow predictions to be explained with post-hoc agnostic models.\u003c/p\u003e \u003cp\u003eThis, given the complexity of the diagnostic processes, especially in children and adolescents, would provide a useful tool for professionals to obtain valid information to aid diagnostic decisions. For this reason, we propose (see Fig.\u0026nbsp;\u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e):\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003e(1) Apply ML-based techniques as a strategy to improve the detection of cognitive patterns associated with ADHD in children aged 6 to 16 years,\u003c/p\u003e \u003cp\u003e(2) To have an automated diagnostic aid system that includes feedback based on the explainability of the systems. This system will be based on the significant relationships between the scores obtained in the WISC-IV of boys and girls ages 6 to 16 years of age.\u003c/p\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig1\" class=\"InternalRef\"\u003e1\u003c/span\u003e represents in a visual way the application of our methodology where the group of experts would be in charge of applying the psychometric tests (WISC IV) to the subjects, obtaining the verified results and updating the database. In addition, updating the diagnostic aid system.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec4\" class=\"Section2\"\u003e \u003ch2\u003e2.1. Participants\u003c/h2\u003e \u003cp\u003eThe recruited sample consisted of 406 subjects aged 6\u0026ndash;16 years, male and female, derived from child and adolescent mental health units in the province of Alicante (Valencian Community), the Region of Murcia and the province of Malaga (Andalusia).\u003c/p\u003e \u003cp\u003eAs shown in Table\u0026nbsp;\u003cspan refid=\"Tab1\" class=\"InternalRef\"\u003e1\u003c/span\u003e, of the subjects in the clinical group (ADHD), 326 were male (80.3%) and the mean age was 9.4 (\u003cem\u003eSD\u003c/em\u003e\u0026thinsp;=\u0026thinsp;2.2). The inclusion criteria were as follows: having been previously diagnosed with ADHD by a clinician, having an IQ equal to or higher than 80 and not observing significant symptoms of severe mental disorders or severe medical illnesses. Subjects who did not meet these criteria were excluded from the sample. In all cases, the legal guardians of the minors signed the informed consent to participate in the current research.\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab1\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 1\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eBasic statistics on the data collected. N, M and SD are the number of cases, the mean and the standard deviation, respectively. ADHD \u0026lsquo;No\u0026rsquo;, refers to the control group.\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"5\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cdiv align=\"char\" char=\".\" class=\"colspec\" colname=\"c5\" colnum=\"5\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e \u003cp\u003eADHD\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e \u003cp\u003eGender\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c3\"\u003e \u003cp\u003eN\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c4\"\u003e \u003cp\u003eM\u003c/p\u003e \u003c/th\u003e \u003cth align=\"left\" colname=\"c5\"\u003e \u003cp\u003eSD\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\" morerows=\"1\" rowspan=\"2\"\u003e \u003cp\u003eNo\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eMale\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e195\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e9.7\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e2.7\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eFemale\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e93\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e9.4\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e2.5\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\" morerows=\"1\" rowspan=\"2\"\u003e \u003cp\u003eYes\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eMale\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e326\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e8.7\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e2.2\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eFemale\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c3\"\u003e \u003cp\u003e80\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c4\"\u003e \u003cp\u003e9.0\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"char\" char=\".\" colname=\"c5\"\u003e \u003cp\u003e2.0\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eOn the other hand, the control group consisted of 288 subjects collected from public and private schools in the urban areas of Alicante, Murcia, Marbella and Malaga, made up of children of both sexes and ages between 6 and 16, of whom 195 were boys (67.7%) and the average age of the control group was 9.5 years old. (\u003cem\u003eSD\u003c/em\u003e\u0026thinsp;=\u0026thinsp;2.51). Furthermore, all participants in the control group had to meet the same inclusion criteria as the clinical group (with the exception of a prior diagnosis of ADHD) and obtain scores on the \u003cem\u003eStrengths and Difficulties Questionnaires\u003c/em\u003e (SDQ) for Parents and Teachers (Goodman, \u003cspan citationid=\"CR43\" class=\"CitationRef\"\u003e1997\u003c/span\u003e), in both informants, below the clinically significant lower limit on any of its subindices.\u003c/p\u003e \u003cp\u003eFinally, the total sample amounted to 694 subjects, including clinical and neurotypical population.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec5\" class=\"Section2\"\u003e \u003ch2\u003e2.2. Instruments\u003c/h2\u003e \u003cp\u003eDuring the process of sample selection and data collection, the following psychometric tools were used:\u003c/p\u003e \u003cp\u003eStrengths and Difficulties Questionnaire for Parents and Teachers (SDQ; Goodman, \u003cspan citationid=\"CR43\" class=\"CitationRef\"\u003e1997\u003c/span\u003e): a 25-item Likert-type short behavioural screening questionnaire. For the investigation the \u0026ldquo;hyperactivity\u0026rdquo; subscale was used with its stipulated clinical limits, except in subjects in whom the inattention dimension predominated, in which case the limit was set at a score equal to or higher than 4 (Ullebo, Posserud, Heiervang, \u0026amp; Gillberg, 2011). Subjects who did not obtain a clinically significant score, from family and the school, were excluded from the clinical sample.\u003c/p\u003e \u003cp\u003eADHD-Rating Scale-IV (For Children and Adolescents) (RS; DuPaul, Power, \u0026amp; Anastopoulos, 1998): An 18-item Likert-type scale based on the DSM-IV diagnostic criteria for ADHD, which measures the dimensions \"inattention\" and \"impulsivity/hyperactivity\", both in the home and school environment. The Spanish version was used (Servera \u0026amp; Card\u0026oacute;, 2007), considering a score of \u0026ge;\u0026thinsp;P90 in each dimension as clinically significant.\u003c/p\u003e \u003cp\u003eConners 3rd edition for parents and teachers (spanish version) (Conners, \u003cspan citationid=\"CR23\" class=\"CitationRef\"\u003e2008\u003c/span\u003e): Likert type scale of 45 (parents version) and 41 items (teachers version) that generates 5 indices: \"Inattention\", \"Hyperactivity/Impulsivity\", \"Learning Difficulties/Executive Functions\", \"Aggressiveness/Defiance\" and \"Relations with peers\". For this research, the score cut-off point was set as T\u0026thinsp;\u0026ge;\u0026thinsp;65 for the index to be considered clinically significant.\u003c/p\u003e \u003cp\u003eThe Parental Account of Childhood Symptoms (PACS) (Taylor, Schachar, Thorley, \u0026amp; Wieselberg, \u003cspan citationid=\"CR101\" class=\"CitationRef\"\u003e1986\u003c/span\u003e): a member of the research team conducted a semi-structured interview with the family to verify that the subject met the ADHD criteria of the DSM-V (APA, 2013) as well as to review the clinical history and corroborate that the inclusion criteria of the sample design were met..\u003c/p\u003e \u003cp\u003eWechsler Intelligence Scale for Children-Fourth Edition (WISC-IV; Wechsler, \u003cspan citationid=\"CR114\" class=\"CitationRef\"\u003e2005\u003c/span\u003e): scale composed of 15 tests for children between 6\u0026ndash;16 years of age. The Spanish version was used for this study. The result of each of the tests is related to the level of cognitive development in a series of intellectual competencies the results from which 5 indexes are obtained: Verbal Comprehension (VC), Perceptual Reasoning (PR), Working Memory (WM), Processing Speed (PS), Total Intellectual Quotient (FSIQ). In addition, we found 2 composite indices: the Index of Cognitive Competence (ICC) resulting from the grouping of WM-PS and the Index of General Ability (GAI) resulting from the VC-PR grouping. This scale has demonstrated the adequacy of reliability and validity coefficients in both the American and Spanish versions. Therefore, the reliability of the WISC-IV in its Spanish adaptation, presents coefficients in the range of 0.86 (IVP) to 0.95 (FSIQ) for the split-half method, which are similar to the coefficients found in the American version, demonstrating the high consistency and stability of the test.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec6\" class=\"Section2\"\u003e \u003ch2\u003e2.3. Experimental planning\u003c/h2\u003e \u003cp\u003eIn order to carry out the current research, child and adolescent mental health units with neuropediatrics or psychiatry and psychology services were contacted in the provinces of Malaga, Alicante and Murcia. Researchers from the task force presented them with the aims of the research and the request to recruit data on boys and girls aged 6 to 16 years with a diagnosis of ADHD having previously been made by a clinician, having an IQ equal to or higher than 80 and not observing significant symptoms of severe mental disorders or severe medical illnesses. Previously, the legal guardians of each of the participants had to sign an informed consent form. The data collection was carried out by three administrators (psychologists trained specifically for this purpose), one from each province, who used the same assessment protocol that included the tools described above.\u003c/p\u003e \u003cp\u003e Regarding the participants of the control group, contact was made with various educational guidance departments of public and private schools in the urban areas of Alicante, Murcia, Marbella and Malaga. The objectives of the study were presented to them and authorization was requested to obtain data on boys and girls between 6 and 16 years of age, with normotypical development, after the signing of the informed consent by the legal guardians of each minor. This research was carried out in accordance with the 1964 Declaration of Helsinki and its subsequent modifications. Likewise, approval was requested from the Ethics Committee of the University of Alicante, which provides and approves the methodology used, assigning the approval the file number UA-2018-03-08. The collection of the control group data was carried out by two researchers of the team, collecting data in parallel in the areas of Alicante and Murcia, on the one hand, and in Marbella and Malaga, on the other.\u003c/p\u003e \u003c/div\u003e"},{"header":"3. Data analysis","content":"\u003cp\u003eFor the experimentation we have considered a validation scheme for the training of ML algorithms, we have selected a series of metrics applied to this type of studies to evaluate the results, the selection of a subset of representative variables with respect to the total to concentrate the study on these variables, and finally, we have added an explanatory part on the predictions.\u003c/p\u003e \u003cdiv id=\"Sec8\" class=\"Section2\"\u003e \u003ch2\u003e3.1. Machine learning approach\u003c/h2\u003e \u003cp\u003eVarious learning styles of ML algorithms were tested to study which algorithms perform better in their predictions in order to explain more accurately the relationships between independent and dependent variables. The chosen styles can be grouped into neighborhood-based, decision trees, support vector machines, Bayesian models and neural networks.\u003c/p\u003e \u003cp\u003eIn the following we will briefly detail what each of the selected algorithms consists of:\u003c/p\u003e \u003cp\u003e \u003cul\u003e \u003cli\u003e \u003cp\u003eBaseline: This is the baseline model and follows a uniform distribution of the data whose prediction is around 50% for a binary classification (our study). Its behavior is like that of a random system.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eDecision tree (Breiman et al., 2017): This model follows a hierarchical structure. This structure is built from a data set by selecting the most relevant variable and applying a cutoff value creating two subsets of samples. This same scheme is repeated for the subsets generated until a finish condition is satisfied.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eRandom forest (Breiman, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2001\u003c/span\u003e): This algorithm builds several decision trees in order to combine their predictions into one. The combination of several predictions leads to a more robust behavior.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eAdaBoost (Adaptive Boosting) (Freund \u0026amp; Schapire, \u003cspan citationid=\"CR39\" class=\"CitationRef\"\u003e1997\u003c/span\u003e): This algorithm is based on the use of different linear regressors. The final prediction is calculated from the regressors weighted according to the confidence learned by the algorithm in the training phase.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eeXtreme Gradient Boost [XGBoost] (T. Chen \u0026amp; Guestrin, \u003cspan citationid=\"CR20\" class=\"CitationRef\"\u003e2016\u003c/span\u003e), gradient boosting with categorical features support [CatBoost] (Dorogush et al., \u003cspan citationid=\"CR28\" class=\"CitationRef\"\u003e2018\u003c/span\u003e) and Light Gradient Boosting Machine [LightGBM] (Ke et al., \u003cspan citationid=\"CR54\" class=\"CitationRef\"\u003e2017\u003c/span\u003e): These algorithms use the boosting technique combined with the creation of several decision trees, where the optimization processes use derivable cost functions along with weight updating with gradient descent (as in neural networks). Each algorithm applies different types of optimizers and all of them have performed well in open challenges.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eSupport vector machines [SVM] (Cortes \u0026amp; Vapnik, \u003cspan citationid=\"CR24\" class=\"CitationRef\"\u003e1995\u003c/span\u003e): These algorithms are divided into two stages. In the first phase, the original data space is transformed to another space, usually of higher dimension. In a second phase, a linear hyperplane is sought that separates and distances as much as possible the classes in the resulting space.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eMultilayer perceptron [MLP] (Hinton, 1989): This type of neural network is basic but at the same time widely used its basic feature is that all neurons of consecutive layers are connected to each other.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eGaussian Naive Bayes [Gaussian NB] (Chan et al., \u003cspan citationid=\"CR18\" class=\"CitationRef\"\u003e1983\u003c/span\u003e): This model is based on the Bayes model, uses conditional probabilities and assumes independence among the variables in the model.\u003c/p\u003e \u003c/li\u003e \u003cli\u003e \u003cp\u003eNearest neighbors [NN] (Cover \u0026amp; Hart, \u003cspan citationid=\"CR25\" class=\"CitationRef\"\u003e1967\u003c/span\u003e): The class of a new sample is calculated based on the k (parameter) nearest training samples and their associated classes. The proximity function used is the Euclidean distance, and the parameter k was set to 1, 3, 5 and 7.\u003c/p\u003e \u003c/li\u003e \u003c/ul\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec9\" class=\"Section2\"\u003e \u003ch2\u003e3.2. Validation and metrics\u003c/h2\u003e \u003cp\u003eIn order to know the goodness of prediction of a model, a validation technique is applied. In our case we will use the well-known k-fold cross-validation (Fig.\u0026nbsp;\u003cspan refid=\"Fig2\" class=\"InternalRef\"\u003e2\u003c/span\u003e) which consists of creating k disjoint partitions of similar size, leaving one as a test and using the rest as a training set, iterate k times obtaining k different models and calculating their mean as the final result. The usual value to set k is 10.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eLet's start by defining the type of hits and misses that occur when we classify samples in a binary or dichotomous problem such as ours (has or does not have ADHD) (see Table\u0026nbsp;\u003cspan refid=\"Tab2\" class=\"InternalRef\"\u003e2\u003c/span\u003e).\u003c/p\u003e \u003cp\u003e \u003cdiv class=\"gridtable\"\u003e\u003ctable float=\"Yes\" id=\"Tab2\" border=\"1\"\u003e \u003ccaption language=\"En\"\u003e \u003cdiv class=\"CaptionNumber\"\u003eTable 2\u003c/div\u003e \u003cdiv class=\"CaptionContent\"\u003e \u003cp\u003eBinary classification with the four possible cases.\u003c/p\u003e \u003c/div\u003e \u003c/caption\u003e \u003ccolgroup cols=\"4\"\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c1\" colnum=\"1\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c2\" colnum=\"2\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c3\" colnum=\"3\"\u003e\u003c/div\u003e \u003cdiv align=\"left\" class=\"colspec\" colname=\"c4\" colnum=\"4\"\u003e\u003c/div\u003e \u003cthead\u003e \u003ctr\u003e \u003cth align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/th\u003e \u003cth align=\"left\" colname=\"c2\"\u003e\u0026nbsp;\u003c/th\u003e \u003cth align=\"left\" colspan=\"2\" nameend=\"c4\" namest=\"c3\"\u003e \u003cp\u003ePredicted condition\u003c/p\u003e \u003c/th\u003e \u003c/tr\u003e \u003c/thead\u003e \u003ctbody\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\"\u003e\u0026nbsp;\u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e\u0026nbsp;\u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003ePositive\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eNegative\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c1\" morerows=\"1\" rowspan=\"2\"\u003e \u003cp\u003eActual condition\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003ePositive\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eTrue positive (TP)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eFalse negative (FN)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003ctr\u003e \u003ctd align=\"left\" colname=\"c2\"\u003e \u003cp\u003eNegative\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c3\"\u003e \u003cp\u003eFalse positive (FP)\u003c/p\u003e \u003c/td\u003e \u003ctd align=\"left\" colname=\"c4\"\u003e \u003cp\u003eTrue negative (TN)\u003c/p\u003e \u003c/td\u003e \u003c/tr\u003e \u003c/tbody\u003e \u003c/colgroup\u003e \u003c/table\u003e\u003c/div\u003e \u003c/p\u003e \u003cp\u003eThe metrics we will use for this study are those commonly used. Accuracy [1] is the number of hits divided by the total number of predictions. Sensitivity [2] is the probability of being right when a sample is truly positive. Specificity [3] is the probability of being right when a sample is truly negative. Area Under the Receiver Operating Characteristic (AUC) [5] can be interpreted as the probability that a classifier will order a randomly chosen positive instance higher than a negative one.\u003c/p\u003e \u003cp\u003e \u003cspan class=\"InlineEquation\"\u003e \u003cspan class=\"mathinline\"\u003e\\(Acc =\\frac{TP + TN}{TP +FN + FP + TN}\\)\u003c/span\u003e \u003c/span\u003e [1]\u003c/p\u003e \u003cp\u003e \u003cspan class=\"InlineEquation\"\u003e \u003cspan class=\"mathinline\"\u003e\\(Sensitivity= \\frac{TP}{TP + FN}\\)\u003c/span\u003e \u003c/span\u003e [2]\u003c/p\u003e \u003cp\u003e \u003cspan class=\"InlineEquation\"\u003e \u003cspan class=\"mathinline\"\u003e\\(Specificity= \\frac{TN}{TN + FP}\\)\u003c/span\u003e \u003c/span\u003e [3]\u003c/p\u003e \u003cp\u003e \u003cspan class=\"InlineEquation\"\u003e \u003cspan class=\"mathinline\"\u003e\\(FPR= \\frac{ FP}{FP + TN}\\)\u003c/span\u003e \u003c/span\u003e [4]\u003c/p\u003e \u003cp\u003e \u003cspan class=\"InlineEquation\"\u003e \u003cspan class=\"mathinline\"\u003e\\(AUC= \\frac{1 + Sensitivity - FPR}{2}\\)\u003c/span\u003e \u003c/span\u003e [5]\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec10\" class=\"Section2\"\u003e \u003ch2\u003e3.3. Feature selection\u003c/h2\u003e \u003cp\u003eWhen conducting a study of this type, it is interesting to explore the possibility of reducing the number of input variables to the model (independent variables) while maintaining its predictive capacity. In this way, variables that are not relevant to the problem are removed, the learning of the models is facilitated since they focus on the most important information, and it is necessary to know fewer variables that the specialist must provide when predicting a new sample with the consequent saving of time and effort.\u003c/p\u003e \u003cp\u003eThere are three types of feature selection approaches: the filter, the wrapper and the embedding approach (Guyon and Elisseeff, 2003). The filter approach is based on analyzing, evaluating and assessing the importance of each characteristic separately and assessing their importance to remove those below a threshold (Chandrashekar and Sahin, \u003cspan citationid=\"CR19\" class=\"CitationRef\"\u003e2014\u003c/span\u003e). However, it ignores the dependencies between features, it is applied independently of the classifier so it lacks classifier verification (Ang at al., 2015). The wrapper approach trains a predictive model with a subset of features, and determines its validity (Gnana et al., \u003cspan citationid=\"CR41\" class=\"CitationRef\"\u003e2016\u003c/span\u003e). This approach solves the problems of the filter method although it increases the computational cost of the process. Finally, the embedding approach focuses on searching for the ideal feature subset for a particular classification algorithm (Khaire \u0026amp; Dhanalakshmi, \u003cspan citationid=\"CR55\" class=\"CitationRef\"\u003e2022\u003c/span\u003e) but it is influenced by the classifier hypothesis and is not compatible with other classifiers.\u003c/p\u003e \u003cp\u003eTherefore, our study uses an advanced wrapper-type method called Boruta (Kursa and Rudnicki, \u003cspan citationid=\"CR63\" class=\"CitationRef\"\u003e2010\u003c/span\u003e) to extract the most relevant features for use with different classifiers. It is a statistical algorithm and consists of duplicating each variable (shadow) and shuffling its values, and applying some classifier based on decision trees (e.g. Random Forest) to perform statistical tests on the results between the original variables and the shadows (artificially created and representing noise). When the results of the original variables are significantly better than a threshold computed with the shadow variables, they are identified as important. This process is repeated several times until it is determined which variables are important in most tests.\u003c/p\u003e \u003cp\u003eAs we can see in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e, eight variables are selected (colored in green): GAI - CPI, WMI, VCI, WMI - PSI, PSI, CPI, LN and PRI. Hereafter we will refer to \u003cem\u003eAll\u003c/em\u003e when using all features and \u003cem\u003eFS\u003c/em\u003e when using feature selection with these eight variables.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec11\" class=\"Section2\"\u003e \u003ch2\u003e3.4. Results\u003c/h2\u003e \u003cp\u003eTo avoid the presence of outliers, while normalizing important differences in scales between the different variables, we applied a robust scaling method (RS) (see Eq.\u0026nbsp;[6]) before proceeding to train the models.\u003c/p\u003e \u003cp\u003e \u003cspan class=\"InlineEquation\"\u003e \u003cspan class=\"mathinline\"\u003e\\(RS\\left(v\\right)= \\frac{i - Q2\\left(v\\right)}{Q3\\left(v\\right) - Q1\\left(v\\right)} :i∊v\\)\u003c/span\u003e \u003c/span\u003e [6]\u003c/p\u003e \u003cp\u003eWhere \u003cem\u003ev\u003c/em\u003e the de variable, \u003cem\u003ei\u003c/em\u003e is a value of the variable and \u003cem\u003eQ1\u003c/em\u003e, \u003cem\u003eQ2\u003c/em\u003e and \u003cem\u003eQ3\u003c/em\u003e of \u003cem\u003ev\u003c/em\u003e are the respective quantiles.\u003c/p\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig4\" class=\"InternalRef\"\u003e4\u003c/span\u003e shows the average results of the cross-validation according to the two groups of characteristics and metrics proposed. We observe the similarity of results between the best All and FS algorithms for the different metrics, since FS has a less number of features we will consider only FS for the following experiments.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eIt is also desirable to contrast the results to know if the difference between the algorithms is significant or not. For this we use the Wilcoxon paired test (Wilcoxon, \u003cspan citationid=\"CR115\" class=\"CitationRef\"\u003e1945\u003c/span\u003e) with 95% confidence which is the commonly used value.\u003c/p\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig5\" class=\"InternalRef\"\u003e5\u003c/span\u003e shows a comparison of statistical significance between algorithmic pairs according to ACC as it is one of the main metrics. It can be observed that: i) Baseline, Gaussian NB and NN get the worst results; ii) The best algorithms are based on decision trees (Random Forest, CatBoost and LGBM); Random Forest is significantly better than the rest in terms of accuracy.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eIn addition, we have also performed preliminary tests on some partitions with hyperparameter optimization algorithms (Jamieson \u0026amp; Talwalkar, \u003cspan citationid=\"CR50\" class=\"CitationRef\"\u003e2016\u003c/span\u003e; Li et al., \u003cspan citationid=\"CR65\" class=\"CitationRef\"\u003e2017\u003c/span\u003e) on the ML models that have obtained the best results such as Random Forest, CatBoost and LightGBM, since despite having a high computational cost with respect to the simple learning of a single model, sometimes this parameter adjustment is justified by the improvement in the results in the predictions. On this occasion, the differences obtained were not significant with respect to the default parameters, so we ruled out performing the complete tests with cross-validation.\u003c/p\u003e \u003cp\u003eTherefore, since Random Forest is significantly better than the rest in accuracy, and in addition, it is one of the best in all other metrics (Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e), we selected it as the algorithm with feature selection variables (FS) for explainability.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec12\" class=\"Section2\"\u003e \u003ch2\u003e3.5. Machine learning and explainability\u003c/h2\u003e \u003cp\u003eThe best predictions are usually made by the most complex models and to try to explain why they make consistent predictions there is a so-called post-hoc approach that can be applied to any ML model once trained.\u003c/p\u003e \u003cp\u003eThe first post-hoc explainability approach is based on making permutations (Breiman, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2001\u003c/span\u003e) on the values of each individual input variable (predictor) and comparing the variability directly on its predictions. This form allows us to estimate the importance of the predictors of a model that is already trained; the second approach is based on the construction of a new linear model that explains the complex model already trained.\u003c/p\u003e \u003cp\u003eBasically, this type of explainability can be tackled in two ways. One is based on using permutations on the independent variables in a controlled way (Breiman, \u003cspan citationid=\"CR13\" class=\"CitationRef\"\u003e2001\u003c/span\u003e) and measuring how they affect the dependent variable. In this way the importance of each variable can be estimated. The second approach is based on game theory and makes use of Shapley values (Roth, \u003cspan citationid=\"CR94\" class=\"CitationRef\"\u003e1988\u003c/span\u003e). The very definition of these values coincides with the objective of ML to explain the models. These values calculate the marginal contribution of each independent variable considering all possible combinations with the rest of the variables to explain the final outcome (dependent variable). In addition, this method ensures local accuracy, missingness and consistency. Figure\u0026nbsp;\u003cspan refid=\"Fig6\" class=\"InternalRef\"\u003e6\u003c/span\u003e shows the general post-hoc scheme when combined with ML. As we can see, the main prediction system is trained with a machine learning model to issue a diagnosis (decision), and in parallel the explainability system estimates the importance of the values of the input variables with respect to the decision taken. This information is used to provide feedback on the prediction process.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eShapley values are useful when we want to know the overall impact of a variable but cannot explain the impact on a group of samples or on a specific sample. Lundberg and Lee (Lundberg \u0026amp; Lee, \u003cspan citationid=\"CR70\" class=\"CitationRef\"\u003e2017a\u003c/span\u003e, Lundberg \u0026amp; Lee, \u003cspan citationid=\"CR71\" class=\"CitationRef\"\u003e2017b\u003c/span\u003e) have made these advances by solving the above problems. One tool that follows this approach is SHAP (SHapley Additive exPlanations) (Lundberg, \u003cspan citationid=\"CR69\" class=\"CitationRef\"\u003e2019\u003c/span\u003e) which allows scaling explainability at the global, group or individual level.\u003c/p\u003e \u003cp\u003eThis explainability of the ML model may be of great interest to both highly skilled and unskilled ADHD screening personnel. In the first case, it can confirm or not a diagnosis and the professional himself/herself could help to improve the model, and in the second case our system, in addition to issuing a diagnosis, guides the staff by explaining which factors are relevant to a particular diagnosis and the certainty he has in it.\u003c/p\u003e \u003c/div\u003e \u003cdiv id=\"Sec13\" class=\"Section2\"\u003e \u003ch2\u003e3.6. Importance of the model's features\u003c/h2\u003e \u003cp\u003eThe importance of the features of the model depends on the data used in the training. In our case, we use the Random Forest algorithm, according to the results obtained in the Experimentation section.\u003c/p\u003e \u003cp\u003eIn this trained model, Shapley values are calculated for each variable and their cumulative absolute values determine their importance. Figure\u0026nbsp;\u003cspan refid=\"Fig7\" class=\"InternalRef\"\u003e7\u003c/span\u003e shows the predictors of the model in order of importance.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eFigure \u003cspan refid=\"Fig8\" class=\"InternalRef\"\u003e8\u003c/span\u003e shows the individual SHAP values by coloring in red those cases diagnosed as ADHD and in blue those not diagnosed as ADHD. As we can see in most of the subfigures the relationships are not linear with the input values which makes this ML model a better fit than other linear ones. On the other hand, from this graph we can also extract which value or range of values are the most neutral (the y-axis value is zero and is represented by a dashed line) for each variable. For example, in GAI - CPI a value around 10 is neutral the higher the system tends to predict having ADHD, while if it is lower it tends to predict the opposite; in the variable WMI its neutral value is around 90 and the opposite happens with the previous variable the lower it is the more tendency to diagnose ADHD; As a special case, the variable WMI - PSI where the neutral values are around \u0026minus;\u0026thinsp;10 or +\u0026thinsp;10, between this range the tendency is to be a negative case and outside this range to be a positive case.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003c/div\u003e"},{"header":"4. Discussion","content":"\u003cp\u003eA large group of international research teams, with the aim of facilitating a complex clinical diagnosis, center their works around the detection of a cognitive pattern in infant-juvenile population presenting ADHD (Cheung, Fazier-Wood, Asherson, \u0026amp; Kuntsi, 2014; Fenollar-Cort\u0026eacute;s et al., 2015; Fosco et al., \u003cspan citationid=\"CR36\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Kubo et al., \u003cspan citationid=\"CR62\" class=\"CitationRef\"\u003e2018\u003c/span\u003e; Kibby et al., \u003cspan citationid=\"CR56\" class=\"CitationRef\"\u003e2019\u003c/span\u003e; Moura, Costa, \u0026amp; Simoes, 2019; Navarro-Soria et al., \u003cspan citationid=\"CR82\" class=\"CitationRef\"\u003e2020\u003c/span\u003e; Yang, Allen, Homes, \u0026amp; Chan, 2017). Therefore, the purpose of this work is not so much to corroborate the existence of this cognitive pattern, but to propose an effective system to support the clinician in making diagnostic decisions. In the current research, ML-based algorithms are used to determine which variables measured in the cognitive assessment by WISC-IV are important and their impact on a diagnosis of ADHD.\u003c/p\u003e \u003cp\u003eFrom the results shown in the previous section, specifically those developed in Fig.\u0026nbsp;\u003cspan refid=\"Fig3\" class=\"InternalRef\"\u003e3\u003c/span\u003e which represents feature selection, it follows that the difference between the GAI and CPI indices, and the classic WMI index, as demonstrated in previous studies, (Fenollar-Cort\u0026eacute;s et al., 2015; Mayes \u0026amp; Calhoun, \u003cspan citationid=\"CR77\" class=\"CitationRef\"\u003e2006\u003c/span\u003e; Thaler et al., \u003cspan citationid=\"CR103\" class=\"CitationRef\"\u003e2015\u003c/span\u003e), remain highly reliable predictor indices in the diagnosis of ADHD. Continuing with this figure, it is also significant the importance detected by the algorithms used in indexes such as VCI or PRI which, despite measuring cognitive skills associated with brain areas less affected by ADHD, stand out as adequate predictors. This may be due to its stability in relation to the FSIQ and to the fact that when a cognitive step is detected in the VCI and PRI indices, the pattern coincides with disorders other than ADHD, providing information to the algorithms used to discard the results as typical of the target pattern in this study. Also, the difference between the WMI and PSI indices is presented as relevant in the detection of the cognitive pattern, as stated in previous research (Navarro-Soria et al., \u003cspan citationid=\"CR82\" class=\"CitationRef\"\u003e2020\u003c/span\u003e;). Naturally, the PSI index continues to be important in detecting the cognitive pattern of ADHD, with a somewhat lower weight than WMI, as expected, since depending on whether the disorder presents as combined or inattentive, the impairment to the skills related to the PSI index will be higher or lower. The CPI composite index, stands out in third position of importance, which is calculated from the WMI and PSI indices and their component subtests that are associated with the cognitive skills most affected by ADHD. Finally, the LN subtest, which in the practical field of clinical neuropsychological assessment has always been an indicator widely taken into account to support the diagnosis of ADHD, the algorithms used highlight it as significant, but below the level of predictive ability of the other previously mentioned indexes, the other data being more robust and effective.\u003c/p\u003e \u003cp\u003eTo explain how this system would work as a diagnostic support system, the four possible cases are presented, knowing that the accuracy of the model is around 90%. In the following cases we will assume that probabilities higher than 0.5 would be diagnosed as ADHD. In practice, these graphs with the breakdown of the odds and the final probability would have to be interpreted by a specialist. The first two cases are for a true positive (TP) and a true negative (TN).\u003c/p\u003e \u003cp\u003eOn the other hand, as observed in Fig.\u0026nbsp;\u003cspan refid=\"Fig9\" class=\"InternalRef\"\u003e9\u003c/span\u003e, the algorithms chosen by the implemented IA use the 8 most effective variables to ensure a high percentage of success in classifying the subjects from the sample. In these examples, in which the IA correctly classifies, associating a specific pattern according to the importance of the profile values of a subject diagnosed with ADHD (figure b, with an estimated probability of 0.97) and discarding another with a negative diagnosis (figure a, with an estimated probability of 0.024), because it fits a pattern of normal subjects (control group) and thus allows us to check the way in which the algorithm is based to calculate the probability of diagnosis. In the first graph presented in this figure, it can be seen that the IA gives great relevance to the difference between the GAI and CPI indices, which as stated previously, is very relevant in these cases. Also considered of relevance is the CPI, the composite index that is most affected by ADHD and, of course, the WMI and PSI indexes. On the other hand, the graph in Fig.\u0026nbsp;\u003cspan refid=\"Fig9\" class=\"InternalRef\"\u003e9\u003c/span\u003eb shows that the coincidence with the expected scores for the cognitive pattern associated with ADHD, in results such as the difference between GAI and CPI, and the higher than usual scores for ADHD in the CPI and WMI indices are sufficient criteria for IA to classify the cognitive profile as non-ADHD.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e \u003cp\u003eAs we can see in Fig.\u0026nbsp;\u003cspan refid=\"Fig9\" class=\"InternalRef\"\u003e9\u003c/span\u003ea, the majority of values indicate a negative diagnosis. While in Fig.\u0026nbsp;\u003cspan refid=\"Fig9\" class=\"InternalRef\"\u003e9\u003c/span\u003eb the values indicate the opposite. The final probabilities of the diagnoses are 0.024 and 0.97 out of 1.\u003c/p\u003e \u003cp\u003eIn these cases presented as examples, it can be seen in Fig.\u0026nbsp;\u003cspan refid=\"Fig10\" class=\"InternalRef\"\u003e10\u003c/span\u003ea that, despite being a case diagnosed with ADHD with a final probability estimated by the model at 0.281 (false negative example), the cognitive step affects the scores in VCI and PRI more, which means that the difference between GAI and CPI is inverted, this being incongruent with what, as a general rule, has been observed by the IA in the database from which it learns.. This unusual profile would be a false negative and to perform a proper classification of profiles like these, the number of cases with these characteristics need to be expanded. On the other hand, in the second graph, Fig.\u0026nbsp;\u003cspan refid=\"Fig10\" class=\"InternalRef\"\u003e10\u003c/span\u003eb (false positive example), a singular case is presented which, among the professionals who evaluate using WISC, is not surprising, but it is a case of false positive, since its estimated probability is 0.628. Giftedness, which affects more or less regularly all cognitive skills, in the case of presenting with a diagnosis of ADHD may represent a significant cognitive step between the skills affected by the disorder and those not affected, obtaining scores within the normal range in the affected areas, if compared with subjects with average scores. While in the indexes measuring cognitive skills not affected by ADHD, their results are those of giftedness. The cognitive step exists, but it is only observed in comparison of the scores with the same subject, in the score resulting from the difference between GAI and CPI, the rest of the scores are within normality, despite being altered by ADHD.\u003c/p\u003e \u003cp\u003e \u003c/p\u003e"},{"header":"5. Conclusions and future works","content":"\u003cp\u003eThe results obtained in the presented study support the idea that IA can facilitate and improve the support to neuropsychological diagnosis made by medical doctors, psychologists and psychopedagogues, applying the knowledge we already have about cognitive patterns and the detailed observation of the data it obtains.\u003c/p\u003e \u003cp\u003eWe have selected 8 of the 20 initial variables as the most relevant, which in order of importance are GAI - CPI, WMI, CPI, PSI, VCI, WMI - PSI, PRI and LN, obtaining very promising results such as 0.9 hits, 0.94 of area under the ROC curve, 0.91 sensitivity and 0.92 specificity. In addition, XAI techniques allow detailing the most relevant factors of each case to facilitate the work of professionals new to this type of diagnosis or to help contrast them with those of experts.\u003c/p\u003e \u003cp\u003eAll these results have been obtained from a sample of around 700 subjects between the clinical and control groups, which from a clinical point of view is a large sample that is costly to obtain, although we consider this to be a limitation. Increasing the database with new records in both clinical and control groups will also help to increase the representativeness of the cases and thus allow the ML algorithms to improve their training, with this improvement impacting on reducing the number of errors in the predictions, as well as helping to reduce the biases committed by the model itself.\u003c/p\u003e \u003cp\u003eFurthermore, once the efficacy in determining cognitive patterns and correctly classifying subjects diagnosed with ADHD from those who do not meet diagnostic criteria has been confirmed, it's considered if the algorithms used can be sensitive to differences in the WMI and PSI indices. We consider it would be interesting to investigate whether this difference can help experts to discriminate between the two main presentations of the disorder (combined and inattentive), as previous studies, carried out with traditional statistical algorithms with a smaller number of cases, have shown.\u003c/p\u003e \u003cp\u003eTherefore, a future line of work would be the integration of our ML-based system into a web application for professionals with a double benefit. On the one hand, the professionals themselves would obtain a prediction and its corresponding feedback from our ADHD diagnostic support system, and on the other hand, these professionals could also incorporate anonymized data from new cases so that we could update the support system and make new predictions with greater accuracy. This application should be carried out both for WISC-IV, still widespread among clinical and educational assessment teams, and for WISC-V, the current version of the Wechsler scales. The purpose would be to provide the medical community with another way to support its diagnostic process and, just as the medical practitioner requests the completion of protocols from family members/teachers or collects direct information in consultation, he/she could also enter the direct scores from applying WISC to a subject being studied for a diagnosis of ADHD and receive feedback on the probability that this cognitive profile is associated with ADHD.\u003c/p\u003e"},{"header":"Declarations","content":"\u003cp\u003e\u003cstrong\u003eFunding:\u003c/strong\u003e The research work carried out is part of the StepTDAH-UA project (GRE21-15A) from the Vice-Rectorate of Research, Development, and Innovation of the University of Alicante..\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eInstitutional Review Board Statement:\u003c/strong\u003e The study was conducted in accordance with the Declaration of Helsinki and approved by the Ethics Committee and Vice-Rectorate for Research and Knowledge Transfer of the University of Alicante (UA-2018-03-08).\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eInformed Consent Statement:\u003c/strong\u003e Informed consent was obtained from all subjects involved in the study.\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eConflicts of Interest:\u003c/strong\u003e The authors declare no conflict of interest\u0026nbsp;\u003c/p\u003e\n\u003cp\u003e\u003cstrong\u003eAvailability of data and materials:\u0026nbsp;\u003c/strong\u003eThe data that support the findings of this study are not openly available due to reasons of sensitivity and are available from the corresponding author upon reasonable request. The data is stored in controlled access storage at the University of Alicante Repository.\u003cstrong\u003e\u0026nbsp;\u003c/strong\u003e\u003c/p\u003e"},{"header":"References","content":"\u003col\u003e\n\u003cli\u003eAlda, J. A., Ubi\u0026ntilde;ana, M., Ferreira, E., Serrano, E., Fust\u0026eacute;, A., \u0026amp; San, L. (2012). Psychic distress in parents of children newly diagnosed with ADHD. \u003cem\u003eNeuropsychiatrie de l\u0026apos;enfance et de l\u0026apos;adolescence\u003c/em\u003e, \u003cem\u003e5\u003c/em\u003e(60), S274-S275. https://doi.org/10.1016/j.neurenf.2012.04.739\u003c/li\u003e\n\u003cli\u003eAmerican Psychiatric Association. (2013). \u003cem\u003eGu\u0026iacute;a de Consulta de los Criterios Diagn\u0026oacute;sticos del DSM-5\u003c/em\u003e.APP. https://doi.org/10.1176/appi.books.9780890425657\u003c/li\u003e\n\u003cli\u003eAng, J. C., Mirzal, A., Haron, H., \u0026amp; Hamed, H. N. A. (2015). Supervised, unsupervised, and semi-supervised feature selection: a review on gene selection. \u003cem\u003eIEEE/ACM transactions on computational biology and bioinformatics\u003c/em\u003e, 13(5), 971-989. https://doi.org/10.1109/TCBB.2015.2478454\u003c/li\u003e\n\u003cli\u003eAsherson, P., \u0026amp; Agnew‐Blais, J. (2019). Annual Research Review: Does late‐onset attention‐deficit/hyperactivity disorder exist?. \u003cem\u003eJournal of Child Psychology and Psychiatry\u003c/em\u003e, \u003cem\u003e60\u003c/em\u003e(4), 333-352. https://doi.org/10.1111/jcpp.13020\u003c/li\u003e\n\u003cli\u003eAtkeson, C., Moore, A., \u0026amp; Schaal, S. (1996). Locally weighted learning. \u003cem\u003eAI Review\u003c/em\u003e. https://doi.org/10.1007/978-94-017-2053-3_2\u003c/li\u003e\n\u003cli\u003eBaloh, R. W. (2022). The Developing Brain. In \u003cem\u003eExercise and the Brain: Why Physical Exercise is Essential to Peak Cognitive Health\u003c/em\u003e (pp. 67-87). Cham: Springer International Publishing. https://doi.org/10.1007/978-3-031-13924-6_4\u003c/li\u003e\n\u003cli\u003eBarkley, R. A. (1997). Behavioral inhibition, sustained attention, and executive functions: constructing a unifying theory of ADHD. \u003cem\u003ePsychological bulletin\u003c/em\u003e, \u003cem\u003e121\u003c/em\u003e(1), 65. https://doi.org/10.1037/0033-2909.121.1.65\u003c/li\u003e\n\u003cli\u003eBarkley, R. A., Murphy, K. R., \u0026amp; Bush, T. (2001). Time perception and reproduction in young adults with attention deficit hyperactivity disorder. \u003cem\u003eNeuropsychology\u003c/em\u003e, \u003cem\u003e15\u003c/em\u003e(3), 351.https://doi.org/10.1037/0894-4105.15.3.351\u003c/li\u003e\n\u003cli\u003eBarkley, R.A. (2005). \u003cem\u003eADHD and the Nature of Self-Control.\u003c/em\u003e Guilford Press\u003c/li\u003e\n\u003cli\u003eBarredo Arrieta, A., D\u0026iacute;az-Rodr\u0026iacute;guez, N., Del Ser, J., Bennetot, A., Tabik, S., Barbado, A., Garc\u0026iacute;a, S., Gil-L\u0026oacute;pez, S., Molina, D., Benjamins, R., et al. (2020). Explainable artificial intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible ai. \u003cem\u003eInformation Fusion\u003c/em\u003e, 58:82\u0026ndash;115. https://doi.org/10.1016/j.inffus.2019.12.012\u003c/li\u003e\n\u003cli\u003eB\u0026eacute;langer, S. A., Andrews, D., Gray, C., \u0026amp; Korczak, D. (2018). ADHD in children and youth: Part 1\u0026mdash;Etiology, diagnosis, and comorbidity. \u003cem\u003ePediatrics \u0026amp; child health\u003c/em\u003e, \u003cem\u003e23\u003c/em\u003e(7), 447-453. https://doi.org/10.1093/pch/pxy109\u003c/li\u003e\n\u003cli\u003eBenson, N. F., Floyd, R. G., Kranzler, J. H., Eckert, T. L., Fefer, S. A., \u0026amp; Morgan, G. B. (2019). Test use and assessment practices of school psychologists in the United States: Findings from the 2017 National Survey. \u003cem\u003eJournal of School Psychology\u003c/em\u003e, \u003cem\u003e72\u003c/em\u003e, 29-48. https://doi.org/10.1016/j.jsp.2018.12.004\u003c/li\u003e\n\u003cli\u003eBreiman, L. (2001). Random forests. \u003cem\u003eMachine learning\u003c/em\u003e, 45(1):5\u0026ndash;32. https://doi.org/10.1023/A:1010933404324\u003c/li\u003e\n\u003cli\u003eBreiman, L. (2017). \u003cem\u003eClassification and regression trees\u003c/em\u003e. Routledge. https://doi.org/10.1201/9781315139470\u003c/li\u003e\n\u003cli\u003eBustillo, M., \u0026amp; Servera, M. (2015). Analysis of the performance pattern of a sample of children with ADHD in the WISC-IV. \u003cem\u003eRevista de Psicolog\u0026iacute;a Cl\u0026iacute;nica con Ni\u0026ntilde;os y Adolescentes\u003c/em\u003e, \u003cem\u003e2\u003c/em\u003e(2), 121-128.\u003c/li\u003e\n\u003cli\u003eCachero, C., Rico-Juan, J. R., \u0026amp; Maci\u0026agrave;, H. (2023). Influence of personality and modality on peer assessment evaluation perceptions using Machine Learning techniques. \u003cem\u003eExpert Systems with Applications\u003c/em\u003e, 119150. https://doi.org/10.1016/j.eswa.2022.119150\u003c/li\u003e\n\u003cli\u003eCalub, C. A., Rapport, M. D., Friedman, L. M., \u0026amp; Eckrich, S. J. (2019). IQ and academic achievement in children with ADHD: The differential effects of specific cognitive functions. \u003cem\u003eJournal of Psychopathology and Behavioral Assessment\u003c/em\u003e, \u003cem\u003e41\u003c/em\u003e, 639-651. https://doi.org/10.1007/s10862-019-09728-z\u003c/li\u003e\n\u003cli\u003eChan, T. F., Golub, G. H., \u0026amp; LeVeque, R. J. (1983). Algorithms for computing the sample variance: Analysis and recommendations. \u003cem\u003eThe American Statistician\u003c/em\u003e, 37(3):242\u0026ndash;247. https://doi.org/10.1080/00031305.1983.10483115\u003c/li\u003e\n\u003cli\u003eChandrashekar, G., \u0026amp; Sahin, F. (2014). A survey on feature selection methods. \u003cem\u003eComputers \u0026amp; Electrical Engineering\u003c/em\u003e, 40(1), 16-28. https://doi.org/10.1016/j.compeleceng.2013.11.024\u003c/li\u003e\n\u003cli\u003eChen, T. \u0026amp; Guestrin, C. (2016). XGBoost: A Scalable Tree Boosting System. \u003cem\u003eCoRR\u003c/em\u003e, abs/1603.02754. https://doi.org/10.1145/2939672.2939785\u003c/li\u003e\n\u003cli\u003eChen, T.-H. \u0026amp; Chang, R.-C. (2020). Using machine learning to evaluate the influence of fintech patents: The case of Taiwan\u0026apos;s financial industry. \u003cem\u003eJournal of Computational and Applied Mathematics\u003c/em\u003e, page 113215. https://doi.org/10.1016/j.cam.2020.113215\u003c/li\u003e\n\u003cli\u003eCheung, C. H., Fazier-Wood, A. C., Asherson, P., Rijsdijk, F., \u0026amp; Kuntsi, J. (2014). Shared cognitive impairments and aetiology in ADHD symptoms and reading difficulties. \u003cem\u003ePloS one\u003c/em\u003e, \u003cem\u003e9\u003c/em\u003e(6), e98590. https://doi.org/10.1371/journal.pone.0098590\u003c/li\u003e\n\u003cli\u003eConners, C. K. (2008). Conners 3. North Tonawanda, NJ: MHS.\u003c/li\u003e\n\u003cli\u003eCortes, C., \u0026amp; Vapnik, V. (1995). Support-vector networks. \u003cem\u003eMachine learning\u003c/em\u003e, \u003cem\u003e20\u003c/em\u003e, 273-297. https://doi.org/10.1007/BF00994018\u003c/li\u003e\n\u003cli\u003eCover, T. M. \u0026amp; Hart, P. E. (1967). Nearest neighbor pattern classification. \u003cem\u003eInformation Theory, IEEE Transactions on\u003c/em\u003e, 13(1):21\u0026ndash;27. https://doi.org/10.1109/TIT.1967.1053964\u003c/li\u003e\n\u003cli\u003eDalzochio, J., Kunst, R., Pignaton, E., Binotto, A., Sanyal, S., Favilla, J., and Barbosa, J. (2020). Machine learning and reasoning for predictive maintenance in industry 4.0: Current status and challenges. \u003cem\u003eComputers in Industry\u003c/em\u003e, 123:103298. https://doi.org/10.1016/j.compind.2020.103298\u003c/li\u003e\n\u003cli\u003eDhini, A., Surjandari, I., Kusumoputro, B., \u0026amp; Kusiak, A. (2021). Extreme learning machine\u0026ndash;radial basis function (elm-rbf) networks for diagnosing faults in a steam turbine. Journal of Industrial and Production Engineering, (pp. 1\u0026ndash;9). https://doi.org/10.1080/21681015.2021.1887948\u003c/li\u003e\n\u003cli\u003eDorogush, A. V., Ershov, V., \u0026amp; Gulin, A. (2018). Catboost: gradient boosting with categorical features support. \u003cem\u003earXiv preprint arXiv:1810.11363\u003c/em\u003e.\u003c/li\u003e\n\u003cli\u003eDuPaul, G. J., Power, T. J., Anastopoulos, A. D., \u0026amp; Reid, R. (1998). \u003cem\u003eADHD Rating Scale\u0026mdash;IV: Checklists, norms, and clinical interpretation\u003c/em\u003e. Guilford press. https://doi.org/10.1037/t00680-000\u003c/li\u003e\n\u003cli\u003eDrucker, H., Burges, C. J., Kaufman, L., Smola, A. J., and Vapnik, V. (1997). Support vector regression machines. In \u003cem\u003eAdvances in neural information processing systems\u003c/em\u003e, pages 155\u0026ndash;161.\u003c/li\u003e\n\u003cli\u003eDuda, R. O., Hart, P. E., and Stork, D. G. (2001). \u003cem\u003ePattern Classification\u003c/em\u003e. Wiley, 2 edition.\u003c/li\u003e\n\u003cli\u003eDumont, M., Mar\u0026acute;ee, R., Wehenkel, L., and Geurts, P. (2009). Fast multi-class image annotation with random subwindows and multiple output random- ized trees. In \u003cem\u003eProc. International Conference on Computer Vision Theory and Applications (VISAPP)\u003c/em\u003e, volume 2, pages 196\u0026ndash;203.\u003c/li\u003e\n\u003cli\u003eFenollar Cort\u0026eacute;s, J., Gonz\u0026aacute;lez G\u0026oacute;mez, C., Garc\u0026iacute;a Sevilla, J., \u0026amp; Navarro Soria, IJ (2015). Detecci\u0026oacute;n de perfiles cognitivos mediante WISC-IV en ni\u0026ntilde;os diagnosticados de ADHD:\u0026iquest;Existen diferencias entre subtipos?. \u003cem\u003eRevista de Psicodid\u0026aacute;ctica\u003c/em\u003e. https://doi.org/10.1387/RevPsicodidact.12531\u003c/li\u003e\n\u003cli\u003eFenollar-Cort\u0026eacute;s, J., L\u0026oacute;pez-Pinar, C., \u0026amp; Watkins, M. W. (2019). Structural validity of the Spanish Wechsler Intelligence Scale for Children\u0026ndash;Fourth Edition in a large sample of Spanish children with attention-deficit hyperactivity disorder. \u003cem\u003eInternational Journal of School \u0026amp; Educational Psychology\u003c/em\u003e, \u003cem\u003e7\u003c/em\u003e(sup1), 2-14. https://doi.org/10.1080/21683603.2018.1474820\u003c/li\u003e\n\u003cli\u003eFern\u0026aacute;ndez-Ja\u0026eacute;n, A., Fern\u0026aacute;ndez-Mayoralas, D. M., L\u0026oacute;pez-Arribas, S., Pardos-V\u0026eacute;glia, A., Mu\u0026ntilde;iz-Borrega, B., Garc\u0026iacute;a-Savat\u0026eacute;, C., ... \u0026amp; Fern\u0026aacute;ndez-Perrone, A. L. (2012). Habilidades sociales y de liderazgo en el trastorno por d\u0026eacute;ficit de atenci\u0026oacute;n/hiperactividad: relaci\u0026oacute;n con las capacidades cognitivo-atencionales. \u003cem\u003eActas Espa\u0026ntilde;olas de Psiquiatr\u0026iacute;a\u003c/em\u003e, \u003cem\u003e40\u003c/em\u003e(3).\u003c/li\u003e\n\u003cli\u003eFosco, W. D., Kofler, M. J., Groves, N. B., Chan, E. S., \u0026amp; Raiker, J. S. (2020). Which \u0026lsquo;working\u0026rsquo; components of working memory aren\u0026rsquo;t working in youth with ADHD?. \u003cem\u003eJournal of Abnormal Child Psychology\u003c/em\u003e, \u003cem\u003e48\u003c/em\u003e, 647-660. https://doi.org/10.1007/s10802-020-00621-y\u003c/li\u003e\n\u003cli\u003eFrank, E., Hall, M., and Pfahringer, B. (2003). Locally weighted naive bayes. In \u003cem\u003e19th Conference in Uncertainty in Artificial Intelligence\u003c/em\u003e, pages 249\u0026ndash;256. Morgan Kaufmann.\u003c/li\u003e\n\u003cli\u003eFrank, E., Mayo, M., and Kramer, S. (2015). Alternating model trees. In \u003cem\u003eProceedings of the 30th Annual ACM Symposium on Applied Computing\u003c/em\u003e, pages 871\u0026ndash;878. ACM. https://doi.org/10.1145/2695664.2695848\u003c/li\u003e\n\u003cli\u003eFreund, Y. and Schapire, R. E. (1997). A decision-theoretic generalization of on-line learning and an application to boosting. \u003cem\u003eJournal of computer and system sciences\u003c/em\u003e, 55(1):119\u0026ndash;139. https://doi.org/10.1006/jcss.1997.1504\u003c/li\u003e\n\u003cli\u003eFriedman, J. (1999). Stochastic gradient boosting. Technical report, Stanford University.\u003c/li\u003e\n\u003cli\u003eGnana, D. A. A., Balamurugan, S. A. A., \u0026amp; Leavline, E. J. (2016). Literature review on feature selection methods for high-dimensional data. International Journal of Computer Applications, 136(1), 9-17. https://doi.org/10.5120/ijca2016908317\u003c/li\u003e\n\u003cli\u003eGomez, R., Vance, A., \u0026amp; Watson, S. D. (2016). Structure of the Wechsler Intelligence Scale for Children\u0026ndash;Fourth Edition in a group of children with ADHD. \u003cem\u003eFrontiers in Psychology\u003c/em\u003e, \u003cem\u003e7\u003c/em\u003e, 737. https://doi.org/10.3389/fpsyg.2016.00737\u003c/li\u003e\n\u003cli\u003eGoodman, R. (1997). The Strengths and Difficulties Questionnaire: a research note. \u003cem\u003eJournal of child psychology and psychiatry\u003c/em\u003e, \u003cem\u003e38\u003c/em\u003e(5), 581-586. https://doi.org/10.1111/j.1469-7610.1997.tb01545.x\u003c/li\u003e\n\u003cli\u003eGuyon, I. and Elisseeff, A. An introduction to variable and feature selection. Hinton, G. E. (1990). Connectionist learning procedures. In \u003cem\u003eMachine Learning, Volume III\u003c/em\u003e, pages 555\u0026ndash;610. Elsevier.\u003c/li\u003e\n\u003cli\u003eHamed, A. M., Kauer, A. J., \u0026amp; Stevens, H. E. (2015). Why the diagnosis of attention deficit hyperactivity disorder matters. \u003cem\u003eFrontiers in psychiatry\u003c/em\u003e, \u003cem\u003e6\u003c/em\u003e, 168. https://doi.org/10.3389/fpsyt.2015.00168\u003c/li\u003e\n\u003cli\u003eHare, M. M., Garcia, A. M., Hart, K. C., \u0026amp; Graziano, P. A. (2021). Intervention response among preschoolers with ADHD: The role of emotion understanding. \u003cem\u003eJournal of school psychology\u003c/em\u003e, \u003cem\u003e84\u003c/em\u003e, 19-31. https://doi.org/10.1016/j.jsp.2020.11.001\u003c/li\u003e\n\u003cli\u003eHo, T. K. (1998). The random subspace method for constructing decision forests. \u003cem\u003eIEEE Transactions on Pattern Analysis and Machine Intelligence\u003c/em\u003e, 20(8):832\u0026ndash;844. https://doi.org/10.1109/34.709601\u003c/li\u003e\n\u003cli\u003eHoerl, A. E. and Kennard, R. W. (1970). Ridge regression: Biased estimation for nonorthogonal problems. \u003cem\u003eTechnometrics\u003c/em\u003e, 12(1):55\u0026ndash;67. https://doi.org/10.1080/00401706.1970.10488634\u003c/li\u003e\n\u003cli\u003eIsquith, P. K., Roth, R. M., \u0026amp; Gioia, G. (2013). Contribution of rating scales to the assessment of executive functions. \u003cem\u003eApplied Neuropsychology: Child\u003c/em\u003e, \u003cem\u003e2\u003c/em\u003e(2), 125-132. https://doi.org/10.1080/21622965.2013.748389\u003c/li\u003e\n\u003cli\u003eJamieson, K. and Talwalkar, A. (2016). Non-stochastic best arm identification and hyperparameter optimization. In \u003cem\u003eArtificial intelligence and statistics\u003c/em\u003e, pages 240\u0026ndash;248. PMLR. \u003c/li\u003e\n\u003cli\u003eJavaid, A., Zghyer, F., Kim, C., Spaulding, E. M., Isakadze, N., Ding, J., Kargillis, D., Gao, Y., Rahman, F., Brown, D. E., Saria, S., Martin, S. S., Kramer, C. M., Blumenthal, R. S., and Marvel, F. A. (2022). Medicine 2032: The future of cardiovascular disease prevention with machine learning and digital health technology. \u003cem\u003eAmerican Journal of Preventive Cardiology\u003c/em\u003e, 12:100379. https://doi.org/10.1016/j.ajpc.2022.100379\u003c/li\u003e\n\u003cli\u003eJiang, W., Li, Y., Du, Y., \u0026amp; Fan, J. (2015). Cognitive deficits feature of male with attention deficit hyperactivity disorder-based on the study of WISC-IV. \u003cem\u003eJ. Psychiatry\u003c/em\u003e, \u003cem\u003e18\u003c/em\u003e, 1-6. https://doi.org/10.4172/Psychiatry.1000252\u003c/li\u003e\n\u003cli\u003eKajka, N., \u0026amp; Kulik, A. (2021). The influence of metacognitive strategies on the improvement of reaction inhibition processes in children with ADHD. \u003cem\u003eInternational Journal of Environmental Research and Public Health\u003c/em\u003e, \u003cem\u003e18\u003c/em\u003e(3), 878. https://doi.org/10.1016/j.jksuci.2019.06.012\u003c/li\u003e\n\u003cli\u003eKe, G., Meng, Q., Finley, T., Wang, T., Chen, W., Ma, W., Ye, Q., and Liu, T.-Y. (2017). lightgbm: A highly efficient gradient boosting decision tree. In \u003cem\u003eAdvances in Neural Information Processing Systems\u003c/em\u003e, pages 3149\u0026ndash;3157.\u003c/li\u003e\n\u003cli\u003eKhaire, U. M., \u0026amp; Dhanalakshmi, R. (2022). Stability of feature selection algorithm: A review. Journal of King Saud University-Computer and Information Sciences, 34(4), 1060-1073. https://doi.org/10.1016/j.jksuci.2019.06.012\u003c/li\u003e\n\u003cli\u003eKibby, M. Y., Vadnais, S. A., \u0026amp; Jagger-Rickels, A. C. (2019). Which components of processing speed are affected in ADHD subtypes?. \u003cem\u003eChild Neuropsychology\u003c/em\u003e, \u003cem\u003e25\u003c/em\u003e(7), 964-979. https://doi.org/10.1080/09297049.2018.1556625\u003c/li\u003e\n\u003cli\u003eKim, J. H., Kim, J. Y., Lee, J., Jeong, G. H., Lee, E., Lee, S., ... \u0026amp; Fusar-Poli, P. (2020). Environmental risk factors, protective factors, and peripheral biomarkers for ADHD: an umbrella review. \u003cem\u003eThe Lancet Psychiatry\u003c/em\u003e, \u003cem\u003e7\u003c/em\u003e(11), 955-970. https://doi.org/10.1016/S2215-0366(20)30312-6\u003c/li\u003e\n\u003cli\u003eKim, H., \u0026amp; Song, D. H. (2020). Comparison of the K-WISC-IV profiles of boys with autism spectrum disorder and attention-deficit/hyperactivity disorder. \u003cem\u003eResearch in Developmental Disabilities\u003c/em\u003e, \u003cem\u003e97\u003c/em\u003e, 103539.https://doi.org/10.1016/j.ridd.2019.103539\u003c/li\u003e\n\u003cli\u003eKrane, E., \u0026amp; Tannock, R. (2001). WISC-III third factor indexes learning problems but not attention deficit/hyperactivity disorder. \u003cem\u003eJournal of Attention Disorders\u003c/em\u003e, \u003cem\u003e5\u003c/em\u003e(2), 69-78. https://doi.org/10.1177/108705470100500201\u003c/li\u003e\n\u003cli\u003eKrieger, V., \u0026amp; Amador-Campos, J. A. (2018). Assessment of executive function in ADHD adolescents: contribution of performance tests and rating scales. \u003cem\u003eChild Neuropsychology\u003c/em\u003e, \u003cem\u003e24\u003c/em\u003e(8), 1063-1087. https://doi.org/10.1080/09297049.2017.1386781\u003c/li\u003e\n\u003cli\u003eKoh, P. W. and Liang, P. (2017). Understanding black-box predictions via influence functions. In \u003cem\u003eProceedings of the 34th International Conference on Machine Learning - Volume 70\u003c/em\u003e, ICML\u0026rsquo;17, pages 1885\u0026ndash;1894. JMLR.org.\u003c/li\u003e\n\u003cli\u003eKubo, Y., Kanazawa, T., Kawabata, Y., Tanaka, K., Okayama, T., Fujta, M., ... \u0026amp; Yoneda, H. (2018). Comparative analysis of the WISC between two ADHD subgroups. \u003cem\u003ePsychiatry investigation\u003c/em\u003e, \u003cem\u003e15\u003c/em\u003e(2), 172. https://doi.org/10.30773/pi.2017.07.12\u003c/li\u003e\n\u003cli\u003eKursa, M. B. and Rudnicki, W. R. (2010). Feature selection with the Boruta package. \u003cem\u003eJournal of Statistical Software\u003c/em\u003e, 36(11):1\u0026ndash;13. https://doi.org/10.18637/jss.v036.i11\u003c/li\u003e\n\u003cli\u003eLavigne- Cerv\u0026aacute;n, R., \u0026amp; Romero- P\u0026eacute;rez, JF (2010). \u003cem\u003eEl ADHD:\u0026iquest;qu\u0026eacute; es?,\u0026iquest;qu\u0026eacute; lo causa?,\u0026iquest;c\u0026oacute;mo evaluarlo y tratarlo?. \u003c/em\u003e(1\u0026ordf; ed.) Pir\u0026aacute;mide.\u003c/li\u003e\n\u003cli\u003eLi, L., Jamieson, K., DeSalvo, G., Rostamizadeh, A., and Talwalkar, A. (2017). Hyperband: A novel bandit-based approach to hyperparameter optimiza- tion. \u003cem\u003eThe Journal of Machine Learning Research\u003c/em\u003e, 18(1):6765\u0026ndash;6816.\u003c/li\u003e\n\u003cli\u003eLiang, D., Frederick, D. A., Lledo, E. E., Rosenfield, N., Berardi, V., Linstead, E., and Maoz, U. (2022). Examining the utility of nonlinear machine learn- ing approaches versus linear regression for predicting body image outcomes: The u.s. body project i. \u003cem\u003eBody Image\u003c/em\u003e, 41:32\u0026ndash;45. https://doi.org/10.1016/j.bodyim.2022.01.013\u003c/li\u003e\n\u003cli\u003eLiu, B. (2020). New technology application in logistics industry based on machine learning and embedded network. \u003cem\u003eMicroprocessors and Microsystems\u003c/em\u003e, page 103596. https://doi.org/10.1016/j.micpro.2020.103484\u003c/li\u003e\n\u003cli\u003eLiu, D., Fox, K., Weber, G., and Miller, T. (2022). Confederated learning in healthcare: Training machine learning models using disconnected data separated by individual, data type and identity for large-scale health system intelligence. \u003cem\u003eJournal of Biomedical Informatics\u003c/em\u003e, 134:104151. https://doi.org/10.1016/j.jbi.2022.104151\u003c/li\u003e\n\u003cli\u003eLundberg, S. (2019). Shap (Shapley additive explanations). https://github.com/slundberg/shap.\u003c/li\u003e\n\u003cli\u003eLundberg, S. M. and Lee, S.-I. (2017a). Consistent feature attribution for tree ensembles. \u003cem\u003earXiv preprint arXiv:1706.06060\u003c/em\u003e.\u003c/li\u003e\n\u003cli\u003eLundberg, S. M. and Lee, S.-I. (2017b). A unified approach to interpreting model predictions. In \u003cem\u003eAdvances in Neural Information Processing Systems\u003c/em\u003e, pages 4765\u0026ndash;4774.\u003c/li\u003e\n\u003cli\u003eMaci\u0026agrave; Ant\u0026oacute;n, D. (2012). \u003cem\u003eADHD en la infancia y la adolescencia: concepto, evaluaci\u0026oacute;n y tratamiento. \u003c/em\u003e(1\u0026ordf; ed.). Pir\u0026aacute;mide.\u003c/li\u003e\n\u003cli\u003eMacKay, D. J. (1992). Bayesian interpolation. \u003cem\u003eNeural computation\u003c/em\u003e, 4(3):415\u0026ndash; 447. https://doi.org/10.1162/neco.1992.4.3.415\u003c/li\u003e\n\u003cli\u003eMarshall, P., Hoelzle, J., \u0026amp; Nikolas, M. (2021). Diagnosing Attention-Deficit/Hyperactivity Disorder (ADHD) in young adults: A qualitative review of the utility of assessment measures and recommendations for improving the diagnostic process. \u003cem\u003eThe Clinical Neuropsychologist\u003c/em\u003e, \u003cem\u003e35\u003c/em\u003e(1), 165-198. https://doi.org/10.1080/13854046.2019.1696409\u003c/li\u003e\n\u003cli\u003eMatsuura, N., Ishitobi, M., Arai, S., Kawamura, K., Asano, M., Inohara, K., ... \u0026amp; Kosaka, H. (2014). Distinguishing between autism spectrum disorder and attention deficit hyperactivity disorder by using behavioral checklists, cognitive assessments, and neuropsychological test battery. \u003cem\u003eAsian journal of psychiatry\u003c/em\u003e, \u003cem\u003e12\u003c/em\u003e, 50-57. https://doi.org/10.1016/j.ajp.2014.06.011\u003c/li\u003e\n\u003cli\u003eMayes, S. D., \u0026amp; Calhoun, S. L. (2004). Similarities and differences in Wechsler Intelligence Scale for Children\u0026mdash;Third Edition (WISC-III) profiles: Support for subtest analysis in clinical referrals. \u003cem\u003eThe Clinical Neuropsychologist\u003c/em\u003e, \u003cem\u003e18\u003c/em\u003e(4), 559-572. https://doi.org/10.1080/13854040490888530\u003c/li\u003e\n\u003cli\u003eMayes, S. D., \u0026amp; Calhoun, S. L. (2006). WISC-IV and WISC-III profiles in children with ADHD. \u003cem\u003eJournal of Attention Disorders\u003c/em\u003e, \u003cem\u003e9\u003c/em\u003e(3), 486-493. https://doi.org/10.1177/1087054705283616\u003c/li\u003e\n\u003cli\u003eMehta, T. R., Monegro, A., Nene, Y., Fayyaz, M., \u0026amp; Bollu, P. C. (2019). Neurobiology of ADHD: a review. \u003cem\u003eCurrent Developmental Disorders Reports\u003c/em\u003e, \u003cem\u003e6\u003c/em\u003e, 235-240. https://doi.org/10.1007/s40474-019-00182-w\u003c/li\u003e\n\u003cli\u003eMiller, M., Iosif, A. M., Young, G. S., Hill, M. M., \u0026amp; Ozonoff, S. (2018). Early detection of ADHD: Insights from infant siblings of children with autism. \u003cem\u003eJournal of Clinical Child \u0026amp; Adolescent Psychology\u003c/em\u003e, \u003cem\u003e47\u003c/em\u003e(5), 737-744. https://doi.org/10.1080/15374416.2016.1220314\u003c/li\u003e\n\u003cli\u003eMoura, O., Costa, P., \u0026amp; Sim\u0026otilde;es, M. R. (2019). WISC-III cognitive profiles in children with ADHD: specific cognitive impairments and diagnostic utility. \u003cem\u003eThe Journal of general psychology\u003c/em\u003e, \u003cem\u003e146\u003c/em\u003e(3), 258-282. https://doi.org/10.1080/00221309.2018.1561410\u003c/li\u003e\n\u003cli\u003eMu\u0026ntilde;iz, J., Hern\u0026aacute;ndez, A., \u0026amp; Fern\u0026aacute;ndez-Hermida, J. R. (2020). Utilizaci\u0026oacute;n de los test en Espa\u0026ntilde;a: El punto de vista de los psic\u0026oacute;logos. \u003cem\u003ePapeles del Psic\u0026oacute;logo\u003c/em\u003e, \u003cem\u003e41\u003c/em\u003e(1), 1-15. https://doi.org/10.23923/pap.psicol2020.2921\u003c/li\u003e\n\u003cli\u003eNavarro-Soria, I., Fenollar, J., Carbonell, J., \u0026amp; Real, M. (2020). Memoria de trabajo y velocidad de procesamiento evaluado mediante WISC-IV como claves en la evaluaci\u0026oacute;n del ADHD. https://doi.org/10.21134/rpcna.2020.07.1.3\u003c/li\u003e\n\u003cli\u003eNarciso, D. A. and Martins, F. (2020). Application of machine learning tools for energy efficiency in industry: A review. \u003cem\u003eEnergy Reports\u003c/em\u003e, 6:1181\u0026ndash;1199. https://doi.org/10.1016/j.egyr.2020.04.035\u003c/li\u003e\n\u003cli\u003eOakland, T., \u0026amp; Hu, S. (1992). The top 10 tests used with children and youth worldwide. \u003cem\u003eBulletin of the International Test Commission\u003c/em\u003e, \u003cem\u003e19\u003c/em\u003e(1), 99-120.\u003c/li\u003e\n\u003cli\u003eOakland, T., Iliescu, D., Chen, H. Y., \u0026amp; Chen, J. H. (2013). Cross-national assessment of adaptive behavior in three countries. \u003cem\u003eJournal of Psychoeducational Assessment\u003c/em\u003e, \u003cem\u003e31\u003c/em\u003e(5), 435-447. https://doi.org/10.1177/0734282912469492\u003c/li\u003e\n\u003cli\u003eOakland, T., Douglas, S., \u0026amp; Kane, H. (2016). Top ten standardized tests used internationally with children and youth by school psychologists in 64 countries: A 24-year follow-up study. \u003cem\u003eJournal of Psychoeducational Assessment\u003c/em\u003e, \u003cem\u003e34\u003c/em\u003e(2), 166-176. https://doi.org/10.1177/0734282915595303\u003c/li\u003e\n\u003cli\u003eParke, E. M., Thaler, N. S., Etcoff, L. M., \u0026amp; Allen, D. N. (2020). Intellectual profiles in children with ADHD and comorbid learning and motor disorders. \u003cem\u003eJournal of attention disorders\u003c/em\u003e, \u003cem\u003e24\u003c/em\u003e(9), 1227-1236. https://doi.org/10.1177/1087054715576343\u003c/li\u003e\n\u003cli\u003ePedregosa, F., Varoquaux, G., Gramfort, A., Michel, V., Thirion, B., Grisel, O., Blondel, M., Prettenhofer, P., Weiss, R., Dubourg, V., Vanderplas, J., Passos, A., Cournapeau, D., Brucher, M., Perrot, M., and Duchesnay, E. (2011). Scikit-learn: Machine learning in Python. \u003cem\u003eJournal of Machine Learning Research\u003c/em\u003e, 12:2825\u0026ndash;2830.\u003c/li\u003e\n\u003cli\u003ePolanczyk, G. V., Willcutt, E. G., Salum, G. A., Kieling, C., \u0026amp; Rohde, L. A. (2014). ADHD prevalence estimates across three decades: an updated systematic review and meta-regression analysis. \u003cem\u003eInternational journal of epidemiology\u003c/em\u003e, \u003cem\u003e43\u003c/em\u003e(2), 434-442. https://doi.org/10.1093/ije/dyt261\u003c/li\u003e\n\u003cli\u003eQuinlan, R. J. (1992). Learning with continuous classes. In \u003cem\u003e5th Australian Joint Conference on Artificial Intelligence\u003c/em\u003e, pages 343\u0026ndash;348, Singapore. World Scientific.\u003c/li\u003e\n\u003cli\u003eRappaport, G. C., Ornoy, A., \u0026amp; Tenenbaum, A. (1998). Is early intervention effective in preventing ADHD?. \u003cem\u003eIsrael Journal of Psychiatry\u003c/em\u003e, \u003cem\u003e35\u003c/em\u003e(4), 271.\u003c/li\u003e\n\u003cli\u003eReid, R., \u0026amp; Maag, J. W. (1994). How many fidgets in a pretty much: A critique of behavior rating scales for identifying students with ADHD. \u003cem\u003eJournal of School Psychology\u003c/em\u003e, \u003cem\u003e32\u003c/em\u003e(4), 339-354. https://doi.org/10.1016/0022-4405(94)90032-9\u003c/li\u003e\n\u003cli\u003eRico-Juan, J. R., Gallego, A.-J., \u0026amp; Calvo-Zaragoza, J. (2019). Automatic detection of inconsistencies between numerical scores and textual feedback in peer-assessment processes with machine learning. Computers \u0026amp; Education,140 , 103609. https://doi.org/10.1016/j.compedu.2019.103609\u003c/li\u003e\n\u003cli\u003eRoth, A. E. (1988). \u003cem\u003eThe Shapley value: essays in honor of Lloyd S. Shapley\u003c/em\u003e. Cambridge University Press. https://doi.org/10.1017/CBO9780511528446\u003c/li\u003e\n\u003cli\u003eServera, M., \u0026amp; Cardo, E. (2007). ADHD Rating Scale-IV en una muestra escolar espa\u0026ntilde;ola: datos normativos y consistencia interna para maestros, padres y madres. \u003cem\u003eRev Neurol\u003c/em\u003e, \u003cem\u003e45\u003c/em\u003e(7), 393-9. https://doi.org/10.33588/rn.4507.2007301\u003c/li\u003e\n\u003cli\u003eSharma, A. (2018). Guided stochastic gradient descent algorithm for inconsistent datasets. \u003cem\u003eApplied Soft Computing\u003c/em\u003e, 73:1068\u0026ndash;1080. https://doi.org/10.1016/j.asoc.2018.09.038\u003c/li\u003e\n\u003cli\u003eSimon, V., Czobor, P., B\u0026aacute;lint, S., M\u0026eacute;sz\u0026aacute;ros, A., \u0026amp; Bitter, I. (2009). Prevalence and correlates of adult attention-deficit hyperactivity disorder: meta-analysis. \u003cem\u003eThe British Journal of Psychiatry\u003c/em\u003e, \u003cem\u003e194\u003c/em\u003e(3), 204-211. https://doi.org/10.1192/bjp.bp.107.048827\u003c/li\u003e\n\u003cli\u003eSonuga-Barke, E. J., Koerting, J., Smith, E., McCann, D. C., \u0026amp; Thompson, M. (2011). Early detection and intervention for attention-deficit/hyperactivity disorder. \u003cem\u003eExpert Review of Neurotherapeutics\u003c/em\u003e, \u003cem\u003e11\u003c/em\u003e(4), 557-563. https://doi.org/10.1586/ern.11.39\u003c/li\u003e\n\u003cli\u003eSnow, J. B., \u0026amp; Sapp, G. L. (2000). WISC\u0026mdash;III subtest patterns of ADHD and normal samples. \u003cem\u003ePsychological Reports\u003c/em\u003e, \u003cem\u003e87\u003c/em\u003e(3), 759-765.https://doi.org/10.2466/pr0.2000.87.3.759\u003c/li\u003e\n\u003cli\u003eStyck, K. M., \u0026amp; Watkins, M. W. (2017). Structural validity of the WISC-IV for students with ADHD. \u003cem\u003eJournal of Attention Disorders\u003c/em\u003e, \u003cem\u003e21\u003c/em\u003e(11), 921-928. https://doi.org/10.1177/1087054714553052\u003c/li\u003e\n\u003cli\u003eTaylor, E., Schachar, R., Thorley, G., \u0026amp; Wieselberg, M. (1986). Parental account of children\u0026apos;s symptoms. \u003cem\u003eBritish Journal of Psychiatry\u003c/em\u003e, \u003cem\u003e149\u003c/em\u003e, 760-767. https://doi.org/10.1192/bjp.149.6.760\u003c/li\u003e\n\u003cli\u003eToffalini, E., Buono, S., \u0026amp; Cornoldi, C. (2022). The structure, profile, and diagnostic significance of intelligence in children with ADHD are impressively similar to those of children with a specific learning disorder. \u003cem\u003eResearch in Developmental Disabilities\u003c/em\u003e, \u003cem\u003e129\u003c/em\u003e, 104306. https://doi.org/10.1016/j.ridd.2022.104306\u003c/li\u003e\n\u003cli\u003eThaler, N. S., Barchard, K. A., Parke, E., Jones, W. P., Etcoff, L. M., \u0026amp; Allen, D. N. (2015). Factor structure of the Wechsler Intelligence Scale for Children: in children with ADHD. \u003cem\u003eJournal of attention disorders\u003c/em\u003e, \u003cem\u003e19\u003c/em\u003e(12), 1013-1021. https://doi.org/10.1177/1087054712459952\u003c/li\u003e\n\u003cli\u003eThapar, A. (2018). Discoveries on the genetics of ADHD in the 21st century: new findings and their implications. \u003cem\u003eAmerican Journal of Psychiatry\u003c/em\u003e, \u003cem\u003e175\u003c/em\u003e(10), 943-950. https://doi.org/10.1176/appi.ajp.2018.18040383\u003c/li\u003e\n\u003cli\u003eThomas, R., Sanders, S., Doust, J., Beller, E., \u0026amp; Glasziou, P. (2015). Prevalence of attention-deficit/hyperactivity disorder: a systematic review and meta-analysis. \u003cem\u003ePediatrics\u003c/em\u003e, \u003cem\u003e135\u003c/em\u003e(4), e994-e1001. https://doi.org/10.1542/peds.2014-3482\u003c/li\u003e\n\u003cli\u003eTibshirani, R. (1996). Regression shrinkage and selection via the lasso. \u003cem\u003eJournal of the Royal Statistical Society: Series B (Methodological)\u003c/em\u003e, 58(1):267\u0026ndash;288. https://doi.org/10.1111/j.2517-6161.1996.tb02080.x\u003c/li\u003e\n\u003cli\u003eUlla, Ek. Fernell, E., Westerlund, J., Holmberg, K., Olsson, P. O., \u0026amp; Gillberg, C. (2007). Cognitive strengths and deficits in schoolchildren with ADHD. \u003cem\u003eActa Paediatrica\u003c/em\u003e, \u003cem\u003e96\u003c/em\u003e(5), 756-761. https://doi.org/10.1111/j.1651-2227.2007.00297.x\u003c/li\u003e\n\u003cli\u003eUlleb\u0026oslash;, A. K., Posserud, M. B., Heiervang, E., Gillberg, C., \u0026amp; Obel, C. (2011). Screening for the attention deficit hyperactivity disorder phenotype using the strength and difficulties questionnaire. \u003cem\u003eEuropean child \u0026amp; adolescent psychiatry\u003c/em\u003e, \u003cem\u003e20\u003c/em\u003e, 451-458. https://doi.org/10.1007/s00787-011-0198-9\u003c/li\u003e\n\u003cli\u003eWalg, M., Hapfelmeier, G., El-Wahsch, D., \u0026amp; Prior, H. (2017). The faster internal clock in ADHD is related to lower processing speed: WISC-IV profile analyses and time estimation tasks facilitate the distinction between real ADHD and pseudo-ADHD. \u003cem\u003eEuropean child \u0026amp; adolescent psychiatry\u003c/em\u003e, \u003cem\u003e26\u003c/em\u003e, 1177-1186. https://doi.org/10.1007/s00787-017-0971-5\u003c/li\u003e\n\u003cli\u003eWanderer, S., Roessner, V., Strobel, A., \u0026amp; Martini, J. (2021). WISC-IV performance of children with chronic tic disorder, obsessive\u0026ndash;compulsive disorder and attention-deficit/hyperactivity disorder: results from a German clinical study. \u003cem\u003eChild and Adolescent Psychiatry and Mental Health\u003c/em\u003e, \u003cem\u003e15\u003c/em\u003e(1), 1-13. https://doi.org/10.1186/s13034-021-00392-4\u003c/li\u003e\n\u003cli\u003eWang, Y. \u0026amp; Witten, I. H. (1997). Induction of model trees for predicting continuous classes. In \u003cem\u003ePoster papers of the 9th European Conference on Machine Learning\u003c/em\u003e. Springer.\u003c/li\u003e\n\u003cli\u003eWeisberg, S. (2005). \u003cem\u003eApplied linear regression\u003c/em\u003e, volume 528. John Wiley \u0026amp; Sons. https://doi.org/10.1002/0471704091\u003c/li\u003e\n\u003cli\u003eWeschler, D. (2003). Wechsler intelligence scale for children-forth edition (WISC-IV) administration and scoring manual. \u003cem\u003eThe Psychological Association, San Antonio\u003c/em\u003e.\u003c/li\u003e\n\u003cli\u003eWechsler, D. (2005). \u003cem\u003eWISC-IV\u003c/em\u003e. TEA\u003c/li\u003e\n\u003cli\u003eWilcoxon, F. (1945). Individual comparisons by ranking methods. \u003cem\u003eBiometrics bulletin\u003c/em\u003e, 1(6):80\u0026ndash;83. https://doi.org/10.2307/3001968\u003c/li\u003e\n\u003cli\u003eYang, H. N., Tai, Y. M., Yang, L. K., \u0026amp; Gau, S. S. F. (2013). Prediction of childhood ADHD symptoms to quality of life in young adults: adult ADHD and anxiety/depression as mediators. \u003cem\u003eResearch in developmental disabilities\u003c/em\u003e, \u003cem\u003e34\u003c/em\u003e(10), 3168-3181. https://doi.org/10.1016/j.ridd.2013.06.011\u003c/li\u003e\n\u003cli\u003eYang, T. X., Allen, R. J., Holmes, J., \u0026amp; Chan, R. C. (2017). Impaired memory for instructions in children with attention-deficit hyperactivity disorder is improved by action at presentation and recall. \u003cem\u003eFrontiers in Psychology\u003c/em\u003e, 39. https://doi.org/10.3389/fpsyg.2017.00039\u003c/li\u003e\n\u003c/ol\u003e"}],"fulltextSource":"","fullText":"","funders":[],"hasAdminPriorityOnWorkflow":false,"hasManuscriptDocX":true,"hasOptedInToPreprint":true,"hasPassedJournalQc":"","hasAnyPriority":false,"hideJournal":false,"highlight":"","institution":"","isAcceptedByJournal":true,"isAuthorSuppliedPdf":false,"isDeskRejected":"","isHiddenFromSearch":false,"isInQc":false,"isInWorkflow":false,"isPdf":false,"isPdfUpToDate":true,"isWithdrawnOrRetracted":false,"journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true},"keywords":"","lastPublishedDoi":"10.21203/rs.3.rs-3992393/v1","lastPublishedDoiUrl":"https://doi.org/10.21203/rs.3.rs-3992393/v1","license":{"name":"CC BY 4.0","url":"https://creativecommons.org/licenses/by/4.0/"},"manuscriptAbstract":"\u003cp\u003eThe aim of this study is to predict the probability being diagnosed with ADHD using ML algorithms and to explain the behavior of the model to support decision making. The dataset studied included 694 cases. Information was obtained on age, sex and WISC-IV scores. Algorithms belonging to different ML learning styles were tested. A stratified 10-fold-cross-validation was applied to evaluate the models. The metrics were used: accuracy, area under the receiver operating characteristic, sensitivity and specificity. We compared models using all initial features and a suitable wrapper-type feature selection algorithm. After, we calculated Shapley additive values to assign weights to each predictor based on its additive contribution to the outcome and explain the predictions. The Random Forest algorithm performed best on most metrics. The main predictors included, GAI-CPI, WMI, CPI, PSI, VCI, WMI - PSI, PRI and LN. The ML model adequately predicted ADHD diagnosis in 90% of cases.\u003c/p\u003e","manuscriptTitle":"Prediction of attention deficit hyperactivity disorder based on explainable artificial intelligence","msid":"","msnumber":"","nonDraftVersions":[{"code":1,"date":"2024-03-12 16:12:51","doi":"10.21203/rs.3.rs-3992393/v1","editorialEvents":[{"type":"communityComments","content":0}],"status":"published","journal":{"display":true,"email":"[email protected]","identity":"researchsquare","isNatureJournal":false,"hasQc":true,"allowDirectSubmit":true,"externalIdentity":"","sideBox":"","snPcode":"","submissionUrl":"/submission","title":"Research Square","twitterHandle":"researchsquare","acdcEnabled":true,"dfaEnabled":false,"editorialSystem":"","reportingPortfolio":"","inReviewEnabled":false,"inReviewRevisionsEnabled":true}}],"origin":"","ownerIdentity":"19bfb467-fd1c-4336-8199-60cc209739dd","owner":[],"postedDate":"March 12th, 2024","published":true,"recentEditorialEvents":[],"rejectedJournal":[],"revision":"","amendment":"","status":"published-in-journal","subjectAreas":[],"tags":[],"updatedAt":"2024-04-10T01:44:59+00:00","versionOfRecord":{"articleIdentity":"rs-3992393","link":"https://doi.org/10.1080/21622965.2024.2336019","journal":{"identity":"applied-neuropsychology-child","isVorOnly":true,"title":"Applied Neuropsychology: Child"},"publishedOn":"2024-04-09 01:44:59","publishedOnDateReadable":"April 9th, 2024"},"versionCreatedAt":"2024-03-12 16:12:51","video":"","vorDoi":"10.1080/21622965.2024.2336019","vorDoiUrl":"https://doi.org/10.1080/21622965.2024.2336019","workflowStages":[]},"version":"v1","identity":"rs-3992393","journalConfig":"researchsquare"},"__N_SSP":true},"page":"/article/[identity]/[[...version]]","query":{"redirect":"/article/rs-3992393","identity":"rs-3992393","version":["v1"]},"buildId":"8U1c8b4HqxoKbykW_rLl7","isFallback":false,"isExperimentalCompile":false,"dynamicIds":[84888],"gssp":true,"scriptLoader":[]}

Text is read by the "Ask this paper" AI Q&A widget below. Extraction quality varies by source — PMC NXML preserves structure cleanly, OA-HTML may include some navigation residue, and OA-PDF can have broken hyphenation. The publisher copy (via DOI) is the canonical version.

My notes (saved in your browser only)

Ask this paper AI returns verbatim quotes from the full text · source: preprint-html

Answers must be backed by verbatim quotes from this paper's full text. Hallucinated quotes are dropped automatically; if no verbatim passage answers the question, we say so. How this works

Citation neighborhood (no data yet)

We don't have any in-corpus citations linked to this paper yet. This is a recent paper (2024) — citers typically take a year or two to land, and the OpenAlex reference graph may still be filling in.

Source provenance

europepmc
last seen: 2026-05-20T01:45:00.602351+00:00