Introduction

Tourette syndrome (TS) is the most common chronic tic disorder affecting children and is characterized by multiple motor tics and at least one vocal tic1. Tic disorders typically begin in childhood, usually between the ages of 5 and 10, affecting both males and females. However, they are more prevalent among males, with research indicating that males are 3–5 times more likely to be diagnosed with Tourette syndrome compared to females2. Tourette syndrome may interfere with concentration and learning, leading to academic challenges 1,3. Despite the noticeable increase in Tourette syndrome and tic-like behaviors, particularly since the onset of the COVID-19 pandemic, underdiagnosis remains common. This underdiagnosis may result in serious consequences, including chronic stress, low self-esteem, and difficulties in personal and professional life4. Compared to other tic disorders, TS is often considered to affect children more significantly because TS can be more disruptive and noticeable than having only motor or vocal tics, as in other tic disorders5. Additionally, while other tic disorders can involve complex tics, TS is more commonly associated with them, and the combination and complexity of tics in TS can have a greater impact on a child’s social interactions, school performance, and overall quality of life6. Therefore, early diagnosis of tic disorders, particularly Tourette syndrome, is crucial for timely intervention and prevention of the development of more severe symptoms and complications7.

Currently, the diagnosis of TS is primarily based on behavioral assessments and criteria outlined in the Diagnostic and Statistical Manual of Mental Disorders, Fifth Edition (DSM-5)5. Early diagnosis of TS in clinical settings remains challenging due to several factors. Firstly, there is no blood test, brain scan, or other objective method to diagnose TS. Diagnosis relies solely on the observation and documentation of the patient’s history of tics. Secondly, certain tics, such as eye blinking or throat clearing, can be mistaken for other issues, including allergies, vision problems, or nervous habits. This can lead to misdiagnosis or delayed diagnosis8. Thirdly, the presence of comorbidities can complicate the diagnostic process, as symptoms may overlap or mask one another. Additionally, individuals with TS and their families may feel embarrassed or reluctant to discuss tics, especially if they are mild or infrequent, which can result in underreporting and delays in diagnosis. Therefore, this study aims to establish a machine learning–based predictive model for predicting the risk of TS. For this purpose, we collected data including complete blood counts, liver function assessments, kidney function evaluations, blood glucose levels, serum electrolyte analyses, and vitamin D level. These indicators were selected not only because they are widely accessible across all healthcare facility levels in China but also due to their high cost-effectiveness, making them particularly suitable for large-scale screening of TS9. This study will not only enhance our understanding of the biological basis of TS but also improve treatment strategies for timely intervention and management.

Methods

Study design

As depicted in Fig. 1, this preliminary retrospective observational pilot cohort study was conducted at the Department of Pediatrics, Affiliated Hospital of Jiangnan University, spanning the period from January 2022 to October 2024, building upon our previous research10,11. The TS group encompassed children aged 8–10 years diagnosed with TS, while the control group consisted of healthy children admitted for routine check-ups. Table 1 outlines the fundamental clinical characteristics of both cohorts. Exclusion criteria for the study were children not diagnosed with TS, those with concurrent psychiatric disorders, and those with incomplete clinical or laboratory data. To facilitate practical clinical application of the predictive model, we focused on disparities in complete blood count, liver function assessments, kidney function evaluations, blood glucose levels, and serum electrolyte analyses. These data were gathered within the initial 24-h period following admission to the department. Initially, a total of 50 features were investigated (Table 1).

Fig. 1
figure 1

Flow chart of the study design.

Table 1 The basic clinical characteristics of participants involved in this study.

Ethical statement

This pilot observational retrospective study adhered to the tenets of the Declaration of Helsinki. The study protocol was developed based on our previous research and received ethical approval from the Institutional Review Board of the Affiliated Hospital of Jiangnan University (ID: LS2024KY721)12. Participants were thoroughly informed about the study’s objectives, procedures, potential risks, and benefits and all participants were provided informed consent during the study. To uphold confidentiality, all collected data were anonymized and securely stored, ensuring the privacy of participant information.

Clinical diagnostic criteria for Tourette syndrome

The clinical diagnostic criteria for Tourette syndrome are based on the DSM-513. The detailed information on the clinical diagnostic criteria for Tourette syndrome, including specific symptom lists and exclusion criteria, is shown in Supplemental File 1.

Feature selection

As illustrated in Fig. 1, we performed feature selection combined using Boruta and multivariable logistic regression analyses, drawing upon methodologies from our prior investigations14. As illustrated in Fig. 2, We selected important features from the Boruta algorithm by identifying confirmed important variables. Multivariable logistic regression was subsequently performed to further refine feature selection and to determine the odds ratios (OR) with their respective confidence intervals for each significant predictor. Features with OR > 1 indicate an increased likelihood of Tourette syndrome, while OR < 1 suggests a decreased likelihood10,15. To mitigate bias arising from multicollinearity, we conducted both Spearman correlation and variance inflation factor (VIF) analyses and eliminated variables exhibiting significant correlations (|R|> 0.8) or VIF > 10, adhering to the methodology delineated in the aforementioned study16. The raw R scripts, along with detailed information regarding the feature selection analyses, are openly available in our GitHub repository: (https://github.com/PediatricLab-Jiangnan/TS-Syndrome-Brouta-ML-Web).

Fig. 2
figure 2

Comprehensive analysis of multiple variables to identify significant features. (A) Multicollinearity analysis using Spearman correlation, with correlations having an absolute value greater than 0.8 annotated with an asterisk. (B) Multicollinearity analysis using the variance inflation factor (VIF), showing features with VIF values below 10. (C) Boruta feature selection algorithm, where features in green are confirmed as important. (D) Variable importance scores across iterations in the Boruta feature selection algorithm. (E) Forest plot of significant variables (P < 0.05). (F) A Venn diagram to illustrate the overlap of features between the Boruta algorithm and multivariable logistic regression analysis.

Model construction and evaluation

As illustrated in Fig. 3, we combined the Boruta algorithm with multivariable logistic regression analysis to select relevant features, effectively to mitigate overfitting17. Consequently, a total of 10 features were identified for the development of predictive models. Before applying advanced machine learning (ML) techniques, we established a traditional predictive nomogram model and compared the Receiver Operating Characteristic (ROC) curves for the combination of 10 features as well as individual features. Subsequently, we employed 9 distinct ML techniques to predict the risk probability of Tic disorder using the selected 10 features: 1. Adaptive Boosting (AdaBoost), 2. Lasso Regression (Lasso), 3. Random Forest (RF), 4. Gradient Boosting Machine (GBM), 5. Extreme Gradient Boosting (XGBoost), 6. Support Vector Machine (SVM), 7. K-Nearest Neighbors (KNN), 8. Multilayer Perceptron (MLP), and 9. Decision Tree (DT). The use of multiple models facilitated a comparison of their performance and the identification of the most effective technique for predicting the risk of Tourette syndrome. We implemented tenfold cross-validation to rigorously assess model performance, wherein the entire dataset was partitioned into 10 subsets, with each subset iteratively serving as the validation set while training occurred on the remaining nine subsets. This methodology ensured thorough evaluation without necessitating a separate dataset split. Model reliability was assessed using four primary metrics: the area under the receiver operating characteristic curve (AUC), Precision-Recall Curve (PRC), Decision Curve Analysis (DCA), and calibration curve16. Additionally, specificity, sensitivity, accuracy and F1-score were presented using a radar chart. The raw Python scripts, along with detailed information regarding the feature selection analyses, are openly available in our GitHub repository: (https://github.com/PediatricLab-Jiangnan/TS-Syndrome-Brouta-ML-Web).

Fig. 3
figure 3

Comprehensive comparison of different machine learning models. (A) Displays the ROC curves for the combination of 10 features as well as individual features. (BE) ROC, Precision-Recall, DCA, and Calibration curves for various machine learning models, highlighting the strong performance of the Gradient Boosting Model. (FI) Radar chart illustrating the accuracy, F1-score, sensitivity, specificity of each model.

Model interpretation

Interpreting machine learning models has consistently been a challenge. To further elucidate how each feature influences and contributes to the final model, we employed the SHapley Additive exPlanation (SHAP) technique to interpret the highest-performing black-box model, as established in prior studies and illustrated in Fig. 4. In this study, we evaluated the importance of each feature by computing the mean absolute SHAP values. Additionally, we plotted the SHAP values for each feature across samples to better understand the overall patterns and the extent of each feature’s impact on Tourette syndrome risk. We also utilized the SHAP dependency plots to assess the influence of each feature. Furthermore, we provided one illustrative one example of SHAP predictions for demonstration purposes.

Fig. 4
figure 4

Comprehensive analysis of feature importance and their impact on the risk of Tourette syndrome using the SHAP method. (A) SHAP summary dot plot illustrating the overall distribution of each feature’s influence, where red generally indicates high values and blue indicates low values for the corresponding feature. (B) SHAP summary bar plot depicting the mean absolute SHAP value for each feature. (C) Feature importance donut chart depicting the relative importance of each feature as a percentage of the total importance. (D) Waterfall chart explaining a single prediction by showing how each feature contributes to pushing the prediction from the base value (average prediction) to the final prediction. (EF) SHAP dependence plots of blood beta-2 microglobulin and 25-hydroxyvitamin D.

Web application development

To facilitate the utility of the model in clinical settings, the final prediction model was implemented into a web application developed using the Streamlit Python framework and available at https://touretterisk.streamlit.app/.

Mendelian randomization analysis

Mendelian Randomization (MR) analysis is a robust method that leverages genome-wide association study (GWAS) data and genetic variants to investigate potential causal relationships between modifiable exposures and various diseases or outcomes18. GWAS identifies genetic variants associated with specific traits, diseases, or outcomes, and these genetic variants—particularly single-nucleotide polymorphisms (SNPs)—serve as instrumental variables (IVs) in MR studies19. MR analysis has been extensively utilized to evaluate the potential effects of diverse exposures on disease risk19. In this study, we conducted Mendelian Randomization analysis based on previously established methodologies to explore the potential associations between Tourette syndrome and candidate biomarkers identified through machine learning analyses20,21,22. Detailed criteria for the MR analysis are provided in Supplemental File 2, and comprehensive information regarding the GWAS datasets employed in this study can be found in Table 2. Specifically, GWAS data for TS were sourced from the Psychiatric Genomics Consortium (PGC) (https://pgc.unc.edu/), while GWAS data for the candidate biomarkers selected by ML were obtained from the IEU OpenGWAS project (https://gwas.mrcieu.ac.uk/).

Table 2 Detailed information of GWAS data used in the study.

Statistical analysis

This study presents data as mean ± standard deviation (SD). The Shapiro–Wilk test evaluated the normality of the data distribution. For normally distributed data, unpaired t-tests, one-way ANOVA, or two-way ANOVA were employed for statistical comparisons, followed by Tukey’s post-hoc tests for multiple comparisons. Non-normally distributed data were analyzed using the Mann–Whitney U test for two groups or the Kruskal–Wallis test for multiple groups. All statistical analyses were conducted utilizing Python (https://www.python.org) and R (https://cran.r-project.org/). A P-value less than 0.05 was deemed statistically significant.

Results

Data processing and features selection

As depicted in Fig. 1 and Table 1, after selection, a total of 671 volunteer participants were involved, with no significant differences in age (P > 0.05), while significant differences were observed in sex and education between the groups (P < 0.05). Although educational attainment differed significantly between the groups (P < 0.05), it does not directly influence tic disorder risk. Among the 50 clinical features we collected, we observed higher levels of β2-microglobulin and lower levels of 25-hydroxyvitamin D in Tourette syndrome patients compared to healthy controls (Table 1). Spearman correlation and variance inflation factor (VIF) analyses were performed to identify and remove features contributing to multicollinearity. In this study, features with spearman correlations exceeding an absolute value of 0.8 or VIF values above 10 were excluded to minimize redundant information (Fig. 2A–B)23.

Subsequently, the Boruta algorithm was employed to identify significant features by comparing them to shadow features. Features consistently deemed more important than their shadow counterparts are depicted in green (Fig. 2C). The iterative variable importance scores generated by the Boruta algorithm further illustrate the selection process, demonstrating the consistency of feature selection and confirming the stability of feature importance rankings (Fig. 2D). It should be noted that although the Boruta algorithm indicated that early symptoms such as eye blinking, involuntary movements, and unintentionally uttered sounds were the most important features in predicting the risk probability of TS, a complete collinearity was observed between the early symptoms and the diagnostic status. All controls were asymptomatic, whereas all TS patients exhibited these symptoms. To avoid the confounding effect of multicollinearity due to early symptoms and to better assess the independent predictive value of other biomarkers, the early symptoms variable was removed prior to machine learning modeling. A forest plot quantifies the impact of the selected features on the outcome (Fig. 2E). Notably, the multivariable logistic regression analysis revealed a high OR for β2-microglobulin and a low OR for 25-hydroxyvitamin D₃, underscoring their potential roles in predicting the risk probability of Tourette syndrome. Finally, as shown in Fig. 2F, a Venn diagram was created to illustrate the overlap of features between the Boruta algorithm and multivariable logistic regression analysis. 10 features were chosen for machine learning model development.

Machine learning algorithm development and interpretation

Following feature selection using the Boruta algorithm and multivariable logistic regression analysis, 10 features were chosen for model development. As shown in the supplemental File 2 and Fig. 3A, Prior to implementing advanced machine learning (ML) techniques, we established a conventional predictive nomogram and compared Receiver Operating Characteristic (ROC) curves for both the composite 10-feature model and individual features independently. The analysis demonstrated that the combined 10-feature model exhibited superior predictive performance compared to any single feature in isolation. In this study, we employed nine widely used machine learning algorithms, including Adaptive Boosting (AdaBoost), Lasso Regression (Lasso), Random Forest (RF), Gradient Boosting Machine (GBM), Extreme Gradient Boosting (XGBoost), Support Vector Machine (SVM), K-Nearest Neighbors (KNN), Multilayer Perceptron (MLP), and Decision Tree (DT), to develop predictive models and select the optimal model. Among the nine machine learning algorithms evaluated, the Gradient Boosting Machine (GBM) demonstrated exceptional performance, achieving the highest area under the curve (AUC) in ROC analysis (0.94), PRC analysis (0.97), and superior performance in DCA analyses. Additionally, GBM ranked among the top two algorithms in calibration curve assessment. As evidenced in Fig. 3F–I Further evaluation through radar chart analysis revealed that GBM attained the highest accuracy (0.872), F1-score (0.907), sensitivity (0.952), and specificity (0.719) among all tested algorithms. Based on this comprehensive performance assessment, GBM was determined to be the optimal predictive model for this study.

To enhance the interpretability of the GBM prediction model, we applied the Shapley Additive Explanations (SHAP) method, as described in a prior study16. This method elucidates how variable features influence the model’s output regarding Tourette syndrome risk. As depicted in Fig. 4A, the summary dot plot provides a comprehensive overview of each feature’s impact on the model’s output. The x-axis represents the SHAP value, indicating the magnitude and direction of impact on the prediction, where positive values increase the predicted Tourette syndrome risk and negative values decrease it. Features on the y-axis are ranked by importance based on their average absolute SHAP values. The color gradient illustrates how a feature’s value affects its impact: red represents higher values and blue represents lower values. In our analysis, we observed distinct patterns that reveal important clinical insights. For 25-hydroxyvitamin D, high values (red dots) predominantly appear on the left side (negative SHAP value area), indicating that higher levels of this vitamin significantly decrease the predicted risk of Tourette syndrome. Conversely, for β2-microglobulin, low values (blue dots) appear on the right side (positive SHAP value area), suggesting that lower levels of this protein actually increase the predicted risk of Tourette syndrome. The summary bar plot (Fig. 4B) illustrates the average absolute SHAP value for each feature, with longer bars representing more influential features. The feature importance donut chart (Fig. 4C) displays the relative importance of each feature as a percentage of the total importance, with larger slices indicating more influential features. These results emphasize the critical roles of serum 25-dihydroxyvitamin D levels and β2-microglobulin in predicting Tourette syndrome risk. The waterfall chart illustrates how each feature contributes to the shift in prediction from the baseline value (average prediction, E[f(x)]) to the final prediction (f(x)) for a given instance. The contributions of each feature are represented by horizontal bars, where blue indicates a negative effect, and red signifies a positive effect. In this study, the visualization of high-risk TS samples shows how each feature value moves the prediction from the base value (1.369) to the final expected value (2.91) (see Fig. 4D). The numbers accompanying the variables represent the actual values for the case under analysis; for example, a value of 1.94 indicates that an increase of 1.94 mg/L in blood β2-microglobulin raises the risk probability of TS prediction by 0.9. The SHAP dependence plots for blood β2-microglobulin (Fig. 4E) and 25-hydroxyvitamin D (Fig. 4F) provided detailed insights into the relationship between each feature’s value and its SHAP value, with SHAP values above zero indicating an increased risk of Tourette syndrome24. In this study, our results indicated that higher levels of blood β2-microglobulin generally have a positive impact (increased Tourette syndrome risk), whereas higher levels of 25-hydroxyvitamin D have a negative impact (decreased Tourette syndrome risk). Finally, we performed a Mendelian Randomization analysis to investigate potential relationships between the selected biomarkers and Tourette Syndrome symptom severity (Fig. 5A). The results indicated a positive correlation between serum β2-microglobulin levels and TS symptoms, suggesting that elevated β2-microglobulin may exacerbate symptom severity (Fig. 5B). Conversely, we observed negative correlations between both vitamin D level and albumin and TS symptom severity (Fig. 5C and D). Furthermore, we established a web application to enhance the clinical value of our study (Fig. 6). In summary, our study successfully established a predictive model for Tourette syndrome risk using GBM. The SHAP method further emphasized the key roles of serum 25-dihydroxyvitamin D₃ and β2-microglobulin in predicting Tourette syndrome risk.

Fig. 5
figure 5

Mendelian Randomization analysis. (A) Forest plot of Mendelian Randomization (MR) analysis using the Inverse Variance Weighted (IVW) method, examining potential the relationship between selected biomarkers and Tourette Syndrome symptoms. A P-value less than 0.05 was considered statistically significant; odds ratios (OR) greater than 1 indicate risk factors, while ORs less than 1 suggest protective factors. (BD) Scatter plot from the MR analysis relating serum β2-microglobulin, vitamin D, and mean corpuscular volume levels to Tourette Syndrome symptoms. The x-axis represents the effect of single nucleotide polymorphisms (SNPs) on biomarker expression, and the y-axis shows their effect on Tourette Syndrome symptoms. An upward trend from left to right suggests a positive association between increased biomarker expression and exacerbated Tourette Syndrome symptoms, whereas a downward trend from left to right suggests a negative association.

Fig. 6
figure 6

The web application for the Tourette syndrome risk assessment.

Discussion

Tic disorders are neurological conditions characterized by sudden, repetitive, non-rhythmic movements or vocalizations known as tics. These tics typically emerge during childhood, most commonly between the ages of 5 and 10, and are more prevalent in boys than in girls25. Individuals with tics often experience a premonitory urge or sensation that is temporarily alleviated by performing the tic26.The severity of these symptoms may increase during periods of stress, anxiety, excitement, or life transitions, while they often decrease when the individual is distracted or engaged in focused activities.

Tic disorders are classified into several main types based on the nature of the tics and their duration. Among these, Tourette syndrome is a relatively common chronic tic disorder characterized by the presence of both motor and vocal tics. Compared to other tic disorders, Tourette syndrome tends to have a more pronounced impact on children, largely due to the combined effect of motor and vocal tics, the frequent coexistence of associated conditions, and the potential for significant social, emotional, and academic challenges5. However, the early diagnosis of Tourette syndrome in clinical settings remains a significant challenge. The diagnosis depends entirely on the observation and detailed documentation of the patient’s history of tics, as there are currently no blood tests or other objective methods available to confirm the condition. Given these diagnostic difficulties, the identification of reliable diagnostic models is critical to enhance the accuracy and timeliness of TS detection. Although some children, including those with TS, may still exhibit resistance to any form of blood collection, blood-based biomarkers are increasingly being investigated for their potential to predict the risk probability of TS. These biomarkers offer promise for facilitating early detection, enhancing our understanding of the disorder, and supporting the development of novel therapeutic strategies27.

Recently, machine learning methods using blood-based biomarkers offer a non-invasive, cost-effective approach for the early diagnosis and classification of neurological disorders including TS. For instance, Brügge et al. developed algorithms for automatic tic detection using video recordings, achieving F1 scores of 82.0% with a Random Forest classifier and 79.5% with a deep neural network28. Conelea et al. applied computer vision and deep learning techniques to detect eye tics in video recordings, achieving an area under the receiver operating characteristic curve of 0.89 for random split regimens29. Cernera et al. demonstrated the use of wearable sensors to capture physiological differences between tic and voluntary movements, achieving high accuracy (96.69%) in classifying movements30. While these approaches provide valuable quantitative insights, their integration into routine clinical practice and intervention studies is hindered by the requirement for specialized equipment and controlled environments, posing challenges for implementation in everyday clinical settings. Furthermore, the acquisition and annotation of real clinical data demand significant expertise and time investment, which limits the creation of large-scale datasets necessary for robust model development and validation.

In contrast to prior research on Tourette Syndrome (TS), which has predominantly concentrated on brain function and neurotransmitter differences due to its classification as a neurodevelopmental disorder, this study seeks to develop a predictive model for assessing TS risk using routine, nonspecific, low-cost, and widely accessible testing parameters. Financial considerations are a critical factor in patients’ medical decision-making processes. Research indicates that out-of-pocket expenses significantly influence choices regarding healthcare services and treatments, particularly for guardians of children with neurological disorders in China31,32. The American College of Cardiology and the American Heart Association have recognized that economic implications are inherent in medical recommendations, and disregarding costs can impose financial burdens on patients, violating the fundamental principle of "do no harm."33 Moreover, unlike specialized and costly neurotransmitter-related marker testing for TS, routine blood tests are predominantly covered by China’s two primary insurance schemes—the Urban Resident Basic Medical Insurance (URBMI) and the New Rural Cooperative Medical Scheme (NRCMS)—which provide coverage for children in urban and rural areas34. This approach has the potential to bridge coverage gaps, especially for children from migrant families and those with lower socioeconomic status. Consequently, it aligns well with the healthcare context in China, offering the dual benefit of enhancing early identification rates and diagnostic accuracy for TS while reducing diagnostic barriers and associated costs35.

In this investigation, we initially addressed multicollinearity using Spearman correlation and variance inflation factor analyses, followed by the employment of the Boruta algorithm and multivariable logistic regression analyses to select a total of 10 features, including serum 25-dihydroxyvitamin D and β2-microglobulin, as final predictors of Tourette Syndrome risk. Subsequently, we utilized nine distinct machine learning algorithms to develop the optimal predictive model based on the comparison of the ROC, PRC, DCA and Calibration curves. Additionally, radar chart analysis further indicated that GBM as the optimal predictive model. The Shapley Additive Explanations (SHAP) method, recognized as one of the most effective approaches for identifying attributions in machine learning models, was utilized to elucidate the model. Kim et al. have explored SHAP applications in neuroimaging, highlighting its promising potential in neuroscience research36. In our study, the SHAP summary dot plot and summary bar plot highlighting the pivotal roles of 25-dihydroxyvitamin D and β2-microglobulin. The SHAP dependence plots illustrated the relationship between the selected feature values and their impact on tic disorder risk. Our findings indicated that deficiencies in 25-dihydroxyvitamin D or elevated levels of β2-microglobulin may increase the risk of TS. For all the potential biomarkers selected by ML in this study, recent evidence has underscored the crucial roles of β2-microglobulin and vitamin D in various neurological disorders. Research indicates that β2-microglobulin is a component of the major histocompatibility complex class I (MHC I), which plays a key role in the immune system’s ability to distinguish “self” from "non-self"37 . Additionally, β2-microglobulin is a marker of inflammation and increased cell turnover and immune system activation during inflammation lead to elevated β2-microglobulin levels. And there is growing evidence suggesting that inflammation, particularly neuroinflammation, may play a role in the development or exacerbation of Tourette syndrome38. Regarding vitamin D, research indicates that it plays a crucial role in regulating the immune system, exhibits anti-inflammatory properties, and can influence both innate and adaptive immune responses39. Vitamin D is also involved in synthesizing neurotransmitters such as dopamine, which is implicated in TS. Several studies have reported lower vitamin D levels in children with TS compared to healthy controls40. For example, Li et al. observed significantly lower serum 25-hydroxyvitamin D levels in children with tic disorders, including TS, compared to healthy controls, and noted a negative correlation between serum vitamin D levels and tic severity. In contrast, Bond et al. reported that lower vitamin D levels were not directly associated with tic presence or severity but were related to the presence and severity of comorbid ADHD in children and adolescents with chronic tic disorders35. It is important to note that there is currently no established relationship between other biomarkers, such as uric acid and serum phosphorus levels, and TS. Uric acid is a metabolic byproduct produced during purine breakdown, and elevated levels are primarily associated with conditions such as gout, kidney disease, metabolic syndrome, cardiovascular disease, and hypertension41. Mean corpuscular volume is a measure of the average volume of red blood cells, commonly assessed through complete blood count tests42. Serum phosphorus is essential for various physiological functions, including bone health, energy production, and cellular membrane integrity; however, direct evidence linking phosphorus levels to neurological processes underlying TS is lacking43. Platelet distribution width, a measure reflecting platelet size variability, may indicate platelet activation and overall platelet health44. Previous research suggests that platelet function abnormalities could reflect broader neurochemical dysregulations in TS. Thyroid hormones, including thyroxine, play critical roles in brain development and function, and alterations in thyroid hormone levels can impact various mental disorders. Prealbumin, also known as transthyretin, primarily serves as a marker of nutritional status and liver function. Eosinophils, a type of white blood cell, are primarily involved in allergic reactions and parasitic infections. Although previous studies have indicated a higher prevalence of allergic conditions among TS patients, there remains insufficient evidence to establish eosinophil levels as a significant factor in TS pathophysiology. Thus, it is essential to emphasize that the etiology of TS is complex, involving genetic factors, brain structural variations, and neurotransmitter imbalances. Relying on speculative associations without robust evidence can be misleading. Therefore, the predictive model developed in this study should serve as a supportive tool alongside comprehensive clinical assessments, and further validation studies are warranted.

Conclusion

In conclusion, we successfully developed a machine learning (ML) model to predict the risk of Tourette syndrome based on clinical data readily accessible from real-world clinical practice. Among the 9 different machine learning algorithms evaluated in this study, the Gradient Boosting Machine model exhibited superior performance. Additionally, the SHAP method was employed to elucidate the ML model. This approach not only facilitated the determination of the importance of each feature in the model but also demonstrated how each feature influenced the model’s predictions. Our study established a predictive model for tic disorder risk using GBM, with SHAP analysis revealing that 25-dihydroxyvitamin D and β2-microglobulin emerged as the two most important features for predicting Tourette syndrome risk. This comprehensive approach provides valuable insights into the biological factors potentially underlying this neurodevelopmental disorder.

Limitation

There are several limitations to this study. First, this study was conducted at a single center in China and involved Chinese populations, so the findings may not be directly applicable to other populations or ethnicities. Second, distinguishing persistent tics characteristic of Tourette syndrome from transient developmental phenomena or other neurodevelopmental disorders can be challenging in young children; therefore, potential biases due to diagnostic uncertainty or misclassification cannot be completely ruled out. Third, although the machine learning-based prediction model was developed using a relatively large sample size of 694 participants, all features in the model are non-specific laboratory biomarkers. This could lead to false positives when applied to the general population, and external validation studies are necessary to confirm the model’s generalizability and robustness. Additionally, it must be point out that our study did not specifically investigate the participants’ history of streptococcal infection and the observed correlation between elevated β2-microglobulin levels and increased Tourette syndrome risk is reminiscent of findings in Pediatric Autoimmune Neuropsychiatric Disorders Associated with Streptococcal infections (PANDAS), which also present with tic-like movements similar to Tourette syndrome45. Furthermore, the pilot prospective observational cohort study had a relatively short follow-up period, and we did not explore the effects of different hyperparameters on model performance or adopt ensemble modeling techniques, potentially overlooking optimal configurations that could enhance the model’s accuracy and efficiency. Additional longitudinal data and extended follow-up assessments are required to provide stronger evidence supporting the predictive model’s utility in assessing tic disorder risk. Fourth, Tourette syndrome is a complex neurodevelopmental condition, and conducting machine learning models without involving neurotransmitters and functional neuroimaging markers may overlook critical biological mechanisms underlying Tourette syndrome. Further research exploring additional biomarkers could help obtain a more comprehensive understanding of its pathogenesis. Finally, before the predictive model can be broadly implemented in clinical practice, further evaluation of its clinical utility and cost-effectiveness in real-world settings is essential.