Scroll to:
Machine learning methods in differential diagnosis of ACTH-dependent hypercortisolism
https://doi.org/10.14341/probl13342
Abstract
AIM: To develop a noninvasive method of differential diagnosis of ACTH-dependent hypercortisolism, as well as to evaluate the effectiveness of an optimal algorithm for predicting the probability of ectopic ACTH syndrome (EAS) obtained using machine learning methods based on the analysis of clinical data.
MATERIALS AND METHODS: As part of a single-center, one-stage, cohort study, a retrospective prediction of the probability of EAS among patients with ACTH-dependent hypercortisolism was carried out. Patients were randomly stratified into 2 samples: training (80%) and test (20%). Eleven machine learning algorithms were used to develop predictive models: Linear Discriminant Analysis, Logistic Regression, elastic network (GLMNET), Support Vector machine (SVM Radial), k-nearest neighbors (kNN), Naive Bayes, binary decision tree (CART), C5.0 decision tree algorithms, Bagged CART, Random Forest, Gradient Boosting (Stochastic Gradient Boosting, GBM).
RESULTS: The study included 223 patients (163 women, 60 men) with ACTH-dependent hypercortisolism, of which 175 patients with Cushing’s disease (CD), 48 — with EAS. As a result of preliminary data processing and selection of the most informative signs, the final variables for the classification and prediction of EAS were selected: ACTH level at 08:00 hours, potassium level (the minimum value of potassium in the active stage of the disease), 24-h urinary free cortisol, late-night serum cortisol, late-night salivary cortisol, the largest size of pituitary adenoma according to MRI of the brain. The best predictive ability in a training sample of all trained machine learning models for all three final metrics (ROC-AUC (0.867), sensitivity (90%), specificity (56.4%)) demonstrated a model of gradient boosting (Generalized Boosted Modeling, GBM). In the test sample, the AUC, sensitivity and specificity of the model in predicting EAS were 0.920; 77.8% and 97.1%, respectively.
CONCLUSION: The prognostic model based on machine learning methods makes it possible to differentiate patients with EAS and CD based on basic clinical results and can be used as a primary screening of patients with ACTH-dependent hypercortisolism.
Keywords
For citations:
Golounina O.O., Belaya Zh.E., Voronov K.A., Solodovnikov A.G., Rozhinskaya L.Ya., Melnichenko G.A., Mokrysheva N.G., Dedov I.I. Machine learning methods in differential diagnosis of ACTH-dependent hypercortisolism. Problems of Endocrinology. 2024;70(1):18-29. https://doi.org/10.14341/probl13342
BACKGROUND
Endogenous hypercortisolism is an extremely rare and one of the most severe endocrine diseases caused by long-term body exposure to excessive levels of cortisol. To avoid the development of life-threatening complications and patient disability, timely diagnosis of this condition is necessary to establish the disease ethiology and subsequent adequate treatment [1]. ACTH-secreting pituitary adenomas (Cushing disease) account for up to 80% of nosologies leading to hypercortisolism. A much rarer type of ACTH-dependent hypercortisolism is ectopic ACTH syndrome (EAS) caused by overproduction of ACTH, or, less frequently, overproduction of corticotropin-releasing hormone (CRH) or neuroendocrine tumour (NET) of various localisations [2].
Differentiating between Cushing disease and EAS is very challenging, especially in patients without adenoma visualisation or with pituitary microadenomas less than 6 mm in diameter and/or negative results of topical NET diagnostics obtained through radiological, functional, and receptor imaging [3].
The gold standard for differential diagnosis of ACTH-dependent forms of endogenous hypercortisolism is bilateral selective blood sampling from the inferior stony sinuses with determination of the ACTH centre/periphery gradient before and after administration of the stimulating agent [4]. Since its introduction, the method has undergone significant changes and improvements to enhance diagnostic accuracy [5]. The results of a study conducted in Russia’s Endocrinology Research Centre indicate high sensitivity (98.31% (95% CI 95.16–99.43)) and specificity (94.64% (95% CI 85.39–98.16)) of selective blood sampling using desmopressin stimulation [6].
Despite the high diagnostic accuracy of the method, selective blood sampling from the inferior stony sinuses can be performed in only a handful of Russia’s medical institutions due to the technical difficulties of this procedure, high cost, high man-hour expenses and potential complications of this invasive intervention. In turn, non-invasive diagnostic tests used for differential diagnosis, such as the high-dose dexamethasone test (HDT) and peripheral corticoliberin test, do not have sufficient sensitivity and specificity [7]. To improve the differential diagnosis of ACTH-dependent forms of endogenous hypercortisolism, there is a clear need for more accessible, easy-to-use and non-invasive methods that can be used in conditions where selective blood sampling from the lower stony sinuses is impossible.
Development of digital technologies and a trend towards personalised medicine in recent years have encouraged active implementation of artificial intelligence methods to support physician’s decision-making in many areas of medicine [8]. Machine learning methods are considered to be the main tools of artificial intelligence and have been increasingly used in diagnostic and predictive studies. To enable early and differential diagnosis of ACTH-dependent hypercortisolism, an algorithm combining the analysis of simple medical history data with the results of clinical and instrumental examination can be proposed as a non-invasive screening or primary diagnosis method.
PURPOSE
To develop a non-invasive method of differential diagnosis of ACTH-dependent endogenous hypercortisolism based on data accessible in regular clinical practice and to evaluate the effectiveness of an optimal algorithm for predicting the probability of ectopic ACTH syndrome developed through machine learning methods.
MATERIALS AND METHODS
Study design
A single-centre cross-sectional cohort study with a retrospective data analysis. The study included patients observed and/or treated in Russia’s Endocrinology Research Centre from 28 October 2015 to 29 December 2022.
Inclusion criteria
Inclusion criteria: patients of both sexes with ACTH-dependent endogenous hypercortisolism confirmed as per federal clinical guidelines [7] by at least two laboratory tests: elevated 24-h urinary free cortisol and/or late-night salivary cortisol or late-night blood cortisol and/or negative small-dose (1 mg) dexamethasone test with (cut-off point: 50 nmol/L); morning ACTH concentration ≥10 pg/ml; no visualisation of pituitary adenoma on MRI or pituitary adenoma size less than 10 mm.
Exclusion criteria: patients with other verified forms of hypercortisolism; a history of bilateral adrenalectomy; taking medications for conservative treatment of hypercortisolism; therapy with prolonged-acting somatostatin analogues; women during pregnancy, childbirth, or breastfeeding; persons suffering from psychiatric diseases; patients with poorly maintained medical records; patients with missing values of the analysed indicators.
Population sampling
To build a model of differential diagnosis between Cushing disease and EAS, we used data from a sample of ACTH-dependent endogenous hypercortisolism patients selected from the general cohort of patients who underwent in-patient examination and neurosurgical treatment at the Russia’s Endocrinology Research Centre or underwent surgical treatment of NET with ectopic ACTH production in a relevant specialised medical institution in 2015–2022.
During sampling, the following inclusion criteria were applied: patients who received neurosurgical treatment for Cushing disease and achieved remission and/or histological and immunohistochemical confirmation of the diagnosis; patients without surgical treatment and without ACTH centre/periphery gradient as per selective blood sampling from the lower stony sinuses, with NET markers as per planar scintigraphy in the “whole body” scanning mode, combined with 99mTc-tectrotide SPECT/CT and/or combined positron emission and computed tomography with DOTA-conjugated radiopharmaceuticals (68Ga-DOTA-TATE, 68Ga-DOTA-TOC, 68Ga-DOTA-NOC).
Exclusion criteria: patients who received no neurosurgical or surgical treatment and had no histological verification of the diagnosis or NET markers as per radiological and radionuclide diagnostics.
Medical intervention procedures
In all patients included in the study, ACTH-dependent endogenous hypercortisolism was laboratory and clinically confirmed at the screening stage in accordance with clinical guidelines [7]. In order to differentiate between Cushing disease and EAS, 113 patients included in the study underwent HDT. A decrease in blood cortisol at 08:00 by more than 60% from baseline after 8 mg dexamethasone administration the day before was considered as a central form of endogenous hypercortisolism (Cushing disease), whereas a decrease in cortisol level by less than 60% was in favour of EAS.
All patients included in the study underwent selective blood sampling from the inferior cavernous sinuses with desmopressin stimulation (Ferring Pharmaceuticals Desmopressin Acetate 4 µg per 1 ml for intravenous, intramuscular and subcutaneous injections) at a 8 µg dose. Selective blood sampling results were evaluated based on the estimated maximum ratio of sinus ACTH to peripheral ACTH before desmopressin stimulation and after IV administration of the drug. Any ACTH centre/periphery gradient ≥2 before desmopressin stimulation and/or ≥3 thereafter was considered to be in favour of Cushing disease, whereas lower values of the ACTH centre/periphery gradient were considered as EAS [9].
Measurement equipment
Brain MRI was performed with Magnetom Harmony magnetic resonance tomograph (Siemens, Germany) with field strength from 1.5 to 3 Tesla and gadolinium contrast agent.
Hormonal ACTH measurements (reference interval: morning 7.2–63.3 pg/ml, evening 2–25.5 pg/ml), late night serum cortisol measurements (64–327 nmol/L), late night salivary cortisol (0.5–9.6 nmol/L) were conducted by electrochemiluminescent method on a Cobas 6000 Module e601 analyser (Roche); 24-h urinary free cortisol (100–379 nmol/day) was measured by immunochemiluminescent method on Vitros ECi device; potassium level (3.5–5.1 mmol/L) was measured with Architect c8000 analyser (Abbott, USA).
Modelling methods
The concept of developing and validating predictive models is presented in Figure 1.
Figure 1. The concept of developing predictive models.
Data pre-processing included removing features with near-zero variance and screening out collinear predictors. In addition, predictors were normalised and centred for those algorithms that required it. To avoid data leakage, all pre-processing and transformation was performed after splitting the dataset.
The sample split into the training set and the test set was performed using an 80/20 random stratified selection. In order to confirm the consistency of the data distribution in the training and test sets, analyses were performed by estimating the modified multivariate Mahalanobis distance (Eklavya Jain, 2022) and testing the null hypothesis of consistency of distributions (Monte Carlo test). The analysis showed a small distance of 12.7 between the training and the test sets. Accepting the null hypothesis with a p-value significance at 0.755, we concluded that this separation can be used to measure the model effectiveness.
We subjected the available data to mathematical processing to select an optimal set of indicators that would be more likely to be characteristic for Cushing disease or EAS. Moreover, we searched for parameters recommended by the software on the basis of mathematical calculations in order to exercise optimal control over the model training process. Baseline clinical features presented in Table 1 were considered as potential parameters.
Eleven machine learning algorithms were compared to develop an EAS predictive model for patients with ACTH-dependent endogenous hypercortisolism: Linear Discriminant Analysis, Logistic Regression, Elastic Network (GLMNET), Support Vector Machine (SVM Radial), k-Nearest Neighbours (kNN), Naïve Bayes, Binary Decision Tree (CART), C5.0 Decision Tree Algorithms, CART Bootstrap Aggregation (Bagged CART), Random Forest, and Stochastic Gradient Boosting (GBM). The result of the prediction is the probability of EAS in the patient, expressed as a percentage value.
Table 1. General data and main laboratory-determined parameters of the study subjects – patients with ACTH-dependent endogenous hypercortisolism
Parameter |
Cushing disease patients |
Ectopic ACTH syndrome patients |
p |
Total count |
175 |
48 |
- |
Males (%) : Females (%) |
41 (23.4%) : 134 (76.6%) |
19 (39.6%) : 29 (60.4%) |
0.029* |
Age at disease onset |
35 [ 27; 46] (11; 70) |
36.5 [ 28; 54] (16; 76) |
0.129 |
Age at inclusion in the study |
39 [ 30; 50] (18; 72) |
43 [ 31; 57] (18; 76) |
0.213 |
Body mass index (kg/m²) |
31.1 [ 26.6; 34.7] (18.3; 57.5) |
29 [ 25.3; 33.6] (21; 48.7) |
0.148 |
Laboratory-determined parameters at the time of diagnosis |
|||
Late-night blood cortisol (nmol/L) |
660.1 [ 518.2; 904] (93.2; 1,960) |
1,128.5 [ 691.1; 1,412.3] (460.2; 5,854.8) |
<0.001* |
Late-night salivary cortisol (nmol/L) |
22.5 [ 14.7; 39.3] (4.2; 436) |
60.1 [ 35.4; 117] (9.8; 711.5) |
<0.001* |
24-h urinary cortisol (nmol/day) |
1,198.4 [ 721; 2,362] (284; 15,196) |
3,460.4 [ 1,597.6; 6,994.2] (640.2; 12,332.25) |
<0.001* |
Morning ACTH (pg/ml) |
63.5 [ 45.5; 82.1] (13.4; 428.3) |
127.2 [ 94.2; 188.3] (39.9; 536.7) |
<0.001* |
Late-night ACTH (pg/ml) |
51.5 [ 33.2; 75.3] (8.1; 208.7) |
108.6 [ 77.9; 173.3] (38; 755.6) |
<0.001* |
Minimum potassium level during the disease period (mmol/L) |
4.2 [ 3.9; 4.4] (1.6; 5.1) |
3.5 [ 2.8; 4.1] (1.6; 4.9) |
<0.001* |
Small-dose (1 mg) dexamethasone test |
|||
Positive test |
1 |
0 |
0.758 |
Negative test |
173 |
48 |
|
High-dose (8 mg) dexamethasone test |
|||
Positive test |
65 |
4 |
<0.001* |
Negative test |
14 |
26 |
|
Adenoma visualisation at brain MRI |
|||
Adenoma visible |
105 |
20 |
0.031* |
Adenoma invisible |
68 |
27 |
|
Maximum size of adenoma on MRI image (mm) |
3.5 [ 0.0; 5.5] (0; 10) |
0.0 [ 0.0; 4.5] (0; 8) |
0.551 |
No MRI performed |
2 |
1 |
- |
Model training
To identify the best algorithm for predicting EAS among patients with ACTH-dependent endogenous hypercortisolism, nested cross-validation with five external and three internal cycles (3×5 Nested Cross-Validation) was used as per the scheme proposed by Isci S. et al. [10]. This approach provides a more accurate estimation of Generalisation error while optimising the hyperparameters and has shown its applicability in the subject domain.
The Boruta algorithm (Kursa and Rudnicki, 2010) was used to select predictors for model training. This algorithm is a wrapper-based marker selection method; it is tuned to find the minimum optimal marker set instead of all possible relevant markers, resulting in an unbiased and consistent selection of important predictors.
Training and performance evaluation of the machine learning algorithms were performed on a training dataset. After selecting the best algorithm, its final evaluation was performed on the test set.
Model performance assessment
Performance results of the models were compared with one another by analysing standard metrics derived from the confusion matrix: 1) Accuracy; 2) ROC-AUC (Area Under Receiver Operating Characteristic Curve; 3) Sensitivity; 4) Specificity; 5) Precision; 6) Recall; 7) F₁-measure.
The main metrics for selecting the final model were: ROC-AUC, Sensitivity, and Specificity.
Statistical analysis
Quantitative indicators were presented as median (Me) and interquartile range [ Q25-Q75], maximum and minimum values; qualitative variables were presented as absolute and relative frequencies. Ratios of qualitative features are presented as fractions (%). Comparison of two independent groups for quantitative data was performed using Student’s t-test for indicators conforming to the law of normal distribution and using Mann-Whitney test for indicators not conforming thereto. Qualitative variables were compared with each other using the chi-square test (χ2) and double-tailed Fisher’s exact test.
Data processing was performed using the IBM SPSS Statistics 23 software package (SPSS. Inc., Chicago, IL, USA).
R 4.2.3. software package was used to build and develop predictive models and to assess their performance.
Ethical expertise
The study protocol was approved by Endocrinology Research Centre’s local ethical committee on 20 February 2013, session record no. 2. Extract no. 12 from the record was made on 29 June 2022. All patients included in the study have given their informed consent to participate therein.
RESULTS
Subjects of the study
The study included 223 patients (163 females, 60 males) with ACTH-dependent endogenous hypercortisolism, of which 175 had Cushing disease and 48 had EAS. General characteristics of patients, main laboratory-determined parameters and results of comparative analysis are presented in Table 1. The patients were divided into two groups according to the final clinical diagnosis.
At the first stage of the study, we analysed possible differences in clinical-demographic and laboratory-instrumental parameters between the comparison groups (Table 1). Patients in both groups were predominantly female; the groups did not differ in age at disease onset, body mass index or the results of low-dose dexamethasone test (LDT). EAS patients had higher levels of all major hormonal parameters (p<0.001) and lower potassium values (p<0.001) in the active stage of the disease. Negative results of highdose dexamethasone test and no visible pituitary adenoma in brain MRI occurred statistically significantly more frequently in the EAS group of patients vs. Cushing disease patients (Table 1). Out of the 223 patients, three patients did not undergo MRI imaging of the pituitary gland because of a pacemaker, claustrophobia, and morbid obesity (BMI at 57.5 kg/m²).
Machine learning model for predicting the probability of EAS in patients with ACTH-dependent endogenous hypercortisolism
Initially, the total group of patients (n=223) was randomly divided into two sets at 80-20 ratio: the training set (n=179) and the test set (n=44) (Figure 1).
As a result of data pre-processing and selection of the most informative markers using random forest method and Boruta algorithm, the following were selected as final variables for EAS classification and prediction: morning ACTH level, potassium level, 24-h urinary cortisol, late-night blood cortisol, late-night salivary cortisol, and the largest size of pituitary adenoma on brain MRI. At the end of the selection procedure, each marker was assigned a value expressing its informativeness (Figure 2). The higher the value, the more valuable the marker is for prediction and classification. As a result, we obtained a list of markers that can be ranked according to their relevance to the classification and prediction (Figure 2).
Figure 2. Predictor ranging by Boruta algorithm.
Variables that turned out to be less relevant were removed from the analysis: age at disease onset, sex, BMI, LDT result, and pituitary adenoma visibility on brain MRI (Figure 2).
Table 2 presents performance parameters of predictive models. These parameters were obtained through machine learning algorithms applied to the training sample (n=179). Comparative analysis of predictive accuracy indicators enabled us to identify certain differences between the models. Thus, the quality of prediction by AUC, sensitivity and specificity metrics for the model based on the binary decision tree (CART) was 0.694; 87.3 and 48.7%, respectively, which indicates that the accuracy of the model when tested on the analysed cohort of patients is insufficient and needs to be improved. In turn, the GLMNET model provided good sensitivity (96.4%) and AUC (0.840) on the training sample yet with the lowest specificity (28.2%). The highest ROC-AUC values were obtained for the Naïve Bayes and the Gradient Boosted Modelling models (AUC of 0.856 and 0.867, respectively), which means they have a very good predictive accuracy.
Table 2. Comparing the models performance in predicting EAS in patients with ACTH-dependent hypercortisolism
Algorithm |
Area under ROC curve (AUC) |
Sensitivity (%) |
Specificity (%) |
Linear Discriminant Analysis |
0.820 |
95.7 |
35.9 |
Logistic Regression |
0.807 |
93.6 |
41.0 |
GLMNET |
0.840 |
96.4 |
28.2 |
SVM Radial |
0.824 |
91.4 |
51.3 |
kNN |
0.783 |
95.7 |
28.2 |
Naïve Bayes |
0.856 |
95.0 |
48.7 |
CART |
0.694 |
87.3 |
48.7 |
C5.0 |
0.818 |
84.9 |
61.5 |
Bagged CART |
0.808 |
85.7 |
56.4 |
Random Forest |
0.841 |
89.3 |
51.3 |
Generalised Boosted Modelling |
0.867 |
90.0 |
56.4 |
The Generalised Boosted Modelling (GBM) demonstrated the best predictive ability of all trained machine learning models in all three final metrics (ROC-AUC, sensitivity, and specificity).
The final model was trained with the following hyperparameters (Table 3).
Table 3. Generalised Boosted Modelling (GBM) final hyperparameters
n.trees |
interaction.depth |
shrinkage |
n.minobsinnode |
50 |
3 |
0.1 |
10 |
The results of testing the accuracy of classification performed by the Generalised Boosted Modelling (GBM) on the test sample (n=44) are presented in Figures 3 and 4 and Table 4. The accuracy of our model of differential diagnosis is as follows: probability of correct EAS prediction (predictive value of a positive result) is 87.5%; probability of correct EAS exclusion (predictive value of a negative result) is 94,4%. This method of differential diagnosis using machine learning algorithm enables EAS predicting with a sensitivity of 77.8% and a specificity of 97.1%.
Figure 3. Generalised Boosted Modelling (GBM) accuracy of prediction on the test sample.
Figure 4. ROC curve for the Generalised Boosted Modelling (GBM) obtained on the test sample.
Table 4. Quality metrics of the Generalised Boosted Modelling (GBM) predictive model obtained after application to the test sample
Model |
Accuracy |
F1 measure |
Sensitivity |
Specificity |
Precision |
Recall |
ROC-AUC |
GBM |
93.2% |
82.3% |
77.8% |
97.1% |
87.5% |
77.8% |
0.920 |
The significance of clinical markers determining prediction accuracy of the Generalised Boosted Modelling (GBM) model is shown in Figure 5. Six factors were found to have a dominant influence on the resulting variable, with the most significant parameters being early morning ACTH level, minimum potassium value during the active stage of the disease, 24-h urinary cortisol and late-night blood cortisol. Late-night salivary cortisol was less influential, and the contribution of the maximum size of pituitary adenoma at brain MRI scan was minimal (Figure 5).
Figure 5. Predictive significance of markers that provide decision support for the Generalised Boosted Modelling (GBM) model.
Table 5. Results of Patient S. examination
Parameter |
Value |
Morning ACTH (pg/ml) |
82.4 |
Late-night blood cortisol (nmol/L) |
653.2 |
Late-night salivary cortisol (nmol/L) |
52.7 |
24-h urinary cortisol (nmol/day) |
2,538.4 |
Potassium (mmol/L) |
3.7 |
Maximum size of pituitary adenoma (mm) |
0 |
Examples of using the developed machine learning model for differential diagnosis of ACTH-dependent endogenous hypercortisolism
Clinical example 1
Patient S., male, 38, was admitted to Neuroendocrinology and Osteopathy Department of Russia’s Endocrinology Research Centre. Results of Patient S. examination in the hospital are presented in Table 5. Brain MRI with intravenous contrasting revealed diffuse heterogeneity of the anterior pituitary structure; no convincing data for pituitary microadenoma were obtained.
Our predictive model determined the probability of EAS in this patient at 11%; i.e., in this case the model enabled us to conclude in favour of Cushing disease in Patient S.
To confirm this conclusion using our machine learning model we compared the results of histological and immunohistochemical examinations of Patient S’s pituitary tumour tissue that had been removed. Histology findings: pituitary adenoma from basophilic cells. Immunohistochemical findings: corticotropinoma (positive ACTH expression). Thus, the conclusion made on the basis of our non-invasive method of differential diagnosis based on machine learning algorithm coincided with the results of pathomorphological examination.
Clinical example 2
Patient O., female, 32, was admitted to Neuroendocrinology and Osteopathy Department of Russia’s Endocrinology Research Centre. Results of Patient O. examination in the hospital are presented in Table 6. Brain MRI with intravenous contrasting revealed a 5.5×4 mm pituitary adenoma.
Table 6. Results of Patient O. examination
Parameter |
Value |
Morning ACTH (pg/ml) |
133.9 |
Late-night blood cortisol (nmol/L) |
809 |
Late-night salivary cortisol (nmol/L) |
47.08 |
24-h urinary cortisol (nmol/day) |
5,542 |
Potassium (mmol/L) |
4.06 |
Maximum size of pituitary adenoma (mm) |
5.5 |
The model determined the probability of EAS in this patient at 62%, which required us to use additional instrumental diagnostic methods to localise the source of ectopic ACTH hyperproduction. Chest MSCT revealed a 7.5×5 mm mass in S4 of the right lung. Somatostatin-receptor scintigraphy combined with 99mTc-tectrotide SPECT/CT revealed a similar mass in S4 of the right lung with evidence of increased 99mTc-tectrotide fixation.
To confirm this conclusion, we compared the results of histological and immunohistochemical examinations of the removed lung tumour tissue. Histological findings: typical lung carcinoid. Immunohistochemical conclusion: focal expression of ACTH in the tumour cells. Thus, our machine learning model correctly classified the type of ACTH-dependent hypercortisolism in this patient, which was confirmed by the results of pathomorphological examination of the removed lung tumour tissue.
DISCUSSION
In this study, we developed Russia’s first machine learning-based model for and proposed a method of non-invasive differential diagnosis of ACTH-dependent forms of endogenous hypercortisolism (Cushing disease and EAS) which enable prediction of EAS probability in a patient with high accuracy (93.2%), sensitivity (77.8%) and specificity (97.1%). The use of our algorithm can significantly reduce the need for invasive differential diagnosis through bilateral selective blood sampling from inferior stony sinuses.
Predictive research in clinical medicine is one of the most promising and rapidly developing areas. Publications in international literature indicate growing interest for the development of predictive models for non-invasive verification of the etiology of endogenous hypercortisolism on the basis of modern machine learning technologies. Currently, active research is being undertaken to improve and increase the models predictive value. For example, machine learning models can effectively predict both early postoperative outcomes in patients with pituitary adenoma, including patients with Cushing disease [11], and long-term remission after removal of pituitary corticotropinoma [12]. However, the smallest amount of research in this area is devoted to differential diagnosis of various forms of hypercortisolism.
Isci S. et al. [10] were the first to propose a method of differential diagnosis of various forms of endogenous hypercortisolism (Cushing disease, Cushing syndrome, subclinical hypercortisolism) using machine learning methods based on data from 241 patients and including 11 predictor variables. Researchers applied various machine learning methods to create predictive models. Their results showed that the Random Forest (RF) algorithm significantly outperformed predictive abilities of other machine learning algorithms. Application of the model enabled classification of hypercortisolism syndrome with an average accuracy of 92%. A binary classification algorithm based on RF ‘one vs all’ demonstrated a sensitivity of 97.6%, accuracy of 91.1%, and specificity of 87.1% in confirming or excluding endogenous hypercortisolism in the test sample. Multiclass classification with RF showed an average sensitivity by class at 91.8%, average specificity at 97.1%, and average accuracy at 92.1% when classifying different forms of endogenous hypercortisolism in the test dataset. However, the algorithm developed by our international colleagues cannot provide differential diagnosis of Cushing disease and EAS the as the selected cohort had no patients with EAS.
The first research into the capacity of machine learning methods in the area of differential diagnosis of ACTH-dependent forms of endogenous hypercortisolism (Cushing disease and EAS) was conducted by Chinese scientists Lyu X. et al. [13][14]. Data from 311 patients, including 47 patients with EAS, were used for modelling. Based on multivariate logistic regression analysis, out of 11 variables, female sex (odds ratio (OR) 3.030), hypokalaemia (OR 0.209), ACTH level (OR 0.988), visibility of pituitary adenoma on MRI (OR 8.671), and positive HDT (OR 2.768) were identified as predictors associated with Cushing disease in patients with ACTH-dependent endogenous hypercortisolism [13]. Out of the eight models created, GBM yielded the highest values of the area under the ROC curve (AUC 0.980±0.02), but the RF model was the best for predicting Cushing disease among all machine learning models. RF algorithm sensitivity was 98.9%; its specificity was 87.9%, and its AUC was 0.976. Application of the predictive model to the training sample demonstrated high sensitivity (98.4%) and specificity (100%) of the algorithm, but in the test dataset, the sensitivity and specificity values were 95% and 71.4%, respectively. According to the results, the three most significant variables were serum potassium level, size of pituitary adenoma on MRI and ACTH values. The RF algorithm-based model had the largest area under the ROC curve (AUC 0.984 (95% CI 0.950-0.993)) compared to the AUC for LDT and HDT (p<0.001), but no statistically significant differences were found when comparing the areas of the selected model and the ‘gold standard’ method, i.e., bilateral selective blood sampling from inferior stony sinuses using a stimulation agent [14].
In contrast to the aforementioned study, in addition to early morning ACTH levels and potassium levels, we included 24-h urinary free cortisol, late-night blood cortisol, late-night salivary cortisol, and the largest size of pituitary adenoma on brain MRI into the set of universal predictors providing high predictive accuracy. EAS is believed to have a more severe course with a rapid increase in the severity of clinical symptoms and higher levels of all major hormonal parameters [15]. Decrease in blood potassium occurs in hypercortisolism of any etiology and is not a specific marker of EAS despite the fact that the frequency of hypokalaemia in ectopic ACTH production syndrome can reach 82.6% against 21% of cases in Cushing disease (p=0.001), according to the results of a comparative study by Attr B. et al. [15]. According to our data, the results of preoperative laboratory examination were statistically significantly different between patients with Cushing disease and EAS (p<0.001), which is consistent with the aforementioned study.
In our study, prediction of EAS probability was based on the analyses of simple laboratory and instrumental parameters available in routine clinical practice. The high quality of predictor variables in our study was ensured by a multistage selection procedure, which included assessment of their informativeness using automatic methods, in particular the Boruta algorithm, as well as selection of clinical features that ensure prediction accuracy. The use of this algorithm enabled ranking individual predictors in terms of their contribution to the predictive potential and determining the variables required for a high-performance model.
The results of our study are consistent with those achieved by international scholars who also find that more sophisticated machine learning methods surpass the capabilities of traditional statistical models, including models based on logistic regression. However, in contrast to previous research, in our study the best quality metrics in predicting EAS probability were observed when using a machine learning method based on gradient boosting (Generalised Boosted Modelling, GBM); therefore, this model seems preferable for use in clinical practice.
Clinical significance of the results
The model for EAS differential diagnosis and probability prediction developed in this study can be used as a piece of software (a calculator) and provides a decision support tool. The calculator may include fields where parameter values are entered (results of laboratory tests: early morning ACTH, late-night blood cortisol, late-night salivary cortisol, 24-h urinary free cortisol, potassium, and the largest size of pituitary adenoma on brain MRI). Based on the model judgement, the physician will be able to personalise the patient management algorithm, select the optimal set of therapeutic measures and, in the event the model yields a positive prediction, select additional diagnosis activities which will ultimately improve the treatment effectiveness, reduce the sick leave duration, eliminate unjustified surgical interventions and reduce the patient disability rate.
Limitations of this study
Limitations of this study include a relatively small sample size, which is largely due to the orphan character of the underlying disease (Cushing disease and EAS). A single-centre study design is also a limitation which may reduce the predictive value of the model in external validation.
Areas of further research
Areas of further research could include the expansion of the training sample by involving data from other medical institutions of the Russian Federation, which will contribute to improving the model accuracy.
CONCLUSION
A machine learning predictive model enables differentiating patients with EAS and Cushing disease based on baseline clinical parameters and can be used as a primary screening of patients with ACTH-dependent endogenous hypercortisolism. The results obtained justify the use of machine learning methods in the area of medical prognosis, mark a step towards the development of personalised treatment methods and can serve as a basis for the implementation of medical decision support systems in patients with ACTH-dependent hypercortisolism.
ADDITIONAL INFORMATION
Source of funding. This study was funded by the Russian Science Foundation (Grant 9-15-00398-П).
Conflict of interest. The authors hereby declare they have had no apparent or potential conflict of interest related to the content of this publication.
References
1. Belaya ZE, Rozhinskaya LY, Dragunova NV, et al. Metabolic complications of endogenous Cushing: patient selection for screening. Obes Metabol. 2013; 10(1): 26–31. (In Russ). doi: https://doi.org/10.14341/2071-8713-5068
2. Golounina OO, Belaya ZE, Rozhinskaya LYa, et al. Clinical and laboratory characteristics and results of treatment of patients with ACTH-producing neuroendocrine tumors of various localization. Therapeutic Archive. 2021; 93(10):1171–1178. (In Russ.). doi: https://doi.org/10.26442/00403660.2021.10.201102
3. Golounina OO, Slashchuk KY, Khairieva AV, et al. X-ray and radionuclide imaging in the diagnosis of ACTH-producing neuroendocrine tumors. Medical radiology and radiation safety. 2022; 67(4): 80–88. (In Russ). doi: https://doi.org/10.33266/1024-6177-2022-67-4-80-88
4. Sitkin II, Belaya ZhE, Rozhinskaya LYa, et al. Simultaneous bilateral inferior petrosal sinus blood sampling after desmopressin stimulation in the differential diagnosis of ACTH-dependent Cushing’s syndrome. Diagnostic and Interventional Radiology 2013; 7(3): 57–68. (In Russ). doi: https://doi.org/10.25512/DIR.2013.07.3.06
5. Belaya ZE., Rozhinskaya LYa., Melnichenko GA., et al. The role of prolactin gradient and normalized ACTH/prolactin ratio in the improvement of sensitivity and specificity of selective blood sampling from inferior petrosal sinuses for differential diagnostics of ACTH-dependent hypercorticism. Probl Endokrinol (Mosk) 2013; 59(4): 3–10. (In Russ). doi: https://doi.org/10.14341/probl20135943-10
6. Belaya ZE, Golounina OO, Sitkin II, et al. Diagnostic value of bilateral inferior petrosal sinus sampling in various modifications and methods of radiation and radionuclide imaging in the diagnosis and differential diagnosis of ACTH-dependent endogenous hypercortisolism. Problems of Endocrinology. 2023;69(6). (In Russ) doi: https://doi.org/10.14341/probl13299
7. Melnichenko GA, Dedov II, Belaya ZE, et al. Cushing’s disease: the clinical features, diagnostics, differential diagnostics, and methods of treatment. Probl Endokrinol (Mosk) 2015; 61(2): 55–77. (In Russ) doi: https://doi.org/10.14341/probl201561255-77
8. Petraikin AV, Belaya ZhE, Kiseleva AN, et al. Artificial intelligence for diagnosis of vertebral compression fractures using a morphometric analysis model, based on convolutional neural networks. Problems of Endocrinology. 2020; 66(5):48–60. (In Russ). doi: https://doi.org/1014341/probl12605
9. Dedov II, Belaya ZE, Sitkin II, et al. Significance of the method of selective blood collection from the inferior petrosal sinuses for differential diagnosis of ACTH-dependent hypercorticism. Probl Endokrinol (Mosk) 2009; 55(6): 35–40. (In Russ). doi: https://doi.org/10.14341/probl200955635-40
10. Isci S, Kalender DSY, Bayraktar F, Yaman A. Machine Learning Models for Classification of Cushing’s Syndrome Using Retrospective Data. IEEE J Biomed Health Inform 2021; 25(8): 3153–3162. doi: https://doi.org/10.1109/JBHI.2021.3054592
11. Hollon TC, Parikh A, Pandian B, et al. A machine learning approach to predict early outcomes after pituitary adenoma surgery. Neurosurgical Focus. 2018; 45(5): E8. doi: https://doi.org/10.3171/2018.8.FOCUS18268
12. Fan Y, Li Y, Bao X, et al. Development of Machine Learning Models for Predicting Postoperative Delayed Remission in Patients With Cushing’s Disease. The Journal of Clinical Endocrinology & Metabolism. 2021; 106(1): 217–231. doi: https://doi.org/10.1210/clinem/dgaa698
13. Lyu X, Zhang D, Pan H, et al. A noninvasive scoring model for the differential diagnosis of ACTH-dependent Cushing’s syndrome: a retrospective analysis of 311 patients based on easy-to-use parameters. Endocrine. 2022; 78(1): 114–122. doi: https://doi.org/10.1007/s12020-022-03081-0
14. Lyu X, Zhang D, Pan H, et al. Machine learning models for differential diagnosis of Cushing’s disease and ectopic ACTH secretion syndrome. Endocrine. 2023; 80(3): 639–646. doi: https://doi.org/10.1007/s12020-023-03341-7
15. Attri B, Goyal A, Kalaivani M, et al. Clinical profile and treatment outcomes of patients with ectopic ACTH syndrome compared to Cushing disease: a single-center experience. Endocrine. 2023; 80(2): 408–418. doi: https://doi.org/10.1007/s12020-022-03298-z
About the Authors
O. O. GolouninaRussian Federation
Olga O. Golounina - MD.
Moscow
Competing Interests:
None
Zh. E. Belaya
Russian Federation
Zhanna E. Belaya - MD, PhD, Professor.
Moscow
Competing Interests:
None
K. A. Voronov
Russian Federation
Kirill A. Voronov.
Yekaterinburg
Competing Interests:
None
A. G. Solodovnikov
Russian Federation
Alexander G. Solodovnikov - MD.
Yekaterinburg
Competing Interests:
None
L. Ya. Rozhinskaya
Russian Federation
Liudmila Ya. Rozhinskaya - MD, PhD, Professor.
Moscow
Competing Interests:
None
G. A. Melnichenko
Russian Federation
Galina A. Melnichenko - MD, PhD, Professor, Academician of the Russian Academy of Sciences.
Moscow
Competing Interests:
None
N. G. Mokrysheva
Russian Federation
Natalia G. Mokrysheva - MD, PhD, Professor.
Moscow
Competing Interests:
None
I. I. Dedov
Russian Federation
Ivan I. Dedov - MD, PhD, Professor, Academician of the Russian Academy of Sciences.
Moscow
Competing Interests:
None
Supplementary files
|
1. Figure 1. Concept for developing predictive models. | |
Subject | ||
Type | Исследовательские инструменты | |
View
(404KB)
|
Indexing metadata ▾ |
|
2. Figure 2. Ranking of predictors using the Boruta algorithm. | |
Subject | ||
Type | Исследовательские инструменты | |
View
(161KB)
|
Indexing metadata ▾ |
|
3. Figure 3. Prediction accuracy of the Generalized Boosted Modeling (GBM) on the test set | |
Subject | ||
Type | Исследовательские инструменты | |
View
(92KB)
|
Indexing metadata ▾ |
|
4. Figure 4. ROC curve for the Generalized Boosted Modeling (GBM) model obtained on the test set | |
Subject | Рисунок 4. ROC-кривая для модели градиентного бустинга (Generalized Boosted Modeling, GBM), полученная на тестовой выборке | |
Type | Исследовательские инструменты | |
View
(116KB)
|
Indexing metadata ▾ |
|
5. Figure 5. Predictive importance of decision features for the Generalized Boosted Modeling (GBM) model | |
Subject | ||
Type | Исследовательские инструменты | |
View
(158KB)
|
Indexing metadata ▾ |
Review
For citations:
Golounina O.O., Belaya Zh.E., Voronov K.A., Solodovnikov A.G., Rozhinskaya L.Ya., Melnichenko G.A., Mokrysheva N.G., Dedov I.I. Machine learning methods in differential diagnosis of ACTH-dependent hypercortisolism. Problems of Endocrinology. 2024;70(1):18-29. https://doi.org/10.14341/probl13342

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License (CC BY-NC-ND 4.0).