Skip to main content

18F-FDG-PET/CT-based machine learning model evaluates indeterminate adrenal nodules in patients with extra-adrenal malignancies

Abstract

Background

To assess the value of an 18F-FDG-positron emission tomography/computed tomography (PET/CT)-based machine learning model for distinguishing between adrenal benign nodules (ABNs) and adrenal metastases (AMs) in patients with indeterminate adrenal nodules and extra-adrenal malignancies.

Methods

A total of 303 patients who underwent 18F-FDG-PET/CT with indeterminate adrenal nodules and extra-adrenal malignancies from March 2015 to June 2021 were included in this retrospective study (training dataset (n = 182): AMs (n = 97), ABNs (n = 85); testing dataset (n = 121): AMs (n = 68), ABNs (n = 55)). The clinical and PET/CT imaging features of the two groups were analyzed. The predictive model and simplified scoring system for distinguishing between AMs and ABNs were built based on clinical and PET/CT risk factors using multivariable logistic regression in the training cohort. The performances of the predictive model and simplified scoring system in both the training and testing cohorts were evaluated by the areas under the receiver operating characteristic curves (AUCs) and calibration curves. The comparison of AUCs was evaluated by the DeLong test.

Results

The predictive model included four risk factors: sex, the ratio of the maximum standardized uptake value (SUVmax) of adrenal lesions to the mean liver standardized uptake value, the value on unenhanced CT (CTU), and the clinical stage of extra-adrenal malignancies. The model achieved an AUC of 0.936 with a specificity, sensitivity and accuracy of 0.918, 0.835, and 0.874 in the training dataset, respectively, while it yielded an AUC of 0.931 with a specificity, sensitivity, and accuracy of 1.00, 0.735, and 0.851 in the testing dataset, respectively. The simplified scoring system had comparable diagnostic value to the predictive model in both the training (AUC 0.938, sensitivity: 0.825, specificity 0.953, accuracy 0.885; P = 0.5733) and testing (AUC 0.931, sensitivity 0.735, specificity 1.000, accuracy 0.851; P = 1.00) datasets.

Conclusions

Our study showed the potential ability of a machine learning model and a simplified scoring system based on clinical and 18F-FDG-PET/CT imaging features to predict AMs in patients with indeterminate adrenal nodules and extra-adrenal malignancies. The simplified scoring system is simple, convenient, and easy to popularize.

Background

The adrenal gland is the fourth most common metastatic site in patients with extra-adrenal cancer, primarily lung cancer (39%) and breast cancer (35%) [1, 2]. Approximately 30% to 70% of adrenal masses incidentally found in cancer patients are metastases [3]. The most common malignant tumour of the adrenal gland is adrenal metastasis (AM) [3]. However, not all adrenal masses can be assumed to represent metastases, and adrenal benign lesions are also not uncommon in cancer patients. Hammarstedt et al. discovered that up to 74% of patients with extra-adrenal malignancy had adrenal benign lesions [4]. Therefore, accurate differential diagnosis of adrenal lesions in cancer patients during follow-up or staging is very important to guide treatment and predict prognosis.

Accurate differential diagnosis of adrenal masses can mostly be performed in patients with extra-adrenal malignancy based on non-invasive imaging techniques, such as magnetic resonance imaging (MRI) and computed tomography (CT). Adrenal washout CT and the value on unenhanced CT images (CTU) have shown perfect ability to differentiate benign from malignant lesions [5,6,7,8,9]. However, the shortcomings of adrenal washout CT, such as a 15-min delayed scanning, lack of sensitivity and additional radiation hazards [10], should not be ignored. In addition, the CTU and adrenal wash-out CT of some benign lesions (hyper-attenuating lesions: CTU ≥ 10 HU) were similar to those of AMs, leading to a misdiagnosis [11, 12]. MRI, especially chemical-shift MRI, has been proven to be the most sensitive examination, but the signal intensity of benign and malignant lesions overlaps considerably [13, 14]. Meanwhile, previous studies have demonstrated that a long diameter (LD) > 3 cm of an adrenal lesion is highly specific for malignancy [15]. Therefore, when patients have unilateral hyper-attenuating (CTU ≥ 10 HU) nodules (LD ≤ 3 cm) based on CT and MRI imaging techniques, it is a challenge to immediately and accurately identify AMs from adrenal benign nodules (ABNs) without additional examinations in cancer patients, and, then, a biopsy may be needed. Although biopsy remains the gold standard for confirmation of the nature of the masses, it is invasive and difficult to perform and thus frequently leads to complications and study failure [16].

Radiomics, as an advanced image analysis technology, has good differential diagnostic ability for adrenal lesions, especially malignant and benign tumors [17, 18]. However, due to the uncertainty of its reliability and the need for computational expertise, radiomics is not widely used in clinical practice. Therefore, it is necessary to explore a non-invasive and simple imaging method for effectively differentiating AMs from ABNs in cancer patients waiting for treatment.

Previous studies have demonstrated that 18F‐FDG‐PET/CT is extremely predictive and sensitive for differentiating adrenal tumours found in routine MRI or CT examinations in patients with or without a cancer history [19,20,21,22]. In addition, PET/CT could evaluate the primary lesions and metastases at the same time, so it may be the first and most cost-effective method to characterize adrenal tumours, especially in cancer patients. However, most of the previous studies only explored the value of individual radiological parameters in the differential diagnosis of adrenal tumors [20,21,22]. In addition, the clinical stages and types of primary cancers have not been comprehensively analyzed. Most importantly, studies on “indeterminate adrenal lesions” based on PET/CT are still rare. Comprehensive differential diagnostic criteria for “indeterminate adrenal lesions” based on machine learning and 18F-FDG-PET/CT is needed. Therefore, the purpose of the present study was to assess the accuracy of the predictive model and simplified scoring system based on 18F-FDG-PET/CT for differentiating AMs from ABNs in cancer patients with indeterminate adrenal nodules.

Methods

Patients

The Tianjin Medical University Cancer Hospital Institutional Ethics Committee approved our retrospective study. Patients from March 2015 to June 2021 who met the following criteria were enrolled: (1) patients with extra-adrenal malignancy confirmed by histopathology before performing 18F-FDG PET/CT examination; (2) complete PET/CT images and clinical information; and (3) indeterminate adrenal lesions: unilateral hyper-attenuating (CTU ≥ 10 HU) adrenal tumors (1 cm ≤ LD ≤ 3 cm). The reasons for the use of 1 cm as the cut-off for LD were as follows: (a) providing sufficient lesion volume for reliable quantitative measurement technology and (b) increasing confidence in the existence of true focal adrenal lesions. Finally, 165 patients met the eligibility criteria for diagnosing metastases if there is histologic confirmation (n = 15), if a new adrenal lesion had developed (n = 88) or if the size had increased or decreased after treatment in a short period of time (n = 62) [23]. A total of 138 patients met the eligibility criteria for diagnosing benign nodules: histopathological assessment (n = 59) or no change in size during at least 1 year of follow-up (n = 79). Because the patients in our study comprised a whole dataset, to test the predictive model, we randomly assigned the dataset to a training cohort (n = 182; 85 ABNs, 97 AMs) and a testing cohort (n = 121; 53 ABNs, 68 AMs) at a ratio of 6:4 (Fig. 1). There was no intersection between the training cohort and testing cohort. Input variables were selected, parameters were adjusted, and the model was fitted on the training cohort. The generalization ability of the model was evaluated on the testing cohort. As the testing cohort is unknown to the model, the accuracy of the evaluation is reliable. Age, sex, and the types and clinical stage of primary cancer were analyzed.

Fig. 1
figure 1

The process of dataset establishment, short time: within 6 months

18F-FDG PET/CT procedure

The patients fasted for approximately 6 h and had a blood glucose level < 11 mmol/L before undergoing the examination. A Discovery Elite PET/CT scanner (GE healthcare) was used to acquire images approximately 60 min after intravenous injection of 18F-FDG (4.2 MBq/kg). Unenhanced CT images were first acquired with 120 kVp, 80 mAs, and a slice thickness of 5 mm from the top of the skull to the middle femur during tidal breathing, and then full-ring dedicated PET images were obtained from the middle thigh to the top of the head during shallow breathing. PET/CT images were reconstructed using the ordered-subset expectation maximization algorithms and CT scans for attenuation correction.

FDG PET/CT image analysis

A radiologist who had 6 years of PET/CT diagnostic experience and did not know the pathological and clinical information interpreted the CT and PET images. LD, short diameter (SD), left or right and CTU of the adrenal nodules were assessed on the image of the maximum axial area of the tumour. The region of interest (ROI) should contain two-thirds of the largest transverse section of the lesions, and adjacent fat should be avoided when manually measuring CTU. In addition, calcification, hemorrhagic components, and cystic degeneration or necrosis were excluded from the ROI measurements. The maximum standardized uptake value (SUVmax) for each adrenal nodule was recorded by manually drawing a circular oval ROI that included the tumour as much as possible on the axial PET image and while paying attention to avoid adjacent FDG-avid structures. Moreover, the average spleen and liver standardized uptake value (SUV) for each patient was recorded by manually drawing oval ROIs including the spleen or the right lobe of the liver as much as possible on the axial PET images. Then, the ratio of the adrenal lesion SUVmax to the average spleen SUV (SUV/spleen) and the ratio of adrenal lesion SUVmax to the average liver SUV (SUV/liver) were calculated (Fig. 2).

Fig. 2
figure 2

The overall workflow of the development and validation of the predictive model. First, the CT, PET, and clinical features were extracted from the training dataset, and then the predictive model was developed based on significant factors by logistic regression. Second, ROC and calibration curves were used to examine the performance of the nomogram both in the training and testing datasets. Third, a simplified scoring system was built based on the regression coefficients acquired from the training dataset for every individual feature in the predictive model and then the performance of this simplified scoring system was evaluated in both the training and testing datasets. Last, the simplified scoring tables were presented in both the training and testing datasets

Predictive modelling and statistical analysis

R (version 4.1.2) was used to perform all statistical analyses. Differences in quantitative characteristics such as SD, LD, age, CTU, SUVmax, SUV/spleen, and SUV/liver were analyzed using the Wilcoxon rank-sum test. For qualitative characteristics such as sex, location, type and clinical stage of primary cancer, the chi-square test was used to analyze whether there was a significant difference between AMs and ABNs. The predictive model was developed based on the above significant factors by logistic regression, which was obtained using the function “lrm” (in the “rms” package). The area under the receiver operating characteristic (ROC) curves (AUCs) of risk factors and the predictive model and simplified scoring system were obtained using the function “reportROC” (in package “reportROC”). The function “nomogram” (in package “rms”) was used to draw the nomograms. The comparison between AUCs was evaluated by the DeLong test in MedCalc. Calibration curves were used to examine the performance characteristics of the nomogram and the simplified scoring system in detail (Fig. 2). values < 0.05 were considered statistically significant.

Results

Patient and clinical characteristics

The common extra-adrenal malignancies in the AM group were comprised of lung (72.73%), liver (5.45%), breast (4.24%), and kidney (4.24%) cancers and lymphoma (2.42%) malignancies. The extra-adrenal malignancies in the ABN group included malignancies in the lung (50.0%), gynaecological malignancies (10.87%) and breast cancers (10.14%), thyroid carcinoma (7.24%), and lymphoma (5.07%) (Table 1).

Table 1 The types of primary cancer in patients with AMs and ABNs

The demographic details of the AM and ABN patients are presented in Table 2. There were no significant differences in any of the variables (sex, SD, LD, age, location, CTU, SUVmax, SUV/liver, SUV/spleen, and the types and clinical stage of primary cancer) (Table 2, P > 0.05) between the training and testing datasets, showing the rationality of randomly grouping the total data. Patients with ABNs had a higher female percentage than patients with AMs in both the training (60.0% vs. 28.9%, p < 0.001) and testing (62.3% vs. 41.2%, p = 0.034) datasets. The AM group had a higher proportion of patients with clinical stage III/IV extra-adrenal cancers, while the ABN group had a lower proportion of those with clinical stage III/IV cancers in both the training (p < 0.001) and testing (p = 0.007) datasets. For the primary cancer types, the AM group had a higher primary lung cancer: other cancer ratio than the ABN group in both the training (2.88 vs. 0.848, p < 0.001) and testing (3.25 vs. 1.208, p = 0.012) datasets.

Table 2 Clinical and PET/CT characteristics of the patients in the training and testing cohorts

Comparison of PET/CT imaging features

The mean CTU of the ABN group was significantly lower than that of the AM group in both the training (25 vs. 37, p < 0.001) and testing (25 vs. 35, p < 0.001) datasets. The mean LDs of the ABN group were 1.5 cm and 1.5 cm in the training and testing datasets, respectively, which were significantly smaller than those of the AM group in both the training (1.7 cm, p < 0.001) and testing (1.8 cm, p = 0.004) datasets. The SUVmax, SUV/liver, and SUV/spleen of AMs were all significantly higher than those of ABNs in both the training and testing datasets (all p < 0.001) (Table 2).

Machine-learning models

Since strong correlations were observed among SUVmax, SUV/liver, and SUV/spleen, only the SUV/liver feature was selected to reduce redundancies (Fig. 3). Six clinical and imaging characteristics showed significant differences between the AM and ABN groups: sex, LD, CTU, SUV/liver, and the types and clinical stage of primary cancer were used to develop a model by multivariate logistic regression. Ultimately, the predictive model revealed that sex, CTU, SUV/liver, and clinical stage of primary cancer were risk factors for AMs (Table 3). Adrenal nodules in male patients with CTU ≥ 32.5 HU, SUV/liver ≥ 1.493, and clinical stage III/IV primary cancers tended to be AMs. The predictive model achieved an AUC of 0.936 [95% CI 0.904–0.969] with a specificity, sensitivity and accuracy of 0.918, 0.835, and 0.874 in the training dataset, respectively, while yielding an AUC of 0.931 [95% CI 0.889–0.973] with a specificity, sensitivity, and accuracy of 1.000, 0.735, and 0.851 in the testing dataset, respectively (Fig. 4A, B). The AUC of the predictive model was significantly higher than that of any individual feature (all P < 0.05, see Fig. 5 and Table 4).

Fig. 3
figure 3

Strong correlations were observed among SUVmax, SUV/liver, and SUV/spleen

Table 3 Individual variables obtained from the ROC analysis for the differentiation of AMs from ABNs
Fig. 4
figure 4

A The AUC of the predictive model was 0.936 [95% CI 0.904–0.969], with a sensitivity, specificity and accuracy of 0.835, 0.918, and 0.874 in the training dataset, respectively. B The AUC of the predictive model was 0.931 [95% CI 0.889–0.973], with a sensitivity, specificity, and accuracy of 0.735, 1.000, and 0.851 in the testing dataset, respectively

Fig. 5
figure 5

The AUC of the predictive model was higher than that of any feature alone

Table 4 The comparison of the AUCs between the predictive model and individual risk factors

The related nomogram revealed that more than 47.615 could be considered metastases, yielding an AUC of 0.936 with a specificity, sensitivity, and accuracy of 91.8%, 83.5%, and 87.4%, respectively (Fig. 6). Good calibrations were shown in both the training and testing datasets (Fig. 7A, B).

Fig. 6
figure 6

Nomogram of the predictive model

Fig. 7
figure 7

Good calibrations of the predictive model were shown in both the training (A) and testing datasets (B)

The regression coefficients acquired from the training dataset for every individual feature in the predictive model were used to build a scoring system to facilitate the use of the model as follows:

$$\mathrm{Scoring system}=0.878\times \mathrm{Gender}+0.070\times \mathrm{CTU}+2.73\times \mathrm{SUV}\text{/liver}+2.71\times \text{Clinical staging}$$

To make it more convenient to use, we simplified the scoring system as follows:

$$\mathrm{Simplified scoring system}=1\times \mathrm{Gender}+0.1\times \mathrm{CTU}+3\times \mathrm{SUV}\text{/liver}+3\times \text{Clinical staging}$$

The simplified regression coefficients did not affect the accuracy of differential diagnosis of the score by ROC analyses (comparations of the AUCs between the original scoring system and the simplified scoring system were not significantly different by the DeLong test in the training dataset (P = 0.573) and testing dataset (P = 1.000), Fig. 8A, B). Good calibrations of the simplified scoring system were displayed in the training dataset and testing dataset (Fig. 9A, B).

Fig. 8
figure 8

The comparison of the AUCs between the predictive model and the simplified scoring system were not significantly different in the training dataset (A) and testing dataset (B)

Fig. 9
figure 9

Good calibrations of the simplified scoring system were shown in both the training (A) and testing datasets (B)

Therefore, the simplified score was obtained simply by adding 1 if the patient was male, 0 if the patient was female, 10% of the CTU and 3 times the SUV/liver and 3 if the clinical stage was III/IV. The simplified score for the patients in the training dataset ranged from 4.11 to 24.54 points. The optimal cut-off value was a score of 10.5 with a specificity and sensitivity of 95.3% and 82.5%, respectively (Table 5). This indicated that 82.5% of the AM scores were ≥ 10.5 and that 17.5% of AMs would be missed at this cut-off value. Lower cut-off values may be used to reduce the proportion of missed diagnoses. The rate of missed diagnosis could be as low as 2.1% at the cost of a low precision (69.3%) when the cut-off was set at 7.5. Overall, the higher the score obtained by this simplified scoring system, the greater the risk of the lesion being predicted as metastasis. The simplified scores for patients in the testing dataset were shown in Table 6.

Table 5 Cut-off values and corresponding performance data for the simplified scoring system in the training dataset
Table 6 Cut-off values and corresponding performance data for the simplified scoring system in the testing dataset

Next, patients with histologic confirmation (AMs 15, ABNs 59), as a new dataset, were further analyzed. The simplified scores for the patients with histologic confirmation ranged from 3.63 to 21.53 points. When using the optimal cut-off value of 10.5, the specificity and sensitivity of the simplified scoring model were 96.6% and 86.7%, respectively, and accuracy was up to 94.6%, which indicates that compared with the original training and testing datasets, the simplified scoring system still has good diagnostic performance in the confirmed dataset (Table 7).

Table 7 Cut-off values and corresponding performance data for the simplified scoring system in the confirmed dataset

Discussion

For patients with extra-adrenal malignancies, AMs and ABNs are both common tumors. If adrenal lesions are found during cancer follow‐up or staging in cancer patients, an accurate diagnosis is extremely important for planning the treatment. Recently, a few studies on adrenal masses have shown the potential of textural features and radiomics for differentiating AMs from ABNs [17, 18]. However, radiomics has not been extensively applied in clinical practice owing to time-consuming computation and analysis for high-dimensional characteristics. Therefore, the work-up of adrenal lesions, especially indeterminate adrenal nodules, mainly depends on the traditional imaging parameters estimated by the human eye [24,25,26,27]. In this study, our predictive model developed with the clinical manifestations (clinical stage of primary malignancies and sex) and traditional PET/CT imaging features (CTU and SUV/liver of adrenal nodules) had robust performance in effectively distinguishing the two groups. Meanwhile, the simplified scoring system had comparable diagnostic value to the predictive model (or nomogram) and great foregrounds in clinical practice due to its simplicity and convenience.

Sex and clinical stage of primary malignancies were independent clinical factors for differentiating AMs from ABNs in our study. AMs were more likely to occur in men in our study, which was consistent with the results of Chen et al. [28], who found that the proportion of female patients with AMs was significantly lower than that with ABNs. There may be two primary reasons for the difference in the sex ratio between AMs and ABNs. First, lung cancer, accounting for the highest ratio of AMs, is most likely to occur in men [29]. Second, the incidence of gynecological and breast and thyroid cancers accounting for the highest ratio of ABNs was notably high among women [30, 31]. Regarding the clinical stage of primary cancers, ABNs were more likely to be in the early stage (I/II), while AMs were more likely to be in the late period (III/IV), which was in accordance with previous reports [32, 33].

In our study, the SUVmax, SUV/liver, and SUV/spleen were all higher for AMs than for ABNs in both the training and testing datasets (P were all < 0.001). Since strong correlations were observed among SUVmax, SUV/liver, and SUV/spleen, we chose SUV/liver as a risk factor for reducing redundancy according to a previous study [34]. The AUC, sensitivity, and specificity of SUV/liver in the differentiation of AMs from ABNs in this study were 0.851, 83.0%, and 72.2%, respectively, which were all lower than those in previous studies. Watanabe et al. found that the AUC of SUV/liver was 0.99 with a sensitivity of 96% and a specificity of 100% in the differential diagnosis of adrenal metastases and adrenal adenomas [34]. Boland et al. reported that SUV/liver had 100% specificity and sensitivity for differentiating adrenal lesions as malignant or benign in cancer patients based on PET images [35]. In addition, the best cut-off value of SUV/liver was 1.493 in this study, which was higher than that of Watanabe et al. (1.37) [34] and Launay et al. (1.33) [20]. The reason for these may be due to our special research object: indeterminate adrenal nodules, and differential diagnosis of indeterminate adrenal nodules has always been a difficult point in daily imaging and clinical practice. At present, there are few studies based on PET/CT about this special research object. In our study, the CTU of AMs was significantly higher than that of ABNs based on PET/CT, which was consistent with a previous study [20, 21]. Kunikowska et al. [21] showed that malignant adrenal tumours had a significantly higher mean CT attenuation value than benign tumours on PET/CT. Moreover, a CTU > 32.5 HU was an independent predictor of AMs with a sensitivity of 78.8% and a specificity of 74.2% in our study.

Most previous reports only focused on individual PET/CT imaging features to distinguish AMs and ABNs [20,21,22]. Our study was unique because of the comprehensive analysis of traditional clinical and PET/CT imaging features focused on indeterminate adrenal nodules. We found that the AUC for the predictive model combining the clinical stage of extra-adrenal cancers, sex, CTU, and SUV/liver of adrenal nodules reached 0.936, with a specificity, sensitivity, and accuracy of 91.8%, 83.5%, and 87.4% in the training dataset, respectively, while yielding an AUC of 0.931 and a sensitivity, specificity, and accuracy of 73.5%, 100%, and 85.1% in the testing dataset, respectively. Although these traditional clinical and PET/CT imaging features have relatively lower diagnostic accuracy and are not specific for metastases when used alone, the ability of differential diagnosis was significantly improved when these characteristics were combined in our study. Meanwhile, the simplified scoring system showed a good ability to differentiate AMs from ABNs with AUCs of 0.938 and 0.931, respectively, in the training and testing datasets, which had comparable diagnostic value to the predictive model (or nomogram), and the comparison of AUCs between the predictive model (or nomogram) and the simplified scoring system was not significantly different in either the training or testing datasets. When using the nomogram to predict the risk of metastases, we needed to acquire the points of each feature using a naked eye comparison, which would easily lead to an inaccurate total score and an inaccurate ultimate predictive percentage. We could directly gain the score of every feature without visual comparison in the simplified scoring system. Hence, the simplified scoring system has great foregrounds in clinical practice due to its simplicity and convenience.

We believe this may be a valuable predictive model for diagnosing AMs. Therefore, biopsy could be avoided in some cases.

But there is a crucial point to note when using this model. In clinical practice, our proposed model may provide management recommendations for patients with indeterminate adrenal nodules and extra-adrenal malignancies. Generally, a higher score obtained from this simplified scoring system suggests an increased likelihood of the lesion being predicted as a metastasis. However, increasing the cut-off value reduces sensitivity, potentially leading to fail to detect metastases. This may result in inaccurate clinical staging of the primary cancer and impact the formulation of precise treatment plans. On the other hand, lowering the cut-off point reduces the rate of missed diagnoses but lowers specificity, which increases the risk of falsely detecting ABN as AM. This could result in unnecessary examinations and psychological pressure for patients. Therefore, we recommend engaging in careful communication with patients before establishing treatment protocols, as the choice of the cut-off value should depend on the level of risk both patients and doctors are willing to accept. Ultimately, the simplified scoring system serves as a convenient and valuable tool, providing evidence for doctors and patients.

There were several limitations. Firstly, features such as CTU and SUVmax may be different among radiologists since they were manually extracted. Experienced radiologists may have more accurate measurement results than junior radiologists. Measuring such features through computer algorithms may improve the stability and reproducibility of models and features. Secondly, since only a small number of patients underwent pathological confirmation, the majority of adrenal nodules in our study were diagnosed based on follow-up imaging. However, this situation reflects current practices. Meanwhile, based on the confirmed dataset, we further validated the good diagnostic performance of our predictive model. Thirdly, in our study, thick-slice PET and CT images were utilized, and contrast-enhanced CT was not included as part of the PET/CT scanning procedure. It is important to note that previous studies have demonstrated a notable improvement in diagnostic accuracy when PET data is combined with contrast-enhanced CT data [36]. Therefore, conducting a subsequent study incorporating thin-slice enhanced CT scans may potentially further enhance the overall performance of this model. Fourthly, this study was retrospective and single-center with a relatively small sample size, which only included the Asian population, and the coefficients may only apply to one PET/CT system with one acquisition method and one reconstruction method. Further multi-center studies with multiple races and large sample sizes and different PET/CT systems may improve the performance of this predictive model to a certain extent.

Conclusions

In summary, the predictive model and simplified scoring system based on traditional clinical and PET/CT imaging features showed good diagnostic performance for differentiating AMs from ABNs in cancer patients with indeterminate adrenal nodules and thus assisted clinicians in pre-treatment decision-making.

Availability of data and materials

The data sets analyzed during the current study are not publicly available for patient privacy purposes but are available from the corresponding author on reasonable request.

Abbreviations

PET/CT:

Positron emission tomography/computed tomography

AMs:

Adrenal metastases

ABNs:

Adrenal benign nodules

SUVmax:

The maximum standardized uptake value

SUV:

Standardized uptake value

AUC:

Area under the receiver operating characteristic curve

LD:

Long diameter

SD:

Short diameter

CI:

Confidence interval

ROI:

Region of interest

HU:

Hounsfield units

CTU:

The value on unenhanced CT

ROC:

Receiver operating characteristic curve

MRI:

Magnetic resonance imaging

SUV/liver:

The ratio of the adrenal SUVmax to the mean liver SUV

SUV/spleen:

The ratio of the adrenal SUVmax to the mean spleen SUV

References

  1. Almeida MQ, Bezerra-Neto JE, Mendonça BB, Latronico AC, Fragoso MCBV. Primary malignant tumors of the adrenal glands. Clinics (Sao Paulo) .2018;https://doi.org/10.6061/clinics/2018/e756s.

  2. Klikovits T, Lohinai Z, Fábián K, Gyulai M, Szilasi M, Varga J, et al. New insights into the impact of primary lung adenocarcinoma location on metastatic sites and sequence: a multicenter cohort study. Lung Cancer. 2018. https://doi.org/10.1016/j.lungcan.2018.11.004.

    Article  PubMed  Google Scholar 

  3. Cingam SR, Mukkamalla SKR, Karanchi H. Adrenal metastasis. Treasure Island (FL): StatPearls Publishing; 2022.

    Google Scholar 

  4. Hammarstedt L, Muth A, Sigurjónsdóttir H, Almqvist E, Wangberg B, Hellstrom M. Adrenal lesions in patients with extra-adrenal malignancy - benign or malignant? Acta Oncol (Stockholm Sweden). 2012. https://doi.org/10.3109/0284186X.2011.608084.

    Article  Google Scholar 

  5. Szász P, Kučera P, Čtvrtlík F, Langová K, Hartmann I, Tüdös Z. Diagnostic value of unenhanced CT attenuation and CT histogram analysis in differential diagnosis of adrenal tumors. Medicina (Kaunas).2020; https://doi.org/10.3390/medicina56110597.

  6. Elsayes KM, Emad-Eldin S, Morani AC, Jensen CT. Practical approach to adrenal imaging. Radiol Clin North Am. 2017. https://doi.org/10.1016/j.rcl.2016.10.005.

    Article  PubMed  Google Scholar 

  7. Caoili EM, Korobkin M, Francis IR, Cohan RH, Platt JF, Dunnick NR, et al. Adrenal masses: characterization with combined unenhanced and delayed enhanced CT. Radiology. 2002. https://doi.org/10.1148/radiol.2223010766.

    Article  PubMed  Google Scholar 

  8. Korobkin M, Brodeur FJ, Francis IR, Quint LE, Dunnick NR, Londy F. CT timeattenuation washout curves of adrenal adenomas and nonadenomas. AJR Am J Roentgenol. 1998. https://doi.org/10.2214/ajr.170.3.9490968.

    Article  PubMed  Google Scholar 

  9. Peña CS, Boland GW, Hahn PF, Lee MJ, Mueller PR. Characterization of indeterminate (lipid-poor) adrenal masses: use of washout characteristics at contrast-enhanced CT. Radiology. 2000. https://doi.org/10.1148/radiology.217.3.r00dc29798.

    Article  PubMed  Google Scholar 

  10. Mayo-Smith WW, Song JH, Boland GL, Francis IR, Israel GM, Mazzaglia PJ, et al. Management of incidental adrenal masses: a white paper of ACR Incidental Findings Committee. J Am Coll Radiol. 2017. https://doi.org/10.1016/j.jacr.2017.05.001.

    Article  PubMed  Google Scholar 

  11. Choi YA, Kim CK, Park BK, Kim B, et al. Evaluation of adrenal metastases from renal cell carcinoma and hepatocellular carcinoma:use of delayed contrast-enhanced CT. Radiology. 2013. https://doi.org/10.1148/radiol.12120110.

    Article  PubMed  Google Scholar 

  12. Schloetelburg W, Ebert I, Petritsch B, Weng AM, Dischinger U, Kircher S, et al. Adrenal wash-out CT: moderate diagnostic value in distinguishing benign from malignant adrenal masses. Eur J Endocrinol. 2021. https://doi.org/10.1530/EJE-21-0650.

    Article  PubMed  PubMed Central  Google Scholar 

  13. Koo HJ, Choi HJ, Kim HJ, Kim SO, Cho KS. The value of 15-minute delayed contrast enhanced CT to differentiate hyper attenuating adrenal masses compared with chemical shift MR imaging. Eur Radiol. 2014. https://doi.org/10.1007/s00330-013-3084-7.

    Article  PubMed  Google Scholar 

  14. Schieda N, Krishna S, McInnes MDF, Moosavi B, Alrashed A, Moreland R, et al. Utility of MRI to differentiate clear cell renal cell carcinoma adrenal metastases from adrenal adenomas AJR Am J Roentgenol. 2017; https://doi.org/10.2214/AJR.16.17649. Epub 2017 Jul 25.

  15. Chong S, Lee KS, Kim HY, Kim YK, Kim BT, Chung MJ, et al. Integrated PET-CT for the characterization of adrenal gland lesions in cancer patients: diagnostic efficacy and interpretation pitfalls. Radiographics. 2006. https://doi.org/10.1148/rg.266065057.

    Article  PubMed  Google Scholar 

  16. Kassirer JP. Our stubborn quest for diagnostic certainty. A cause of excessive testing. N Engl J Med. 1989; https://doi.org/10.1056/NEJM198911023211820.

  17. Torresan F, Crimi F, Ceccato F, Zavan F, Barbot M, Lacognata C, et al. Radiomics: a new tool to differentiate adrenocortical adenoma from carcinoma. BJS Open. 2021. https://doi.org/10.1093/bjsopen/zraa061.

    Article  PubMed  PubMed Central  Google Scholar 

  18. Cao LX, Xu WG. Radiomics approach based on biphasic CT images well differentiate "early stage" of adrenal metastases from lipid-poor adenomas: a STARD compliant article. Medicine (Baltimore).2022;  https://doi.org/10.1097/MD.0000000000030856.

  19. Kim JY, Kim SH, Lee HJ, Kim MJ, Kim YH, Cho SH, et al. Utilisation of combined 18F-FDG PET/CT scan for differential diagnosis between benign and malignant adrenal enlargement. Br J Radiol. 2013. https://doi.org/10.1259/bjr.20130190.

    Article  PubMed  PubMed Central  Google Scholar 

  20. Launay N, Silvera S, Tenenbaum F, Groussin L, Tissier F, Audureau E, et al. Value of 18-F-FDG PET/CT and CT in the diagnosis of indeterminate adrenal masses. Int J Endocrinol. 2015. https://doi.org/10.1155/2015/213875.

    Article  PubMed  PubMed Central  Google Scholar 

  21. Kunikowska J, Matyskiel R, Toutounchi S, Grabowska-Derlatka L, Koperski L, Królicki L. What parameters from 18F-FDG PET/CT are useful in evaluation of adrenal lesions? Eur J Nucl Med Mol Imaging. 2014. https://doi.org/10.1007/s00259-014-2844-1.

    Article  PubMed  PubMed Central  Google Scholar 

  22. Vos E, Grewal RK, Russo AE, Reidy-Lagune D, Untch BR, Gavane SC, et al. Predicting malignancy in patients with adrenal tumors using 18F-FDG PET/CT SUVmax. J Surg Oncol. 2020. https://doi.org/10.1002/jso.26203.

    Article  PubMed  PubMed Central  Google Scholar 

  23. Nishino M, Jagannathan JP, Ramaiya NH, Abbeele ADV. Revised RECIST guideline version 1.1: What oncologists want to know and what radiologists need to know. AJR Am J Roentgenol. 2010;  https://doi.org/10.2214/AJR.09.4110.

  24. Kang S, Oh YL, Park SY. Distinguishing pheochromocytoma from adrenal adenoma by using modified computed tomography criteria. Abdom Radiol(NY). 2021;https://doi.org/10.1007/s00261-020-02764-4.

  25. Akbulut S, Erten O, Kahramangil B, Gokceimam M, Kim YS, Li P, et al. A critical analysis of computed tomography washout in lipid-poor adrenal incidentalomas. Ann Surg Oncol. 2021. https://doi.org/10.1245/s10434-020-09329-1.

    Article  PubMed  Google Scholar 

  26. Schieda N, Siegelman ES. Update on CT and MRI of adrenal nodules. AJR Am J Roentgenol. 2017. https://doi.org/10.2214/AJR.16.17758.

    Article  PubMed  Google Scholar 

  27. Liu HP, Guan X, Xu BB, Zeng FY, Chen CY, Yin HL, et al. Computed tomography-based machine learning differentiates adrenal pheochromocytoma from lipid-poor adenoma. Front Endocrinol (Lausanne). 2022. https://doi.org/10.3389/fendo.2022.833413.

    Article  PubMed  PubMed Central  Google Scholar 

  28. Chen JC, He YD, Zeng XW, Zhu SX, Li FY. Distinguishing between metastatic and benign adrenal masses in patients with extra-adrenal malignancies. Front Endocrinol (Lausanne).2022; https://doi.org/10.3389/fendo.2022.978730.

  29. Groot PD, Munden RF. Lung cancer epidemiology, risk factors, and prevention. Radiol Clin North Am. 2012. https://doi.org/10.1016/j.rcl.2012.06.006.

    Article  PubMed  Google Scholar 

  30. Siegel RL, Miller KD, Fuchs HE, Jemal A. Cancer statistics, 2021. CA Cancer J Clin. 2021. https://doi.org/10.3322/caac.21654.

    Article  PubMed  Google Scholar 

  31. Sherlock M, Scarsbrook A, Abbas A, Fraser S, Limumpornpetch P, Dineen R, et al. Adrenal incidentaloma. Endocr Rev. 2020. https://doi.org/10.1210/endrev/bnaa008.

    Article  PubMed  PubMed Central  Google Scholar 

  32. Sasaguri K, Takahashi N, Takeuchi M, Carter RE, Leibovich BC, Kawashima A. Differentiation of benign from metastatic adrenal masses in patients with renal cell carcinoma on contrast-enhanced CT.AJR Am J Roentgenol. 2016;https://doi.org/10.2214/AJR.16.16193.

  33. Van den Broek J, Geenen R, Heijnen L, Kobus C, Schreurs H. Adrenal incidentalomas during diagnostic work-up of colorectal cancer patients: what is the risk of metastases? Ann Surg Oncol. 2018. https://doi.org/10.1245/s10434-018-6501-y.

    Article  PubMed  Google Scholar 

  34. Watanabe H, Kanematsu M, Goshima S, Kondo H, Kawada H, Noda Y, et al. Adrenal-to-liver SUV ratio is the best parameter for differentiation of adrenal metastases from adenomas using 18F-FDG PET/CT. Ann Nucl Med. 2013. https://doi.org/10.1007/s12149-013-0730-8.

    Article  PubMed  PubMed Central  Google Scholar 

  35. Boland GW, Goldberg MA, Lee MJ, Mayo-Smith WW, Dixon J, McNicholas MM, et al. Indeterminate adrenal mass in patients with cancer: evaluation at PET with 2-[F-18]-fluoro-2-deoxy-D-glucose. Radiology. 1995. https://doi.org/10.1148/radiology.194.1.7997539.

    Article  PubMed  Google Scholar 

  36. Jana S, Zhang T, Milstein DM, Isasi CR, Blaufox MD. FDG-PET and CT characterization of adrenal lesions in cancer patients. Eur J Nucl Med Mol Imaging. 2006. https://doi.org/10.1007/s00259-005-1915-8.

    Article  PubMed  Google Scholar 

Download references

Acknowledgements

The authors would like to thank Yongliang Liu and Wengui Xu for the study design.

Funding

There was no funding.

Author information

Authors and Affiliations

Authors

Contributions

YL, WX, and LC together designed the study. LC, DZ and HY collected the patient images and performed the statistical analysis and wrote the manuscript.WX critically reviewed the manuscript. All authors contributed to the article and approved the submitted version.

Corresponding authors

Correspondence to Wengui Xu or Yongliang Liu.

Ethics declarations

Ethics approval and consent to participate

The studies involving human participants were reviewed and approved by Tianjin Medical University Cancer Hospital Institutional Ethics Committee. Written informed consent was waived by Tianjin Medical University Cancer Hospital Institutional Ethics Committee.

Consent for publication

Not applicable.

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Cao, L., Zhang, D., Yang, H. et al. 18F-FDG-PET/CT-based machine learning model evaluates indeterminate adrenal nodules in patients with extra-adrenal malignancies. World J Surg Onc 21, 305 (2023). https://doi.org/10.1186/s12957-023-03184-6

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1186/s12957-023-03184-6

Keywords