CILP2 overexpression correlates with tumor progression and predicts poor outcome in patients with colorectal cancer


 Background: Genetic alterations play an important role in the progression of colorectal cancer (CRC). Identifying new biomarkers to assess the prognosis of patients with colorectal cancer is critical. Cartilage Intermediate Layer Protein 2 (CILP2) gene, screened from the TCGA database by bioinformatics, may be closely related to the progression of colorectal cancer. CILP2 was barely reported with clinical features of tumors.Materials and methods: Clinical information and RNA-seq data were derived from the TCGA colorectal carcinoma cohort. CILP2 expression at mRNA level was estimated by bioinformatical analysis of TCGA cases. Tissue microarray (TMA) was constructed containing paraffin-embedded 64 pairs of CRC and matched adjacent normal tissues. The expression at the protein level was detected in 64 pairs of CRC and matched adjacent normal tissues by immunohistochemical analysis. CILP2 expression level and its clinical value were estimated by bioinformatical analysis with linear and logistic regression. Survival analysis was performed between high and low groups of CILP2 expression by Cox regression analysis, and the P-value was calculated by the log-rank test. Kaplan-Meier curves were tested by the log-rank test. Results: CILP2 was statistically significant higher expressed in the colorectal cancer tissues when compared with paired adjacent normal tissues in the TCGA cohort (P<0.001) and in the TMA cohort (P=0.001). Also, CILP2 high-expression was strongly correlated with T3/4 stage (P=0.001), N1/2/3 stage (P=0.005), M1 stage (P=0.048), and higher clinical stage (UICC 2010 stage) (P<0.001) in TCGA cohort, and also positively associated with T3/4 stage (P=0.022) and higher clinical stage (UICC 2010 stage) (P=0.03) in TMA cohort. Furthermore, CILP2 overexpression predicted poor prognosis and could be as an independent prognostic factor (P=0.003).Conclusion: We revealed that CILP2 is associated with advanced stages and could play a role as an independent predictor of poor survival in colorectal cancer.


Introduction
Colorectal cancer (CRC) is one of the most common cancers that ranks second in cancer-associated mortality among the world, with increasing morbidity in recent years. It was estimated that in 2018, more than 1.8 million new colorectal cancer cases occurred with 881,000 deaths [1] . Colorectal cancer is caused by a variety of factors and is involved successive accumulation of genetic and epigenetic alternations [2] .
Surgical resection is the mainstay for treatment of CRC patients, but tumor recurrence is common. A large cohort study indicated that the median survival time was 13.3 months before recurrence [3] . Numerous works have been done to reveal the underlying mechanisms of CRC, and encouraging progresses have been made [4][5][6][7] . However, further investigating works are still needed to deeply understand the molecular mechanisms, and molecular biomarkers for both early detection and prognosis are to be developed for better therapeutic uses in CRC patients.
Cartilage Intermediate Layer Protein 2 (CILP2) is a secreted glycoprotein that has been rst isolated from human articular cartilage. The CILP gene located on chromosome 19p13. It was reported that through genome-wide association studies (GWAS), the Neurocan-cartilage intermediate layer protein 2-pre-B-cell leukemia homeobox 4 (NCAN-CILP2-PBX4) region, an intergenic region spanning 300 kb, is associated with concentrations of low-density lipoprotein cholesterol and triglycerides in sera [8] . Eleven genes and one miRNA are encoded in this region. This region has been shown consistent and deep association with serum lipid levels in subsequent studies for individuals of European and Chinese descents [9][10][11] . In addition to plasma lipid levels, the genome region around CILP2 was identi ed as a non-alcoholic fatty liver disease (NAFLD)-associated locus by GWAS in individuals of European descent [12] , but not in Japanese individuals [13] . A signi cant association was also highlighted between polymorphisms in the CILP gene and osteoarthritis progression [14] . But to our best knowledge, few reports have described the relationship between CILP2 and cancers, except one that reported an expression quantitative trait locus, namely rs8103992, was signi cantly associated with osteosarcoma risk [15] .
Tumors have been considered as high demands of energy and abnormal anabolism for their rapid growth, in which lipid metabolism plays a key role in tumorigenesis. But subtle mechanisms underlying over-reacted lipid metabolism remain poorly understood. CILP2 was barely reported with clinical features of tumors. In this study, we evaluated CILP2 expression and its correlations with clinicopathological characteristics, such as tumor stages, and overall survival of CRC patients in the Cancer Genome Altas (TCGA), and furtherly veri ed using immunohistochemistry assay within human CRC tissues, which may provide a new potential molecular marker for prognostic use of CRC patients.

Materials And Methods
TCGA Data mining and gene expression datasets.
The CRC cohort in TCGA was downloaded and level 3 RNA-seq V2 datasets was used, which was based on Illumina HiSeq 2000. Matched clinical data from CRC patients were also downloaded (https://portal.gdc.cancer.gov/). In the cohort, 621 CRC patients were included, and 609 among them had intact survival data recorded. So, 609 patients were included in the survival analysis in the study. For each gene, the transcript with highest expression was selected for the following process. Meanwhile, the data of one gene was considered invalid when raw counts of the gene in all samples were less than 50. All ltered genes expressions had been processed and been normalized by Trimmed Mean of M-values analysis.
Tissue microarray construction.
Tissue microarray construction was carried out as described previously [16] . Brie y, 64 pairs of CRC and matched adjacent normal tissues were obtained from patients undergoing CRC surgery between January 2016 and October 2019 at the Department of Gastrointestinal Tumor Surgery, Fujian Cancer Hospital.
Tissue microarray recipient block was constructed containing para n-embedded 64 pairs of CRC and matched adjacent normal tissues previously xed in 10% formaldehyde. The most representative tumor or normal areas were carefully selected and marked based on the matched haematoxylin-eosin-stained slides. Altogether, 128 cores (diameter 1.8 mm) of test tissue were taken from the donor blocks with the tissue microarrayer (Beecher Instruments, Silver Spring, MD, USA) and inserted into the recipient block. Immunohistochemistry analysis.
Immunohistochemistry was carried out as described previously [16] . Brie y, unstained 4 mm sections were cut from the tissue microarray recipient block and depara nized in xylene, and the slides were bathed in 0.01 mol/l sodium citrate and heated in a microwave oven for 12 min. The sections were incubated with anti-CILP2 antibody (Santa Cruz, CA, USA) and kept at 4˚C overnight. Negative control slides were treated with only non-immunized mouse immunoglobulin fraction under equivalent conditions. For the secondary developing reagents, a labeled streptavidin-biotin kit (Dako, CA, USA) was used. Slides were developed with diaminobenzaminidine and counterstained with hematoxylin.

Evaluation of immunostaining results
Immunohistochemistry staining was scored as described previously blindly by two independent pathologists without knowledge of the patient's clinicopathology and clinical outcome [17] . Positive cases were de ned by the presence of intracellular staining with red/brown color in epithelial cells. The expression level of CILP2 was evaluated semi-quantitatively according to the proportion of positively stained tumour cells for CILP2 and the intensity of the staining. The immunoscores ranged between 0 and 3 as follows: I) 0, no recognizable staining, referred to as negative (-); II) 1, slight staining, referred to as weak positive (+); III) 2, moderate staining, referred to as moderate positive (++); and IV) 3, distinct staining, referred to as strong positive (+++). Positive expression of CILP2 protein was de ned as moderate positive staining (++) and strong positive staining (+++) for CILP2, whereas Negative expression of CILP2 protein was de ned as negative () and weak positive (+) staining.
Statistical Analysis. Statistical analyses were performed using spss 22.0 software. CILP2 gene expression in different groups (divided by each parameter) was compared using Mann-Whiteney U test. Correlation between CILP2 gene expression and different TNM stages were analyzed by Spearman's test, and Spearman rank correlation coe cient (r s ) was used to evaluate the strength of association. CILP2 gene expression in different groups was analyzed by one-way ANOVA followed by Welch's t-test. CILP2 protein expression in different groups was analyzed by Fisher's exact test. Survival analysis was performed between high and low groups of CILP2 expression (de ned by median value of CILP2 expression) by Cox regression analysis, and P value was calculated by log-rank test. Kaplan-Meier curves were tested by log-rank test.

Results
CILP2 was overexpressed in Colorectal cancer.
Aiming at searching for potential novel prognostic markers of CRC, we rstly analyze expression data of TCGA CRC cohort from Illumina HiSeq 2000 platform, which contains 621 samples and correlating clinical and demographic information. We found that CILP2 was strongly correlated with clinical features of CRC samples in TCGA cohort, and CILP2 has not been reported in CRC before. So we focused on CILP2 and furtherly analyzed the association between CILP2 expression and CRC prognosis. To determine the role of CILP2 in colorectal cancer, we rst analyzed CILP2 gene expression in 50 patients samples with paired adjacent normal tissues in TCGA cohort, and the results suggested that CILP2 gene was overexpressed signi cantly in tumor samples compared to paired adjacent normal tissues ( Fig. 1A-1B, Fold change = 3.412, P < 0.001, Table 1). Additionally, CILP2 gene expression was upregulated in total amount of colorectal cancer samples compared with adjacent normal tissue samples in TCGA cohort Correlations between CILP2 expression and clinicopathological parameters in Colorectal cancer.
High CILP2 expression is associated with poor outcome of colorectal cancer patients.
Kaplan-Meier analysis was performed to investigate relationship between CILP2 expression and overall survival (OS) in TCGA cohort. There were 609 CRC samples available of prognostic information. Median expression of CILP2 of all CRC samples was chosen as a cutoff to divide CRC samples into CILP2-high (n = 305) group and CILP2-low (n = 304) group. As shown in Fig. 3, Table 5, CRC patients with high CILP2 expression exhibited a poorer OS rate compared with the lowexpression group (P = 0.003). Moreover, the univariate Cox regression analysis indicated that high CILP2 expression was strongly associated with a poor prognosis(P = 0.003). Other clinical variables, such as age (P < 0.0001), T stage (P = 0.005), N stage (P < 0.0001), M stage (P < 0.0001), and clinical stage (UICC 2010 stage) (P < 0.0001) were all associated with OS (Table 6). Moreover, the multivariate analysis revealed that high CILP2 expression (P = 0.034), age (P < 0.0001), M stage (P < 0.0001) and clinical stage (UICC 2010 stage) (P = 0.017) were independently associated with a poor prognosis (Table 6). These results suggested that CILP2 could be used as an independent prognostic predictor for colorectal cancer patients in the dataset.

Discussion
The incidence of colorectal cancer has risen sharply in recent years [1] , with limited diagnostic and prognostic tools for early detection and patients' survival prediction. There are many researches focusing on the issue, and numerous advances have been achieved to reveal the underlying mechanisms of cancer development [4][5][6][7] . For example, lots of studies have shown that microsatellite instability (MSI) in genome could act as an exclusive prognostic marker in the early stages of CRC [4,18] . Another useful tool, Septin9 hypermethylation detection in blood samples has received researchers' attention and was the rstapproved serum test for CRC screening by FDA. But further estimation on Septin9 serum assay for CRC screening turned out that it was weakly recommended because of low sensitivity for cancer, and inability to detect advanced adenomas [19] . Extensive works are still needed to provide new insights into the tumor. CILP2 (Cartilage Intermediate Layer Protein 2) protein is a noncollagenous protein in human articular cartilage. In last few years, correlations between CILP2 and plasma lipid concentration in different populations have been studied in some GWAS researches. According to Kathiresan et al. [8] , in Caucasian individuals analyzed, rs16996148 variant of CILP2 gene had a reducing role in triglyceride and LDL-C level. While in other reports, the relationship between CILP2 polymorphism and lipid metabolism was not yet discovered [20] , nor in Japanese population [21] or in Slovak Midlife women [22] . However, Lenka et al.
indicated that the minor T allele in CILP2 gene was associated with lower LDL-C, apoB, and atherogenic indices and higher HDL-C levels [22] . This result was in accordance with the study in Singaporean population ranging from 40 to 80 years of age [23] . On the other hand, it have been reported that SNPs in CILP2 gene was associated with adult height attainment [24] , and CpGs in CILP2 were signi cantly associated with both body mass index and fat-free mass index in preschool children [25] .
According to Chenan Zhang et al., an expression quantitative trait locus for CILP2 gene, rs8103992, was signi cantly associated with adult height attainment and osteosarcoma risk after adjustment for multiple comparisons in 864 osteosarcoma cases and 1879 controls of European ancestry [15] . To our best acknowledgement, there were no more reports describing relationship between CILP2 and cancers.
Our work presented here has evaluated the prognostic value of CILP2 in CRC by analyzing dataset of TCGA cohort and TMA cohort. For the rst time, we found out that CILP2 was upregulated in colorectal cancer tissues compared to normal tissues. In addition, we observed that CILP2 expression was signi cantly correlated with clinicopathological parameters of CRC patients in TCGA cohort and TMA cohort. In high-stage CRC samples, CILP2 was upregulated compared to low-stage CRC samples. To evaluate prognostic value of CILP2 on overall survival of CRC patients in TCGA cohort, Kaplan-Meier and Cox regression analysis were performed. We found out that higher CILP2 expression was correlated with much poorer prognosis in CRC patients. These results indicated that CILP2 could act as an independent prognostic marker in colorectal cancer.
Recently, many reports have shown that obesity represents a common risk factor for several types of cancer [26,27] , especially for hormone dependent cancers, such as breast cancer [28,29] and advanced prostate cancer [30] . The biological association between obesity and cancer might relate to tissue lipid metabolism. It is well known that cancer cells, including CRC cells, show alterations in lipid metabolism of synthesis, desaturation, elongation and mitochondrial oxidation of fatty acids [31][32][33] . A populationbased study has revealed incidences of colorectal cancer to be associated with circulating levels of apolipoproteins [32] . Sophisticated correlation and therapeutic use of lipid metabolism-related alternations remain further investigations.

Conclusions
Our study has raised that CILP2 might serve as a potential prognostic marker in CRC patients. Further studies would be needed to detect CILP2 expression in serum of CRC patinets, and con rm the prognostic value and feasibility in larger and multi-center cohorts of CRC patients, as well as to further elucidate molecular mechanisms underlying correlations between CILP2 and colorectal cancer development.

Availability of data and materials
Availability of data and materials The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
Ethics approval and consent to participate Not applicable.

Consent for publication
Not applicable.

Competing interests
The authors declare that they have no competing interests.  :some missing data for parameter.P value<0.05 was considered statistically significant (in bold). T: tumor; N: Regional lymph node; M: metastasis. T: tumor; N: Regional lymph node; M: metastasis. T: tumor; N: Regional lymph node; M: metastasis. T: tumor; N: Regional lymph node; M: metastasis; HR, hazard ratio; CI, confidence interval   Kaplan-Meier analysis of CILP2 expression and overall survival in total CRC samples of 10 years in TCGA cohort. Higher CILP2 expression group had a poorer overall survival than low CILP2 expression group. (P=0.003)

Supplementary Files
This is a list of supplementary les associated with this preprint. Click to download.