Skip to main content

A signature of 24 aging‑related gene pairs predict overall survival in gastric cancer



Aging is the major risk factor for most human cancers. We aim to develop and validate a reliable aging-related gene pair signature (ARGPs) to predict the prognosis of gastric cancer (GC) patients.


The mRNA expression data and clinical information were obtained from two public databases, The Cancer Genome Atlas (TCGA) dataset, and Gene Expression Omnibus (GEO) dataset, respectively. The best prognostic signature was established using Cox regression analysis (univariate and least absolute shrinkage and selection operator). The optimal cut-off value to distinguish between high- and low-risk patients was found by time-dependent receiver operating characteristic (ROC). The prognostic ability of the ARGPS was evaluated by a log‐rank test and a Cox proportional hazards regression model.


The 24 ARGPs were constructed for GC prognosis. Using the optimal cut-off value − 0.270, all patients were stratified into high risk and low risk. In both TCGA and GEO cohorts, the results of Kaplan–Meier analysis showed that the high-risk group has a poor prognosis (P < 0.001, P = 0.002, respectively). Then, we conducted a subgroup analysis of age, gender, grade and stage, and reached the same conclusion. After adjusting for a variety of clinical and pathological factors, the results of multivariate COX regression analysis showed that the ARGPs is still an independent prognostic factor of OS (HR, 4.919; 95% CI 3.345–7.235; P < 0.001). In comparing with previous signature, the novel signature was superior, with an area under the receiver operating characteristic curve (AUC) value of 0.845 vs. 0.684 vs. 0.695. The results of immune infiltration analysis showed that the abundance of T cells follicular helper was significantly higher in the low-risk group, while the abundance of monocytes was the opposite. Finally, we identified and incorporated independent prognostic factors and developed a superior nomogram to predict the prognosis of GC patients.


Our study has developed a robust prognostic signature that can accurately predict the prognostic outcome of GC patients.


Globally, gastric cancer (GC) is an important public health problem, and its morbidity and mortality are at the forefront of cancer [1, 2]. As research continues, treatment approaches for GC have been refined and optimized. However, due to the occult course of GC, most patients who are first diagnosed are already at an advanced stage, with a 5-year survival rate of only about 30% [3]. In addition, GC is a highly heterogeneous cancer, has multiple histological types and each type has a unique biological behavior [4, 5]. It is well known that the most common prognostic tool used by clinicians is the American Joint Committee on Cancer (AJCC) tumor node metastasis (TNM) stage system [6]. However, some patients with GC have the same TNM stage and finally get different clinical outcomes, which show that the AJCC stage system ignores the biological heterogeneity of the tumor. Therefore, novel and powerful prognostic signature is needed in clinical work to accurately identify high-risk patients in order to improve survival outcomes in GC patients.

Aging is the process of loss and degeneration of the body from constitutive substances and tissue structures to physiological functions [7, 8]. Previous studies have shown that aging is the risk factor for many diseases, such as heart disease [9], neurodegenerative diseases [10], type 1 diabetes [11] and cancer [12]. Interestingly, aging-related genes (AGs) have two different biological roles in regulating cell senescence. It can inhibit tumor growth by regulating the senescence of tumor cells, and can also promote tumor invasion and metastasis [13,14,15,16]. Selecting those gene sets with potential biological significance for the development of predictive models can provide new insights for individualized treatment. Given the key role of aging in the development of GC, we selected a set of AGs to build a gene signature to distinguish different risk groups, which are of great benefit in selecting personalized treatment plans for patients.

The rapid development of high-throughput sequencing technologies and bioinformatics tools has allowed us to study cancer in greater depth. For GC, many previous studies have established gene signature to predict the prognosis of patients [17,18,19]. However, for various reasons (technical limitations and diversity of data types) these gene signature have not been widely used in daily clinical work. Fortunately, in order to overcome above problems and make the developed gene signatures available to clinicians as early as possible, researchers have developed a new algorithm based on the relative ranking of gene expression levels, and many studies using this algorithm have yielded reliable results [20,21,22].

Therefore, the main focus of this study is to construct a robust and stable individualized prognostic signature based on AGs using a new algorithm. In addition, the robustness of our signature was validated using another independent cohort. Subsequently, we also compared the performance of the new signature with previously published prognostic models for GC.


Construction of ARGPs signature

The analysis process of present study is shown in Fig. 1. As described in the method, a total of 155 ARGPs related to prognosis was identified. Next, we applied a Lasso‐penalized Cox regression to further reduce the number of ARGPs in the prognostic model. Finally, we filtered to obtain 24 ARGPs and their corresponding coefficients (Table 1). Based on the score of each gene pair and the corresponding coefficient, the following formula was constructed to calculate the risk score:

Fig. 1

Flowchart of the research procedure in this study

Table 1 Prognostic signature consists of 24 aging-related gene pairs

Risk score = (− 0.097 * TP53_PDGFRB) + (− 0.090 * E2F1_PTGS2) + (− 0.043 * E2F1_SERPINE1) + (0.256 * STAT5B_FOXO1) + (0.067 * NGFR_ADCY5) + (0.069 * PDGFB_TFAP2A) + (− 0.462 * SST_BCL2) + (0.219 * PRKCD_FEN1) + (0.059 * HIF1A_MIF) + (0.221 * NR3C1_PIK3R1) + (− 0.042 * BRCA2_RGN) + (0.001 * CEBPA_FEN1) + (− 0.174 * FAS_IL6) + (− 0.388 * BAX_SERPINE1) + (− 0.370 * FOXO1_CDKN2B) + (− 0.142 * MSRA_PDGFRA) + (− 0.072 * RECQL4_NUDT1) + (0.188 * UCHL1_CDKN2B) + (0.120 * SNCG_PPARGC1A) + (− 0.046 * GSR_CDKN1A) + (− 0.123 * PIK3R1_EFEMP1) + (− 0.075 * PIK3R1_PDGFRA) + (0.073 * MXD1_PYCR1) + (− 0.217 * DBN1_SERPINE1).

Validation and assessment of the established ARGPs signature

According to the above formula, the risk score for all patients was calculated in the TCGA dataset. The optimal cut-off points for the risk score using ROC curve analysis were − 0.270, which can categorize the patients into two groups (high risk vs. low risk) (Additional file 1: Figure S1). The K–M curve and log-rank test showed that patients in the high-risk group have a shorter survival time than in the low-risk group (P < 0.001, Fig. 2a). Next, we used the same cut-off value to divide patients in the GEO dataset into two groups, and got the same results (P = 0.002, Fig. 2c). In addition, to further evaluate the prediction accuracy of ARGPs signature, a time-dependent ROC curve analysis was performed. In the TCGA cohort, the AUC values of the prognostic models predicted by 1-year, 3-year and 5-year survival rates were 0.738, 0.791 and 0.845, respectively (Fig. 2b). This demonstrated that the predictive performance of our established prognostic signature was reliable. The AUC values for OS in the GEO cohort at 1 year, 3 years and 5 years were 0.637, 0.685 and 0.690, respectively (Fig. 2d). These results were sufficient to confirm that the prognosis model can accurately predict the prognosis of GC patients.

Fig. 2

Construction and validation of aging-related gene pair signatures (ARGPs). a Survival curves of high- and low-risk groups separated by ARGPs in the TCGA cohort. b Receiver operating characteristic (ROC) curves for survival risk predicted by ARGPs for 1-, 3-, and 5-year follow-ups. c Survival curve of high- and low-risk groups separated by ARGPs in the GEO cohort. d ROC analysis for survival rate predicted by ARGPs for 1-, 3-, and 5-year follow-ups

Subsequently, we performed a stratified survival analysis to determine whether our model still had predictive value in different subgroups of clinicopathological parameters. As expected, the K–M curves illustrated that the signature was a stable prognostic marker for patients with GC stratified by age (≤ 65 or > 65), gender (female or male), grade (G1–2 or G3), and stage (I–II or III–IV) (Fig. 3).

Fig. 3

The Kaplan–Meier curves of patients with gastric cancer stratified by age (a, b), gender (c, d), grade (e, f), and stage (g, h)

In addition, univariate and multivariate Cox regression analyses were performed in the TCGA and GEO datasets to assess whether the prognostic power of ARGPs signature was independent of other clinicopathological parameters (age, sex, stage, grade). After two COX regression analyses, the risk score remained an independent prognostic indicator in the TCGA cohort (HR = 4.919, 95% CI [3.345–7.235], P < 0.001; Fig. 4a, b). As shown in Fig. 4c, d, the independent value of the ARGPs signature risk score had been verified in the GEO dataset (HR = 1.529, 95% CI [1.083–2.158], P = 0.016).

Fig. 4

Univariate (a) and multivariate (b) analyses of prognostic factors in TCGA cohort and univariate (c) and multivariate (d) analyses of prognostic factors in GEO cohort identified the signature as an independent prognostic factor

Comparison of the performance of ARGPs signature and existing models in survival prediction

We further compared the predictive performance of ARGPs signatures with two previously published mRNA signatures: the RNA-binding protein-related gene signature established by Zhou et al. [23] and the metabolic-related gene signature established by Wen et al. [24] using the same TCGA patient cohort. As shown in Fig. 5, the AUC at 5 years of OS for the ARGPs was 0.845, which was significantly higher than that of RNA-binding proteins signature (AUC = 0.684) and metabolic-related gene signature (AUC = 0.695). Collectively, the above result indicated that our model was a promising tool for predicting the prognosis of GC patients.

Fig. 5

Comparison of ROC analysis between our model and two other existing models in predicting 5-year overall survival

Functional and pathway enrichment analysis

Additional file 2: Figure S2a shows the results of GO enrichment analysis. These genes were significantly enriched in response to oxidative stress, aging, apoptotic signaling pathway, regulation of reactive oxygen species metabolic process, negative regulation of cell proliferation, reproductive structure development. The KEGG pathway analysis revealed that these genes were highly associated with the pathways in cancer, HTLV-I infection, Kaposi sarcoma-associated herpesvirus infection, AGE–RAGE signaling pathway in diabetic complications, cellular senescence (Additional file 2: Figure S2b).

Immune cell infiltration in different risk groups

Tumor-infiltrating immune cells have been shown to correlate with the prognosis of many cancer patients [25, 26]. We applied the CIBERSORT algorithm to estimate the relative abundance of 22 immune cells within different risk groups. The radar charts depict a comparative summary of various immune cells in these two risk groups (Fig. 6a). We found that immune cells, such as T cells follicular helper and monocytes were enriched in different risk groups. Among them, T cells follicular helper was significantly and highly expressed in the low-risk group, and the monocytes were highly expressed in the high-risk group (Fig. 6b).

Fig. 6

The relative fraction of infiltrated immune cells in different risk groups in the TCGA dataset. a Radar plot of the difference in the abundance of 22 immune cells in tumor tissue in the two risk groups. b The abundance distribution of T cells follicular helper and monocytes within the high-risk group and the low-risk group

Establishment and evaluation of nomogram for predicting the prognosis of GC patients

Multivariate COX regression analysis showed that in addition to the risk score, age and stage are also independent prognostic factors for OS. To further improve the accuracy of clinicians in predicting patient prognosis, we constructed a practical and novel nomogram tool combining all parameters with independent prognostic significance (Fig. 7a). The total score was obtained by calculating the patient's score for each parameter separately, and by making a vertical line downward from the total score on the nomogram, the clinician can easily derive the probability of OS at 1, 3 and 5 years. In addition, we also evaluated the performance of the nomogram. The calibration curves consisting of calibration points were very close to the standard curves (Fig. 7b). This was sufficient to show that our nomogram prediction performance was very strong, especially when predicting the 3-year OS.

Fig. 7

Construction and evaluation of nomogram. a A nomogram that integrated the age, stage and risk score. b The calibration curves for 1-, 3-, and 5-year overall survival


In this study, the 24 ARGPs obtained from the screening were constructed into a new prognostic signature and its performance was further validated in an independent GEO cohort. The results showed the robust performance of our gene signature in predicting the prognosis of GC patients and accurately distinguishing high-risk populations. Then, subgroup survival analysis of age, sex, grade and stage also verified the stability of the signature. In addition, multivariate Cox analysis showed that the risk score was an independent prognostic factor for GC patients. In addition, the new prognostic signature identified performed well compared to the two published prognostic signatures, which is an important strength of this study. Finally, our nomogram constructed by combining risk score and clinicopathological parameters with independent prognostic significance can improve the predictive accuracy of OS in patients with GC.

Due to the heterogeneity of GC and the technical noise caused by cross-platform sequencing, the previously established prognostic signature needs to standardize the mRNA expression profile, which is a difficult but necessary step in data analysis. However, our study differs from previous literature in the building of prognostic signatures. The construction of the prognostic signature is based on gene pairs rather than single genes in this study. This method of data analysis using gene expression values for pairwise comparison is a novel algorithm that can overcome the above-mentioned problems. The use of novel algorithms to construct signatures is an advantage in this study, and has been proven to be reliable in previous studies [27]. ARGPs signature not only have a predictive prognostic function, but also reduce the detection process to some extent. For instance, both genes, RECQL4 and NUDT1, were overexpressed in GC tissues. Increased RECQL4 expression was associated with poor prognosis in GC patients [28]. The overexpression of NUDT1 was closely related to the increased depth of invasion and lower survival rate of GC patients [29]. These two mRNAs formed an ARGP in this study, and when the expression value of RECQL4 was lower than the expression level of NUDT1, a high-risk score was acquired, leading to poorer prognostic outcomes. The results require only a brief comparison of the magnitude of expression values for the constituent genes in the prognostic gene pairs, without the need for specific expression data. Thus, its feasibility is better than traditional gene signature detection. This novel signature is composed of 36 unique AGs, some of which have been intensively studied in GC and other cancers. For example, previous studies have shown that the expression of PDFGRB was related to lymphatic metastasis and poor prognosis of GC [30]. CDKN2B is highly expressed in GC tissues compared with adjacent tissues, and can be used as a prognostic marker for GC [31]. Ai et al.’s study showed that PIK3R1 overexpression can promote the progression of hepatocellular carcinoma [32]. The results of Irene Arroyo-Solera et al. [33] showed that high SERPINE1 expression promotes tumor invasion and metastasis, and is also associated with poor prognosis in head and neck squamous cell carcinoma patients. The above evidence is sufficient to suggest that ARGPs are not surprising as a prognostic signature for GC.

Recently, a lot of evidence supports the fact that the tumor microenvironment plays a huge role in determining the occurrence and development of tumors [34,35,36]. The results of Zeng et al. on the role of tumor immune infiltrating cells in GC suggest that it can be used as independent prognostic markers [37, 38]. Therefore, in this study we compared the differences in the abundance of tumor immune infiltrating cells between the high-risk and low-risk groups, and the results showed that two types of immune cells, T cells follicular helper, monocytes, differed between the two groups. There into, T cells follicular helper was more enriched in the low-risk group. Pan et al. found that T cells follicular helper was associated with poorer results in clear cell renal cell carcinoma [39]. The biological significance of T cells follicular helper in GC needs further study. In addition, compared with the low-risk group, the high-risk group had higher levels of monocytes. Increased monocytes have a negative impact on tumor immunity and are related to the poor prognosis of GC [40].

Nomogram is a powerful and easy-to-use tool that has been widely used in previous biomedical studies [41,42,43]. The clinicopathological parameters we included in the nomogram were all factors with independent prognostic significance obtained by multivariate Cox regression analysis; in this way, the nomogram plot is more convincing for predicting the prognosis of GC patients, and we also tested the accuracy of the tool using calibration plot curves. Although studies have been developed to predict prognosis nomogram plots for GC patients, this is the first one to be constructed in conjunction with ARGPs signature, providing new ideas for the development and use of nomograms.

Nevertheless, some limitations to our study should be acknowledged. First of all, we cannot avoid that our research results are based on retrospective analysis, and the results should be verified in prospective studies. Second, functional experimental studies should be conducted on these AGs to progressively elucidate the function of these ARGPs in GC.


In summary, we have developed a robust ARGPs signature using a new algorithm. It can still accurately predict the prognosis of individual patients in independent GEO data set. In addition, its performance was better than previously published prognostic models.

Materials and methods

Data sources and AGs acquisition

The preprocessed level 3 RNA-seq data and associated clinical information of GC were mined from TCGA data portal. Similarly, independent validation dataset GSE15459 was downloaded from GEO, including gene expression files and clinical follow-up information of 192 GC sample. 307 human AGs were obtained from the human aging genome resource (HAGR, Probe IDs correctly changed to the corresponding gene symbols based on the annotated information on the platform without further standardization. If multiple probe IDs are mapped to a single gene symbol, the average value is calculated as the expression value of the gene for further analysis. Expression data for 307 AGs were extracted from the TCGA cohort and the GSE15459 dataset, respectively, and the AGs common to both datasets were thus obtained.

Construction of a prognostic gene signature based on ARGPs

First, the median absolute deviation (MAD) < 0.5 was used as a filtering condition to discard those AGs that were imbalanced in distribution or had fairly little mutations [21]. The values of gene expression in a given sample or profile were compared pairwise to obtain a score for each ARGP. Briefly, the value was assigned as 1 when the first AG exhibit higher expression relative to the second AG in each ARGP; otherwise, the output was 0. In addition, we also filter out those ARGPs with a ratio of “0” or “1” < 20%. This novel algorithm was one of the strengths of this study because ARGP was derived from pairwise comparisons and was based entirely on gene expression in the same sample, which can overcome the bias associated with sequencing across different platforms and does not require additional processing of the data. A total of 1278 ARGPs were used as initial candidate factors for prognostic prediction in two datasets. Through univariate Cox regression and the Kaplan–Meier (K–M) method, a summary of 155 ARGPs related to the prognosis of survival were screened out (P < 0.05). To get a more practical model, we used least absolute shrinkage and selection operator (LASSO) regression to eliminate highly correlated ARGPs. Finally, 24 ARGPs were reserved for constructing prognostic signature. For every patient, the risk score was obtained according to the following model formula we constructed:

$${\text{Risk score}} = {\text{ coef}}\left( {{\text{gene}}1} \right) *{\text{Exp}}\left( {{\text{gene}}1} \right) \, + {\text{Coef }}\left( {{\text{gene}}2} \right)*{\text{ Exp }}\left( {{\text{gene}}2} \right) \, + \, \cdots \, + {\text{ coef}}\left( {{\text{genen}}} \right)*{\text{ Exp }}\left( {{\text{genen}}} \right),$$

where “coef” was the coefficient estimated from the LASSO regression analysis and “Exp” represents the score of the ARGP. We used the R package “survival” and “survivalROC” to access the best cut-off value of a risk score by the time-dependent ROC curve analysis. We then categorized the GC patients into high- and low-risk groups, according to the best cut-off value.

Evaluation of the prognostic performance of the ARGPs signature

Survival curves for OS were generated according to the K–M method and the log-rank test was applied to compare the differences between low- and high-risk groups. The predictive ability of the model was then determined by the area under the ROC curve (AUC). Univariate and multivariate Cox analysis were performed to estimate the influence of risk score model on prognosis and other clinicopathological factors in the TCGA dataset. Hazard ratios (HRs) with their corresponding 95% confidence intervals (CIs) were obtained.

Comparison of performance with other prognostic models

To demonstrate the powerful performance of the prognostic model we have established, we use the R packages “limm’’, “survival”, “survminer” and “timeROC” to compare it with previously published models. The AUC was used to measure the predictive performance of each model.

GO and KEGG pathway functional enrichment analysis of AGs

To further explore the underlying biological significance of AGs in GC, 39 AGs of the constructed prognostic model were used for functional enrichment analysis. Metascape is an online web tool with comprehensive functions and fast update speed. Therefore, we used this tool to investigate the biological functions and pathways involved in AGs.

Profiling of infiltrating immune cells

The CIBERSORT algorithm is a linear support vector regression-based machine learning method that accurately identifies the relative abundance of 22 infiltrating immune cells in samples with unknown content and noise (e.g., solid tumors) compared to other deconvolution methods [44]. Therefore, we chose to use the CIBERSORT algorithm to quantify the relative abundance of 22 infiltrating immune cells, including T cells, helper follicular cells and monocytes, in all samples. In addition, to assess reliability of the results, CIBERSORT derives a P-value for the deconvolution of each sample using Monte Carlo sampling.

Building and validating a predictive nomogram

To optimize prognosis prediction, we integrated parameters with independent prognostic significance from multivariate Cox regression analysis to construct nomogram to predict 1-year, 3-year, and 5-year OS survival in GC patients. The calibration curve was utilized to assess the predictive accuracy and discriminatory power of the developed nomogram.

Statistical analysis

All statistical analyses were performed with R statistical software. Wilcoxon test was used to compare the differences high- and low-risk groups. For all assays, two-sided P value of less than 0.05 indicated a statistically significant difference.

Availability of data and materials

The raw data of this study are derived from the TCGA database ( and GEO data portal (, which are publicly available databases.


  1. 1.

    Siegel RL, Miller KD, Jemal A. Cancer statistics, 2015. CA Cancer J Clin. 2015;65(1):5–29.

    Article  Google Scholar 

  2. 2.

    Torre LA, Bray F, Siegel RL, Ferlay J, Lortet-Tieulent J, Jemal A. Global cancer statistics, 2012. CA Cancer J Clin. 2015;65(2):87–108.

    Article  Google Scholar 

  3. 3.

    Eusebi LH, Telese A, Marasco G, Bazzoli F, Zagari RM. Gastric cancer prevention strategies: a global perspective. J Gastroenterol Hepatol. 2020;35(9):1495–502.

    Article  Google Scholar 

  4. 4.

    Machlowska J, Baj J, Sitarz M, Maciejewski R, Sitarz R. Gastric cancer: epidemiology, risk factors, classification, genomic characteristics and treatment strategies. Int J Mol Sci. 2020;21(11):4012.

    Article  Google Scholar 

  5. 5.

    Cristescu R, Lee J, Nebozhyn M, Kim KM, Ting JC, Wong SS, Liu J, Yue YG, Wang J, Yu K, et al. Molecular analysis of gastric cancer identifies subtypes associated with distinct clinical outcomes. Nat Med. 2015;21(5):449–56.

    Article  Google Scholar 

  6. 6.

    Fang C, Wang W, Deng JY, Sun Z, Seeruttun SR, Wang ZN, Xu HM, Liang H, Zhou ZW. Proposal and validation of a modified staging system to improve the prognosis predictive performance of the 8th AJCC/UICC pTNM staging system for gastric adenocarcinoma: a multicenter study with external validation. Cancer Commun. 2018;38(1):67.

    Article  Google Scholar 

  7. 7.

    López-Otín C, Blasco MA, Partridge L, Serrano M, Kroemer G. The hallmarks of aging. Cell. 2013;153(6):1194–217.

    Article  Google Scholar 

  8. 8.

    He S, Sharpless NE. Senescence in health and disease. Cell. 2017;169(6):1000–11.

    Article  Google Scholar 

  9. 9.

    Mozaffarian D, Benjamin EJ, Go AS, Arnett DK, Blaha MJ, Cushman M, Das SR, de Ferranti S, Després JP, Fullerton HJ, et al. Heart disease and stroke statistics-2016 update: a report from the American Heart Association. Circulation. 2016;133(4):e38-360.

    Google Scholar 

  10. 10.

    Moreno-Gonzalez I, Soto C. Misfolded protein aggregates: mechanisms, structures and potential for disease transmission. Semin Cell Dev Biol. 2011;22(5):482–7.

    Article  Google Scholar 

  11. 11.

    Chhabra P, Brayman KL. Stem cell therapy to cure type 1 diabetes: from hype to hope. Stem Cells Transl Med. 2013;2(5):328–36.

    Article  Google Scholar 

  12. 12.

    Issa JP. Aging and epigenetic drift: a vicious cycle. J Clin Investig. 2014;124(1):24–9.

    Article  Google Scholar 

  13. 13.

    Lee S, Schmitt CA. The dynamic nature of senescence in cancer. Nature Cell Biol. 2019;21(1):94–101.

    Article  Google Scholar 

  14. 14.

    Johnson SC, Rabinovitch PS, Kaeberlein M. mTOR is a key modulator of ageing and age-related disease. Nature. 2013;493(7432):338–45.

    Article  Google Scholar 

  15. 15.

    Henriksson M, Lüscher B. Proteins of the Myc network: essential regulators of cell growth and differentiation. Adv Cancer Res. 1996;68:109–82.

    Article  Google Scholar 

  16. 16.

    Ko LJ, Prives C. p53: puzzle and paradigm. Genes Dev. 1996;10(9):1054–72.

    Article  Google Scholar 

  17. 17.

    Chen X, Zhang D, Jiang F, Shen Y, Li X, Hu X, Wei P, Shen X. Prognostic prediction using a stemness index-related signature in a cohort of gastric cancer. Front Mol Biosci. 2020;7:570702.

    Article  Google Scholar 

  18. 18.

    Luo T, Du Y, Duan J, Liang C, Chen G, Jiang K, Chen Y, Chen Y. Development and validation of a scoring system based on 9 glycolysis-related genes for prognosis prediction in gastric cancer. Technol Cancer Res Treat. 2020;19:1533033820971670.

    Google Scholar 

  19. 19.

    Ren H, Zhu J, Yu H, Bazhin AV, Westphalen CB, Renz BW, Jacob SN, Lampert C, Werner J, Angele MK, et al. Angiogenesis-related gene expression signatures predicting prognosis in gastric cancer patients. Cancers. 2020;12(12):3685.

    Article  Google Scholar 

  20. 20.

    Kang C, Jia X, Liu H. Development and validation of a RNA binding protein gene pair-associated prognostic signature for prediction of overall survival in hepatocellular carcinoma. Biomed Eng Online. 2020;19(1):68.

    Article  Google Scholar 

  21. 21.

    Heinäniemi M, Nykter M, Kramer R, Wienecke-Baldacchino A, Sinkkonen L, Zhou JX, Kreisberg R, Kauffman SA, Huang S, Shmulevich I. Gene-pair expression signatures reveal lineage control. Nat Method. 2013;10(6):577–83.

    Article  Google Scholar 

  22. 22.

    Li B, Cui Y, Diehn M, Li R. Development and validation of an individualized immune prognostic signature in early-stage nonsquamous non-small cell lung cancer. JAMA Oncol. 2017;3(11):1529–37.

    Article  Google Scholar 

  23. 23.

    Zhou L, Wu Y, Xin L, Zhou Q, Li S, Yuan Y, Wang J, Wu D. Development of RNA binding proteins expression signature for prognosis prediction in gastric cancer patients. Am J Transl Res. 2020;12(10):6775–92.

    Google Scholar 

  24. 24.

    Wen F, Huang J, Lu X, Huang W, Wang Y, Bai Y, Ruan S, Gu S, Chen X, Shu P. Identification and prognostic value of metabolism-related genes in gastric cancer. Aging. 2020;12(17):17647–61.

    Article  Google Scholar 

  25. 25.

    Domingues P, González-Tablas M, Otero Á, Pascual D, Miranda D, Ruiz L, Sousa P, Ciudad J, Gonçalves JM, Lopes MC, et al. Tumor infiltrating immune cells in gliomas and meningiomas. Brain Behav Immun. 2016;53:1–15.

    Article  Google Scholar 

  26. 26.

    Liu X, Wu S, Yang Y, Zhao M, Zhu G, Hou Z. The prognostic landscape of tumor-infiltrating immune cell and immunomodulators in lung cancer. Biomed Pharmacother. 2017;95:55–61.

    Article  Google Scholar 

  27. 27.

    Popovici V, Budinska E, Tejpar S, Weinrich S, Estrella H, Hodgson G, Van Cutsem E, Xie T, Bosman FT, Roth AD, et al. Identification of a poor-prognosis BRAF-mutant-like population of patients with colon cancer. J Clin Oncol. 2012;30(12):1288–95.

    Article  Google Scholar 

  28. 28.

    Chen H, Yuan K, Wang X, Wang H, Wu Q, Wu X, Peng J. Overexpression of RECQL4 is associated with poor prognosis in patients with gastric cancer. Oncol Lett. 2018;16(4):5419–25.

    Google Scholar 

  29. 29.

    Duan J, Zhang H, Li S, Wang X, Yang H, Jiao S, Ba Y. The role of miR-485-5p/NUDT1 axis in gastric cancer. Cancer Cell Int. 2017;17:92.

    Article  Google Scholar 

  30. 30.

    Kodama M, Kitadai Y, Sumida T, Ohnishi M, Ohara E, Tanaka M, Shinagawa K, Tanaka S, Yasui W, Chayama K. Expression of platelet-derived growth factor (PDGF)-B and PDGF-receptor β is associated with lymphatic metastasis in human gastric carcinoma. Cancer Sci. 2010;101(9):1984–9.

    Article  Google Scholar 

  31. 31.

    Chen X, Yu X, Shen E. Overexpression of CDKN2B is involved in poor gastric cancer prognosis. J Cell Biochem. 2019;120(12):19825–31.

    Article  Google Scholar 

  32. 32.

    Ai X, Xiang L, Huang Z, Zhou S, Zhang S, Zhang T, Jiang T. Overexpression of PIK3R1 promotes hepatocellular carcinoma progression. Biol Res. 2018;51(1):52.

    Article  Google Scholar 

  33. 33.

    Arroyo-Solera I, Pavón M, León X, López M, Gallardo A, Céspedes MV, Casanova I, Pallarès V, López-Pousa A, Mangues MA, et al. Effect of serpinE1 overexpression on the primary tumor and lymph node, and lung metastases in head and neck squamous cell carcinoma. Head Neck. 2019;41(2):429–39.

    Google Scholar 

  34. 34.

    Domogauer JD, de Toledo SM, Azzam EI. A mimic of the tumor microenvironment: a simple method for generating enriched cell populations and investigating intercellular communication. J Visual Exp. 2016;115:54429.

    Google Scholar 

  35. 35.

    Liotta LA, Kohn EC. The microenvironment of the tumour–host interface. Nature. 2001;411(6835):375–9.

    Article  Google Scholar 

  36. 36.

    Mlecnik B, Bindea G, Kirilovsky A, Angell HK, Obenauf AC, Tosolini M, Church SE, Maby P, Vasaturo A, Angelova M, et al. The tumor microenvironment and immunoscore are critical determinants of dissemination to distant metastasis. Sci Transl Med. 2016;8(327):327ra326.

    Article  Google Scholar 

  37. 37.

    Zeng D, Li M, Zhou R, Zhang J, Sun H, Shi M, Bin J, Liao Y, Rao J, Liao W. Tumor microenvironment characterization in gastric cancer identifies prognostic and immunotherapeutically relevant gene signatures. Cancer Immunol Res. 2019;7(5):737–50.

    Article  Google Scholar 

  38. 38.

    Zeng D, Zhou R, Yu Y, Luo Y, Zhang J, Sun H, Bin J, Liao Y, Rao J, Zhang Y, et al. Gene expression profiles for a prognostic immunoscore in gastric cancer. Br J Surg. 2018;105(10):1338–48.

    Article  Google Scholar 

  39. 39.

    Pan Q, Wang L, Chai S, Zhang H, Li B. The immune infiltration in clear cell renal cell carcinoma and their clinical implications: a study based on TCGA and GEO databases. J Cancer. 2020;11(11):3207–15.

    Article  Google Scholar 

  40. 40.

    Urakawa S, Yamasaki M, Goto K, Haruna M, Hirata M, Morimoto-Okazawa A, Kawashima A, Iwahori K, Makino T, Kurokawa Y, et al. Peri-operative monocyte count is a marker of poor prognosis in gastric cancer: increased monocytes are a characteristic of myeloid-derived suppressor cells. Cancer Immunol Immunother. 2019;68(8):1341–50.

    Article  Google Scholar 

  41. 41.

    Olofsson Bagge R, Kicinski M, Faries MB, Gyorki DE, Isaksson K, Katsarelias D, Lo S, Moncrieff M, Spillane A, Suciu S, et al. Comment on “factors affecting sentinel node metastasis in thin (t1) cutaneous melanomas: development and external validation of a predictive nomogram.” J Clin oncol. 2020;38(27):3233–4.

    Article  Google Scholar 

  42. 42.

    Bibault JE, Chang DT, Xing L. Development and validation of a model to predict survival in colorectal cancer using a gradient-boosted machine. Gut. 2020.

  43. 43.

    Jehi L, Ji X, Milinovich A, Erzurum S, Rubin BP, Gordon S, Young JB, Kattan MW. Individualizing risk prediction for positive coronavirus disease 2019 testing: results from 11,672 patients. Chest. 2020;158(4):1364–75.

    Article  Google Scholar 

  44. 44.

    Newman AM, Liu CL, Green MR, Gentles AJ, Feng W, Xu Y, Hoang CD, Diehn M, Alizadeh AA. Robust enumeration of cell subsets from tissue expression profiles. Nat Method. 2015;12(5):453–7.

    Article  Google Scholar 

Download references


The authors gratefully acknowledge the Gene Expression Omnibus (GEO) database, and the Cancer Genome Atlas (TCGA) database, which made the data available.


We are very grateful for the support of the National Natural Science Foundation of China (Grant number: 81702336) and the Peking University International Hospital Research Grant (No.YN2019XQ01).

Author information




This research was conducted in collaboration with all authors. YZ and YY performed the data curation and analysis. YZ and NN analyzed and interpreted the results. YZ, ZS and YY drafted and reviewed the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yichao Yan or Yingjiang Ye.

Ethics declarations

Ethics approval and consent to participate

Not necessary.

Consent for publication

Not applicable.

Competing interests

The authors declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1: Figure S1.

The optimal cut-off value of the ARGPs risk-score obtained by the time-dependent ROC curve analysis.

Additional file 2: Figure S2.

Functional enrichment of 39 unique aging-related genes using the Metascape database. (A) GO enrichment analysis; (B) KEGG enrichment analysis.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit The Creative Commons Public Domain Dedication waiver ( applies to the data made available in this article, unless otherwise stated in a credit line to the data.

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Zhang, Y., Yan, Y., Ning, N. et al. A signature of 24 aging‑related gene pairs predict overall survival in gastric cancer. BioMed Eng OnLine 20, 35 (2021).

Download citation


  • Gastric cancer
  • Aging-related gene pairs
  • Prognosis
  • Nomogram