ABSTRACT
Using biomarkers as prediction tools or therapeutic targets can be a valuable strategy in transplantation. Recent studies identified biomarkers of acute rejection (AR) and operational tolerance (TOL) through the application of meta-analysis. In this study, we comparatively analyzed the signature genes in acute rejection and operational tolerance seen in human allogeneic transplantations using massive bioinformatical meta-analysis. To identify the signature genes in opposite immunological conditions, AR and TOL, we first collected the 1,252 gene expression data specifically intended for those circumstances. Then we excluded based on biological cut-values, Principal Component Analysis (PCA) as well as Multi-Dimensional Scaling (MDS). Using differentially expressed genes (DEGs) from meta- analysis, we then applied a ranked scoring system to identify the signature genes of AR and TOL. We identified 53 up-regulated and 32 down-regulated signature genes in acute rejection condition. Among them, ISG20, CXCL9, CXCL10, CCL19, FCER1G, PMSE1, UBD are highly expressed in AR condition. In operational tolerance, we identified 110 up-regulated and 48 down-regulated signature genes. TCL1A, BLNK, MS4A1, EBF1, IGHM are up-regulated in TOL condition. These genes are highly representative of AR or TOL across the different organs such as liver, kidney and heart. Since immune response is the sum of complex biological and molecular dynamics, these signature genes as well as pathway analysis using a systems biology approach could be used to catch the insights of the certain pathways that would be overlooked with the conventional gene-level comparative analysis.
Keywords: Signature genes; Systems biology; Graft rejection; Transplantation tolerance
INTRODUCTION
Solid organ transplantation is the best therapeutic option for end-stage organ failure. Due to antigenic differences, allogeneic organ transplantation induces immune responses against the graft, necessitating the application of immunosuppressants to prevent graft
Original Article
Received: Apr 25, 2017 Revised: Aug 6, 2017 Accepted: Aug 10, 2017
*Correspondence to Ji Won Oh
Biomedical Research Institute, Kyungpook National University Hospital, Department of Anatomy, Kyungpook National University School of Medicine, 680 Gukchaebosang-ro, Jung-gu, Daegu 41944, Korea.
Tel: +82-53-420-4806 E-mail: [email protected] Yong-Hee Kim
Department of Microbiology, Kyungpook National University School of Medicine, 680 Gukchaebosang-ro, Jung-gu, Daegu 41944, Korea.
Tel: +82-53-420-4842 E-mail: [email protected]
Copyright © 2017. The Korean Association of Immunologists
This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License (https://
creativecommons.org/licenses/by-nc/4.0/) which permits unrestricted non-commercial use, distribution, and reproduction in any medium, provided the original work is properly cited.
Conflict of Interest
The authors declare no potential conflicts of interest.
Jeong-Woo Choi1,2, Yong-Hee Kim3,4,*, Ji Won Oh1,2,*
1Biomedical Research Institute, Kyungpook National University Hospital, Daegu 41404, Korea
2Department of Anatomy, Kyungpook National University School of Medicine, Daegu 41944, Korea
3Department of Microbiology, Kyungpook National University School of Medicine, Daegu 41944, Korea
4Xenotransplantation Research Center, Seoul National University College of Medicine, Seoul 03080, Korea
Comparative Analyses of Signature Genes in Acute Rejection and
Operational Tolerance
Abbreviations
AR, acute rejection; DEG, differentially expressed gene; DSP, Desmoplakin;
GEO, Gene Expression Omnibus; GSE, Gene Expression Omnibus series; MDS, Multi-Dimensional Scaling; PCA, Principal Component Analysis; PPI, Protein-Protein Interaction; STA, stable; TOL, tolerance Author Contributions
Conceptualization: Choi JW, Kim YH, Oh JW. Data curation: Choi JW, Oh JW. Formal analysis: Choi JW, Kim YH, Oh JW. Funding acquisition: Kim YH, Oh JW. Investigation:
Choi JW, Kim YH, Oh JW. Methodology: Choi JW, Oh JW. Project administration: Kim YH, Oh JW. Resources: Choi JW, Oh JW. Software:
Choi JW, Oh JW. Supervision: Kim YH, Oh JW.
Validation: Kim YH, Oh JW. Visualization: Choi JW, Kim YH, Oh JW. Writing - original draft:
Choi JW, Kim YH, Oh JW. Writing - review &
editing: Choi JW, Kim YH, Oh JW.
rejection. However, some grafts are rejected by physiological immune responses of the recipients even under the use of immunosuppressants (1). On the contrary, a few recipients acquire operational tolerance against the allogeneic graft and can terminate the immunosuppressants (2). To elucidate how to acquire immunological tolerance and avoid rejection against the graft is the fundamental goal for solid organ transplantation. Using biomarkers as prediction tools or therapeutic indicators not only in rejection situation but also in immunological tolerance can be a valuable strategy in transplantation. If we detect biomarkers of acute rejection (AR), we may try to prevent the ongoing rejection with timely interventions. In addition, the biomarker molecule can possibly be a target for developing more specific immunosuppressant (3). If we identify biomarkers of transplantation tolerance (TOL), we may terminate immunosuppressant administration and avoid side effects such as opportunistic infections. In addition, a method for enhancing the physiological function of the biomarker molecules can be a promising tolerance induction strategy (4). In these ways, identifying transplantation biomarkers may contribute to personalized medicine and development of novel therapeutics.
Recently, Khatri et al. (5) reported 11 genes that were significantly overexpressed in AR in human allogeneic transplantations across four solid organs. They utilized and evaluated 8 independent gene expression studies of organ biopsy specimens from liver, kidney, lung, and heart transplant patients. Using meta-analysis approaches, they were able to identify genes specifically overexpressed in AR. On the other hand, Baron et al. (6) sought to distinguish biomarkers for tolerance following human allogeneic kidney transplantation. At first, they could not identify overlapping gene signatures among 5 independent microarray data studied on blood of tolerant groups (7-11). At that point, they adopted meta-analysis strategy used by Khatri et al. (5). Through the meta-analysis by integration of 5 datasets, they successfully identified a common gene signature consisting of the top-20 tolerance biomarkers.
Medical phenomena or body reaction after therapeutic intervention like solid organ transplantation are not induced by a single cause but rather the sum of complex biological and molecular dynamics (12). To shed light on the pathological reason and genetic
susceptibility of certain condition, it is important to investigate and comprehend the network of each molecules as well as the meaning of integral cellular behaviors instead of traditional methods using a single gene perturbation analysis (13-15). To identify the signature gene sets which can drive the molecular cascade as well as pathway analyses with systems biology approach are the recent bioinformatics tools for this specific purpose, leading to a hidden insight into specific biological phenomena (16,17). Especially, due to the production of high- throughput data, it is highly critical for traditional biologists to analyze these massive data as well as to lead the meaningful experimental hypothesis. By identifying specific signature genes enriched in 2 opposite immunologic results, we could find implications for preventing acute rejection and inducing transplantation tolerance.
In this study, we comparatively analyzed the published transplantation studies of 2 opposite consequences, rejection and tolerance, using the ranked scoring system. Instead of simple comparison of gene lists, we conducted the comprehensive integrated meta-analysis using 1,252 public available gene expression data as well as gene-set enrichment pathway analysis of signature genes obtained from the current study. In this context, the primary objective of our study is to identify and to suggest the signature gene groups indicative of tolerance and acute rejection.
MATERIALS AND METHODS
Principal Component Analysis (PCA) and bioinformatical tools
Acute rejection (AR) and operational tolerance (TOL) information were taken from public available database, Gene Expression Omnibus (GEO) microarray datasets. In AR condition, heart (GSE4470, GSE2596, GSE9377), kidney (GDS724, GSE9493), liver (GSE13440), and lung (GSE6095) datasets were used to analyze the expression pattern of AR. In TOL condition, kidney (GSE14655, GSE22229, GSE22707K, GSE47755) and liver (GSE22707L) were used to examine the gene expression pattern compared with stable condition after transplantation.
We filtered GEO information to select the stable (STA), AR, and TOL groups. Next, we modified the datasets by our previously reported paper (18). Briefly, quantile normalization, PCA, MDS, and heat-map visualization were conducted by the R 3.4 program (R Project, Vienna, Austria). The visualization form of PCA and heat-map referenced the previously reported paper (18). Adobe Illustrator CS6 (Adobe Systems Inc., San Jose, CA, USA) was used to draw a summary figure.
Scoring system development for identification of the signature genes
To identify the signature genes across the different organs, we developed the scoring system.
In principle, number of overlap genes that were involved in up-regulated genes or down- regulated genes in AR or TOL were marked score. Each gene has different maximum score following the microarray platform of GEO series (GSE) information because each GSE dataset used the different microarray chip. Then, the maximum score each gene can get is considered as the base score. “Score divided by base score (score/base score)” was used to validate the score rank. In this study, genes in condition of score ≥2, score/base score >0.5 were selected to be the signature genes. In AR group, 53 up-regulated signature genes were selected from 1,049 total differentially expressed genes (DEGs) list and 32 down-regulated signature genes were selected from 1,107 DEGs list. In TOL group, 110 up-regulated signature genes were selected from 920 DEGs and 48 down-regulated signature genes were selected from 1,004 DEGs list, respectively.
Pathway analysis with gene-set enrichment analysis
We applied several gene-set association analysis tools. Well-known database libraries such as ChIP Enrichment Analysis (ChEA) (19), Encyclopedia of DNA Elements (ENCODE) Transcription Factor ChIP-seq, ENCODE and ChEA consensus Transcription Factors, TargetScan microRNA, Kyoto Encyclopedia of Genes and Genomes (KEGG) 2016, Reactome 2016, BioCarta 2016, Wiki Pathways 2016, Protein-Protein Interaction (PPI) hub proteins, Human gene atlas, National Cancer Institute (NCI)-Nature 2016, and Gene Ontology (GO) tools were used initially for the pilot screening analysis. Application of these methods led us to find integrative insights into molecular pathways as well as gene ontologies and transcription factors. Gene-set enrichment analyses of 2 different gene lists (AR and TOL) were performed independently.
Statistical analysis of gene-set enrichment analysis
For enrichment analysis in this study, we applied the standard method of the Fisher exact test used in other common bioinformatics analysis studies (20). In general, the Fisher exact test is a proportion test that compares every possible event and expected linked events. The proportions of the query input list of overexpressed genes involved in the specific lists in the gene set libraries were calculated. This proportion is considered as a key factor in the bioinformatics computation. If a query input list has higher proportion with a list of specific
gene set library compared to random query input lists, more likely statistical p-value in the list of specific gene library is projected to be lower, leading to a meaningful inference. The enriched pathway or term was considered as a significant pathway by calculated p-value under 0.05. We visualized GO molecular function and PPI hub proteins using Network2Canvas tools (Icahn School of Medicine at Mount Sinai, New York, NY, USA) (20).
RESULTS
Global data overview of total gene sets in AR and TOL groups
To check an overall expression of GEO information related with AR and TOL condition, we conducted PCA and heat-map visualization. First, we separated AR and TOL database into specific conditions sorted by official gene symbol ID. Since the platform of each transcriptome database uses different gene sets, for the precise comparison, we needed to make the
composite dataset for each condition. We used the overlapped gene sets shared by every database set of each condition for the global comparison. After filtering, we obtained total 2,269 AR common genes across platforms and 4,335 TOL genes across platforms for the later comparison. Intriguingly, PCA results indicated that dataset of both AR and TOL groups are significantly biased based on their technical conditions. (Fig. 1A, 1B, 1D, and 1E). We could find that the hypothetical intention of our experiment, the gap between STA and disease groups, are much smaller than the gap between each experimental condition, called the batch effect, based on PCA. To further investigation, we used heatmap to display the expression pattern in the AR and TOL specific genes. Heat-map results show that the inconsistent pattern in each disease condition that should have been highlighted. (Fig. 1C and 1F). This global overview leads us to make a different strategy to analyze AR and TOL information, meaning that DEGs were examined in each dataset comparison reducing batch effect. (Fig. 1G).
Meta-analysis of transcriptome expression of AR group compared with stable group
To identify the signature genes in the condition of AR, we compared the gene expression patterns of the different organs such as liver, kidney and heart defined as AR condition compared with stable condition after transplantation. PCA results show that the distinct clusters of STA and AR groups in every organ (Fig. 2A). It concluded that global gene expression data of AR condition are significantly different from the stable condition. The difference between expression pattern of AR and STA groups are more distinct than each individual difference. Heat-map visualization of DEGs displayed the noticeable expressional pattern of AR datasets compared with stable sets (Fig. 2B). We utilized these DEGs of each experimental set for later signature gene identification in condition of AR (Supplementary Table 1).Meta-analysis of transcriptome expression of TOL group compared with stable group
As we analyzed AR group, we compared the transcriptome data of TOL group with expression pattern of STA group. In this analysis, we used liver and kidney datasets while we utilized liver, kidney and heart datasets in AR transcriptome. PCA results show that STA and TOL groups were clearly distinguished (Fig. 3A). Intriguingly, kidney profiles containing the large dataset like GSE47755, the expression patterns of TOL group are centrally located compared with those of STA group. In heat-map visualization of 200 up-regulated genes and 200 down- regulated genes, we identified prominent expression pattern of TOL DEGs (Fig. 3B). We used the 200 DEGs of each direction (up-regulated 200 and down-regulated 200) from each AR and TOL datasets (p<0.05) for identifying the signature genes using the scoring system (Supplementary Table 1).A
20
−20 10 0
0
−20 20
PC2-9.7%
PC1-19.4%
KIDNEY9493
KIDNEY724 HEART2596
HEART4470 HEART1684
HEART9377 LIVER13440
−10
−30
Group GDS1684(STA) GDS1684(AR)
GDS724(STA) GDS724(AR) GSE2596(STA) GSE2596(AR) GSE4470(STA) GSE4470(AR) GSE9377(STA) GSE9377(AR) GSE9493(STA) GSE9493(AR) GSE13440(STA) GSE13440(AR)
HEART1684
B
−3.0
−3.5
−4.0
−5.0
−4.5
−20
−21
−22
−23
PC2-9.7%
PC1-19.4%
Group GDS1684(STA) GDS1684(AR)
Color key
Row Z-score
C
Stable AR
Up-regulated 200 genes
Down-regulated 200 genes
0 4
−4
AR total
TOL total
KIDNEY4775
D
30
−20 10 0
10
−20 30
PC2-9.1%
PC1-17.4%
20
−10
−30 −10 0 20
E
7.5 5.0 2.5
−5.0
−2.5
10 5 0
−5
PC2-6.4%
PC1-49.1%
0.0
Color key
Row Z-score
F
Stable TOL
Up-regulated 200 genes
Down-regulated 200 genes
0 4
−4
Group 4775(K-STA) 4775(K-TOL) 22707(K-STA) 22707(K-TOL) 22707(L-STA) 22707(L-TOL) 2229(K-STA) 2229(K-TOL)
Group 22707(K-STA) 22707(K-TOL) 22707(L-STA) 22707(L-TOL) 2229(K-STA) 2229(K-TOL) KIDNEY22707
KIDNEY2229 LIVER22707
G GEO database
AR 8 datasets (n=357): STA, AR, HV TOL 6 datasets (n=895): STA, TOL, HV, CR
• Scoring system (score >2, score/base score >0.5)
• Comparison with Khatri et al. or Baron et al. signature genes
• Sort datasets separately
• Analyze 200 DEGs
• Up-DEGs and down-DEGs
• PCA
• Exclude outlier samples
• Exclude invalid dimension data AR 7 datasets (n=236): STA 140, AR 96
TOL 6 datasets (n=671): STA 530, AR 141
AR 6 datasets (n=155): STA 90, AR 65 TOL 6 datasets (n=627): STA 497, AR 130
AR 6 meta analysis sets TOL 6 meta analysis sets
AR signature genes (up: 53, down: 32) TOL signature genes (up: 110, down: 48)
• STA, AR, TOL samples
• Total number of sample >10
• Number of gene list >5,000
Figure 1. Global analysis of total AR and TOL groups with experimental strategy. (A) Global PCA results using 7 different microarray datasets related to AR. Each group made different clusters regardless of what condition they have. (B) A heart microarray dataset (GDS1684) shows wide spread global expression data and not well distinguished data clustering. It is excluded for later investigation. (C) Heat map visualization with AR global analysis. We selected only 200 genes in each direction (up-regulated and down-regulated). (D) Global PCA results using 4 different microarray datasets related to TOL. The datasets are significantly biased because of wide spreading patterns of kidney GSE4775 datasets. (E) Global PCA results without GSE4775 data. Each group made different clusters regardless of what condition they have. (F) Heat map visualization with TOL global analysis. We selected only 200 genes in each direction (up-regulated and down-regulated). (G) Comprehensive experimental strategy features.
20
−10 10
0
−20 −10 0 10
PC2-8.5%
PC1-10.6%
40
−20 20
0
−50 −25 0 25
PC2-24.5%
PC1-32.7%
A AR sets
B
50 100
−50 50 0
−100 0 100
PC2-7.2%
PC1-20.1%
30 20
−20
−10 10 0
−25 0 25
PC2-11.8%
PC1-13.4%
20
−10 10
0
−25 0 25
PC2-12.9%
PC1-17.9% 50
10
−10 0
−5 0 10
PC2-24.2%
PC1-25.2%
Liver(GSE13440) Kidney(GDS724) Kidney(GSE9493) Heart(GSE2596) Heart(GSE4470) Heart(GSE9377)
5 15
−10
Total: 21 Total: 16 Total: 36 Total: 44 Total: 24 Total: 14
Up-regulated 200 genes in AR
Down-regulated 200
genes in AR
Stable AR Stable AR Stable AR Stable AR Stable AR Stable AR
Liver(GSE13440) Kidney(GDS724) Kidney(GSE9493) Heart(GSE2596) Heart(GSE4470) Heart(GSE9377)
−2 2 Color key
Row Z-score −404
Color key
Row Z-score −302
Color key
Row Z-score −302
Color key Row Z-score
−6 0 4 Color key Row Z-score
−303 Color key Row Z-score AR
Stable
Figure 2. Analysis of PCA and Heatmap visualization of DEGs in AR group sets. (A) PCA analysis of AR microarray datasets obtained from GEO. The organ names of each data are shown at the top with GSE number. Stable condition is visualized in gray triangle and AR in red circle. (B) Heatmap visualization of DEGs in AR group sets. We identified 200 up-regulated and 200 down-regulated genes in AR group.
50
−25 25
0
−30 0 30 60
PC2-9.8%
PC1-22.7%
60
−20
−40 40 20 0
−30 0 30
PC2-12.5%
PC1-22.4%
A TOL sets
B
20
−20
−10
−30 10 0
−25 0 25
PC2-9.7%
PC1-20.8%
40 20
−40
−20 0
−40 0 40
PC2-9.1%
PC1-27.1%
−80
40
−20 20
0
−20 0 20
PC2-6.8%
PC1-7.9% 40
20 10
−10
−20 0
−30 0 30
PC2-8.1%
PC1-25.6%
Liver(GSE22707) Kidney(GSE22707) Kidney(GSE1465) Kidney(GSE22229) Kidney(GSE47755) Kidney(GSE14655)
Liver(GSE22707) Kidney(GSE22707) Kidney(GSE1465) Kidney(GSE22229) Kidney(GSE47755) Kidney(GSE14655)
−60
Total: 23 Total: 21 Total: 53 Total: 38 Total: 418 Total: 74
Up-regulated 200 genes in TOL
Down-regulated 200
genes in TOL
Stable TOL Stable TOL Stable TOL Stable TOL Stable TOL Stable TOL
−2 2 Color key
Row Z-score −10010
Color key Row Z-score
−404 Color key
Row Z-score −404
Color key
Row Z-score −404
Color key Row Z-score
−604 Color key Row Z-score TOL
Stable
Figure 3. Analysis of PCA and Heatmap visualization of DEGs in TOL group sets. (A) PCA analysis of TOL microarray datasets obtained from GEO. The organ names of each data are shown at the top with GSE number. Stable condition is visualized in gray triangle and TOL in blue circle. (B) Heatmap visualization of DEGs in TOL group sets. We identified 200 up-regulated and 200 down-regulated genes in TOL group.
Scoring system for identifying the global signature genes in AR and TOL
To identify specific signature genes in AR and TOL conditions, we developed the ranked system using overall DEGs in those conditions. Because of different microarray platforms used in meta-analysis, we cannot get consistent scoring system due to the absence of specific genes in a certain microarray platform. To overcome this, we used “base score” which is an identical score that each gene can get maximally. Then we divided the score of each gene by“base score” of each gene (Fig. 4A and 4B). “Score/base score” was used to validate the score rank (Supplementary Table 2). The signature genes were selected by condition of score ≥2 and score/base ≥0.5. In AR, 53 up-signature genes were selected by the scoring system and ISG20, CXCL9, CCL19, CXCL10, and FCER1G were marked as top 5 genes. Total 32 down-signature genes were filtered by scoring system and SORD, MAOA, CRYL1, IL13RA2, and Desmoplakin (DSP) were marked as top 5 genes. In TOL group, 110 up-signature genes were selected and TCL1A, BLNK, MS4A1, EBF1, and FCRL2 were marked as top 5 genes. In case of down-signature genes, 48 genes were selected and PFKFB3, F2RL1, SLC2A14, ANXA3, and SLC26A8 were marked as top 5 genes.
Expression pattern and validation of signature genes across organs
To validate the signature genes in AR or TOL group, we analyzed expression pattern of top 5 genes in heart and kidney datasets. In AR group, top 5 signature genes such as ISG20, CXCL9,
B A
1.00
0.50
2 3 4 5 6
0 1
Up-score/base score
Score
n=198 n=209 n=116 n=126
n=216 n=38 n=57 n=10
n=31 n=14
n=5 n=2 n=11n=3 n=14 n=1
n=44
n=10 n=40
n=160 n=134 n=130
n=34 n=17 n=183
n=7 n=16 n=2n=2
n=2n=1 n=6 n=137 n=9
n=3 n=20 n=108 n=34 n=209 n=11
n=145
n=33 n=5
n=1 n=1 n=1 n=9 n=12 n=26 n=53 n=228 n=156 n=147 n=275
n=38 n=8 n=11 n=207 n=215 n=2n=1
n=92 n=6
0.75
0.25
AR AR
1.00.8 0.60.4 0.2
1.00.8 0.60.4 0.2
1.00
0.50
2 3 4 5 6
0 1
Down-score/base score
Score 0.75
0.25
1.00
0.50
2 3 4 5 6
0 1
Up-score/base score
Score 0.75
0.25
TOL TOL
1.00.8 0.60.4 0.2
1.00.8 0.60.4 0.2
1.00
0.50
2 3 4 5 6
0 1
Down-score/base score
Score 0.75
0.25
Figure 4. Scoring visualization of AR signature genes and TOL signature genes. (A) Scoring visualization of up-regulated and down-regulated signature genes in AR condition. (B) Scoring visualization of up-regulated and down-regulated signature genes in TOL condition. Using DEGs of each datasets in AR and TOL, we identified the signature genes. Note X axis is a score, while Y is a ratio of up- and down-score/base score. Base score is determined by the total number of array sets containing a specific gene. Dashed circles are the boundary of the signature genes in each panel.
A
B
GSE4470 GSE9377 GSE9493 GDS724
FCER1GCXCL10CCL19Signature genes in up-regulated AR DEGs CXCL9ISG20
Kidney Heart
Not available
Not available
GSE22707 GSE22229 GSE14655-INT GSE14655-IOT
Kidney
FCRL2EBF1MS4A1Signature genes in up-regulated TOL DEGs BLNKTCL1A
AR up
signature Score Expression increase
(GSE4470, GSE9377, GSE9493, GDS724)
ISG20 4 3.4 1.5 4.1 2.3
CXCL9 4 10.0 2.1 3.1 7.9
CCL19 4 2.3 2.1 3.4 2.7
CXCL10 3 3.5 1.5 3.2 -
FCER1G 3 2.1 - 3.6 4.1
CD3D 3 - 1.6 3.2 3.6
PSME1 3 2.0 - - 2.2
UBD 3 4.8 2.3 - 5.8
CORO1A 3 3.9 - 3.6 2.3
TAP1 3 3.2 1.4 - 3.1
TIMP1 3 - 1.3 3.9 4.3
C1QB 3 - 1.4 4.8 8.2
IGSF6 3 3.0 1.3 - 2.7
PTPRC 3 2.8 1.5 - 6.1
UBE2L6 3 2.8 - - 3.3
TNFSF10 3 - - - 2.3
CD48 3 2.1 1.8 - 2.6
TNFAIP2 3 2.0 1.3 - 3.2
HLA-DMA 3 2.6 1.2 - 3.8
HLA-DRA 3 2.4 - 3.4 5.5
CD14 3 2.4 1.4 3.7 -
PLEK 3 2.3 1.4 - 2.7
CASP4 3 2.3 1.4 - 6.9
TOL up
signature Score Expression increase
(GSE22707, GSE22229, GSE14655-INT, GSE14655-IOT)
TCL1A 5 3.7 6.5 4.0 2.7
BLNK 5 2.1 2.9 2.4 2.1
MS4A1 5 2.1 2.3 1.7 2.1
EBF1 4 1.6 1.4 2.7 1.9
FCRL2 4 1.6 1.4 2.6 2.8
IGHM 4 2.1 2.1 2.6 1.5
PLEKHG1 4 1.4 1.5 2.1 1.7
CD200 4 2.0 2.0 2.1 1.8
BTLA 4 2.1 1.9 2.1 1.9
FCRLA 4 2.0 1.9 1.7 1.5
SH2D1B 4 - 1.8 1.7 2.4
PNOC 4 1.8 1.6 1.7 1.7
MGC24103 4 3.0 1.5 1.6 2.4
CLIC3 4 - 1.7 1.5 1.6
PCDH9 4 - 1.4 2.7 3.8
CD22 4 1.6 1.5 2.4 1.8
AKR1C3 4 - 2.6 1.6 2.0
CLC 4 3.0 1.4 1.5 2.3
CD79A 4 1.9 2.1 1.5 -
EBI2 3 - - 1.4 1.5
E2F5 3 1.6 1.6 - -
BANK1 3 1.7 1.7 - -
IGKC 3 1.6 1.7 2.5 6.9
NR3C2 3 - 1.6 1.7 1.7
QRSL1 3 1.5 1.5 1.7 -
PYHIN1 3 - 1.4 - 1.7
OCIAD2 3 - 1.4 1.5 1.6
PLEKHA1 3 - 1.5 1.5 1.5
CD247 3 - 1.5 1.5 -
Figure 5. Expression pattern and validation of signature genes across organs. (A) Expression patterns of top 5 genes in the signature genes of AR condition.
Table shows the global average expression level of the signature genes in AR sorted by score. (B) Expression patterns of top 5 genes in the signature genes of TOL condition. Table shows the global average expression level of the signature genes in TOL sorted by score.
CCL19, CXCL10, and FCER1G are up-regulated in heart (GSE4470 and GSE9377) datasets. Also, Top 5 signature genes displayed up-regulated expression pattern in overall kidney (GSE9493 and GDS724) datasets condition as well. We summarized the AR up-regulated signature gene list (score ≥3) with expression increase in each dataset (Fig. 5A).
In TOL group, top 5 signature genes (TCL1A, BLNK, MS4A1, EBF1, and FCRL2) were up- regulated in kidney (GSE22707, GSE22229, GSE14655-INT, GSE14655-IOT) datasets. We displayed the TOL up-signature gene list (score ≥3) and show the value of expression increase in each dataset (Fig. 5B).
DISCUSSION
In this study, we identified the signature genes in AR and TOL groups compared with normal STA group using 1,252 public available microarray databases (AR: 357 sets, TOL:
895 sets). Meta-analysis of these transplantation related gene sets leads us to specify the unique gene expression patterns in specific disease outcome. For the precise selection of signature genes, we adopted the ranked scoring systems and validate them in other public accessible experimental settings. These signature genes can be utilized as the biomarkers of a transplantation condition which can predict a potential therapeutic intervention.
Making composite datasets of different experimental settings failed to identify a strong association of signature genes because of large gap between each experimental setting. The fact that the gap between experimental hypothesis groups is significantly smaller than the gap between each individual experimental trial is comprehensively known by several other groups (5,6). It seems to be mainly because of that the experimental methods isolating transcriptome data are quite different from facility to facility as well as of that the collection of sampling varies not only how many hours after transplantation which can influence in the decay of transcriptome molecules, but also the health conditions or reaction tolerance of individual recipient patients (4-6,21). It is important to note that in meta-analysis, the comprehension of the consistent experimental setting and reducing non-hypothesis related specific factors, so called batch effects, are the important parameters to make integration sets.
In case of identifying signature genes that represent the certain disease condition, it is significant that whether those signature genes are globally represent of all circumstantial disease condition or only represent under specific condition. The meaningfulness of the signature genes of medical conditions such as kidney transplantation is also important in terms of organ specific molecular reaction (8-10). In our study, however, we are trying to configure the global signature genes in heart, liver and kidney. To overcome the difference between individual batch effects of each experiments, we adopted the scoring system of making composite results in both AR and TOL conditions. The advantage of the ranked scoring system is that it can reduce batch effects efficiently as well as it gives us the global landscape of signature genes (21). Our scoring system is based mainly on the gene sets of each microarray, leading certain genes to be skipped inevitably. To overcome this artefact, we normalized the scores of every signature genes by applying “base score.” This normalization gives not only the insight of global AR and TOL signature gene patterns but the consideration of specific organ condition as well.
After identifying the signature genes, researchers can investigate these genes more using the enrichment analysis (22). The enrichment analysis can decipher the hidden pathways which are strongly related with a query signature gene-set in term of an interaction network (Supplementary Figs. 1 and 2). Accompanying with the proper functional interpretation of gene-set, it is possible to deduce novel insight for the experimental purpose. However, the enrichment analysis is the computational and arithmetic tool, occasionally leading deviated, unconventionally enriched pathways. At the same time, certain pathway groups can be highlighted due to the small number of input driver genes, entailing careful and meticulous supervision. For example, only single gene can make the pathway significant in a specific enriched pathway, resulting in p-value under 0.05 because of unsupervised computational calculation. In this case, careful speculation after analysis is necessary to avoid misinterpretation.
When we apply the network analysis of opposite pathological conditions, we encounter the commonly involved genes or pathways such as B-cell receptor (BCR) pathways in Supplementary Fig. 1. The main meaning of the commonly involved genes or pathways in opposite conditions can be explained by 2 fundamental arguments. First, the enriched pathways do not have the specific direction (positively regulated or negatively regulated for later cascade). This infers that even though certain pathways are highlighted in both conditions, it may have a different outcome in downstream. In common network analysis, it is difficult to decide which direction the enriched pathway is, unless that is specifically notified as positive or negative. Second, there may have the common immunologic system which may be activated by every immune situation like acute rejection or operational tolerance, so called general immune reaction. Though common immunologic system seems to be enhanced by opposite triggering factors, opposite outcome like acute rejection or operational tolerance can happen if down-signaling cascades are finely tuned. In terms of immunological signaling pathway, within certain pathway, there can be several fine-tuned signaling possibilities, may result in totally opposite outcome.
If a certain gene is up-regulated in TOL condition, while the gene is also down-regulated in AR, that gene may have a critical role in driving tolerance condition. The opposite direction of our hypothesis also may give the insight to define an important gene in acute rejection.
Applying that strategy in our current signature genes, we cannot find overlap genes up- regulated in AR as well as down-regulated in TOL. We found a gene called DSP is down- regulated in AR as well as up-regulated in TOL, which perfectly suits to our hypothesis. DSP (23,24) is a critical protein in the desmosome structures in cardiac muscle and epidermal cells. Though desmosome has a key role in intercellular junction to link adjacent cells tightly, it needs further investigation why DSP is highly up-regulated in TOL circumstances.
Though the high expression of certain genes does not mean the genes are functionally critical in mechanism, to identify the signature genes have several advantages for finding alternative options of transplantation circumstance such as early prediction of rejection, early intervention of immune tolerant drugs, early cessation of them as well as prognostic prediction (25,26). We believe that our current study, focused on the signature genes in different immunological circumstance, acute rejection and operational tolerance, will have a certain role in shedding light not only on the mechanistic insight but on the detection of biological markers regardless of their functional involvement.
Here, we identified specific signature gene sets concerned in 2 contrasting immunologic reaction using systems biology approach with bioinformatics tools. Further experimental investigation can elucidate important key players to reveal their exact integrative role in regulation of immune responses.
ACKNOWLEDGEMENTS
This research was supported by a grant of the Korea Health Technology R&D Project through the Korea Health Industry Development Institute (KHIDI), funded by the Ministry of Health
& Welfare, Republic of Korea (grant number: HI13C0954).
Ji Won Oh is supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) (2016R1C1B1015211). Authors thank Jung Min Park and Wook Heo for technical assistance, Buu Le Dao and Brian Vu for critical reading of the manuscript.
SUPPLEMENTARY MATERIALS
Supplementary Table 1
Details of DEGs in AR and TOL Click here to viewSupplementary Table 2
Detail lists of high-ranked signature genes in AR and TOL Click here to view
Supplementary Figure 1
Example pilot network analysis of using gene sets*. Click here to view
Supplementary Figure 2
Example pilot in silico network meta-analysis of using gene sets*. Click here to view
REFERENCES
1. Lechler RI, Sykes M, Thomson AW, Turka LA. Organ transplantation--how much of the promise has been realized? Nat Med 2005;11:605-613.
PUBMED | CROSSREF
2. Orlando G, Hematti P, Stratta RJ, Burke GW 3rd, Di Cocco P, Pisani F, Soker S, Wood K. Clinical operational tolerance after renal transplantation: current status and future challenges. Ann Surg 2010;252:915-928.
PUBMED | CROSSREF
3. Roedder S, Vitalone M, Khatri P, Sarwal MM. Biomarkers in solid organ transplantation: establishing personalized transplantation medicine. Genome Med 2011;3:37.
PUBMED | CROSSREF
4. Gökmen R, Hernandez-Fuentes MP. Biomarkers of tolerance. Curr Opin Organ Transplant 2013;18:416-420.
PUBMED
5. Khatri P, Roedder S, Kimura N, De Vusser K, Morgan AA, Gong Y, Fischbein MP, Robbins RC, Naesens M, Butte AJ, et al. A common rejection module (CRM) for acute rejection across multiple organs identifies novel therapeutics for organ transplantation. J Exp Med 2013;210:2205-2221.
PUBMED | CROSSREF
6. Baron D, Ramstein G, Chesneau M, Echasseriau Y, Pallier A, Paul C, Degauque N, Hernandez-Fuentes MP, Sanchez-Fueyo A, Newell KA, et al. A common gene signature across multiple studies relate biomarkers and functional regulation in tolerance to renal allograft. Kidney Int 2015;87:984-995.
PUBMED | CROSSREF
7. Braud C, Baeten D, Giral M, Pallier A, Ashton-Chess J, Braudeau C, Chevalier C, Lebars A, Léger J, Moreau A, et al. Immunosuppressive drug-free operational immune tolerance in human kidney transplant recipients: part I. Blood gene expression statistical analysis. J Cell Biochem 2008;103:1681-1692.
PUBMED | CROSSREF
8. Brouard S, Mansfield E, Braud C, Li L, Giral M, Hsieh SC, Baeten D, Zhang M, Ashton-Chess J, Braudeau C, et al. Identification of a peripheral blood transcriptional biomarker panel associated with operational renal allograft tolerance. Proc Natl Acad Sci U S A 2007;104:15448-15453.
PUBMED | CROSSREF
9. Lozano JJ, Pallier A, Martinez-Llordella M, Danger R, López M, Giral M, Londoño MC, Rimola A, Soulillou JP, Brouard S, et al. Comparison of transcriptional and blood cell-phenotypic markers between operationally tolerant liver and kidney recipients. Am J Transplant 2011;11:1916-1926.
PUBMED | CROSSREF
10. Newell KA, Asare A, Kirk AD, Gisler TD, Bourcier K, Suthanthiran M, Burlingham WJ, Marks WH, Sanz I, Lechler RI, et al. Identification of a B cell signature associated with renal transplant tolerance in humans.
J Clin Invest 2010;120:1836-1847.
PUBMED | CROSSREF
11. Sagoo P, Perucha E, Sawitzki B, Tomiuk S, Stephens DA, Miqueu P, Chapman S, Craciun L, Sergeant R, Brouard S, et al. Development of a cross-platform biomarker signature to detect renal transplant tolerance in humans. J Clin Invest 2010;120:1848-1861.
PUBMED | CROSSREF
12. Oh JW, Kloepper J, Langan EA, Kim Y, Yeo J, Kim MJ, Hsi TC, Rose C, Yoon GS, Lee SJ, et al. A guide to studying human hair follicle cycling in vivo. J Invest Dermatol 2016;136:34-44.
PUBMED | CROSSREF
13. Oh JW, Chung O, Cho YS, MacGregor GR, Plikus MV. Gene loss in keratinization programs accompanies adaptation of cetacean skin to aquatic lifestyle. Exp Dermatol 2015;24:572-573.
PUBMED | CROSSREF
14. Song SH, Jang HU, Oh JW, Kim JS. Gene expression analysis in nasal polyp using microarray. Korean J Otorhinolaryngol-Head Neck Surg 2011;54:55-61.
CROSSREF
15. Choi SW, Chung HY, Lim YK, Kim HN, Oh JW, Kim MK, Jeon SH, Hong YT. Difference of gene expression between hypertrophic scar keratinocytes and normal keratinocytes. J Korean Soc Plast Reconstr Surg 2010;37:317-322.
16. Neale BM, Kou Y, Liu L, Ma'ayan A, Samocha KE, Sabo A, Lin CF, Stevens C, Wang LS, Makarov V, et al.
Patterns and rates of exonic de novo mutations in autism spectrum disorders. Nature 2012;485:242-245.
PUBMED | CROSSREF
17. Méndez-Ferrer S, Michurina TV, Ferraro F, Mazloom AR, Macarthur BD, Lira SA, Scadden DT, Ma'ayan A, Enikolopov GN, Frenette PS. Mesenchymal and haematopoietic stem cells form a unique bone marrow niche. Nature 2010;466:829-834.
PUBMED | CROSSREF
18. Wang Q, Oh JW, Lee HL, Dhar A, Peng T, Ramos R, Guerrero-Juarez CF, Wang X, Zhao R, Cao X, et al.
A multi-scale model for hair follicles reveals heterogeneous domains driving rapid spatiotemporal hair growth patterning. eLife 2017;6:e22772.
PUBMED | CROSSREF
19. Lachmann A, Xu H, Krishnan J, Berger SI, Mazloom AR, Ma'ayan A. ChEA: transcription factor regulation inferred from integrating genome-wide ChIP-X experiments. Bioinformatics 2010;26:2438-2444.
PUBMED | CROSSREF
20. Tan CM, Chen EY, Dannenfelser R, Clark NR, Ma'ayan A. Network2Canvas: network visualization on a canvas with enrichment analysis. Bioinformatics 2013;29:1872-1878.
PUBMED | CROSSREF
21. Cahan P, Rovegno F, Mooney D, Newman JC, St Laurent G 3rd, McCaffrey TA. Meta-analysis of microarray results: challenges, opportunities, and recommendations for standardization. Gene 2007;401:12-18.
PUBMED | CROSSREF
22. Subramanian A, Tamayo P, Mootha VK, Mukherjee S, Ebert BL, Gillette MA, Paulovich A, Pomeroy SL, Golub TR, Lander ES, et al. Gene set enrichment analysis: a knowledge-based approach for interpreting genome-wide expression profiles. Proc Natl Acad Sci U S A 2005;102:15545-15550.
PUBMED | CROSSREF
23. Arnemann J, Spurr NK, Wheeler GN, Parker AE, Buxton RS. Chromosomal assignment of the human genes coding for the major proteins of the desmosome junction, desmoglein DGI (DSG), desmocollins DGII/III (DSC), desmoplakins DPI/II (DSP), and plakoglobin DPIII (JUP). Genomics 1991;10:640-645.
PUBMED | CROSSREF
24. Garrod D, Chidgey M. Desmosome structure, composition and function. Biochim Biophys Acta 2008;1778:572-587.
CROSSREF
25. Strimbu K, Tavel JA. What are biomarkers? Curr Opin HIV AIDS 2010;5:463-466.
PUBMED | CROSSREF
26. Biomarkers Definitions Working Group. Biomarkers and surrogate endpoints: preferred definitions and conceptual framework. Clin Pharmacol Ther 2001;69:89-95.
PUBMED | CROSSREF