docsKnowledge BaseTissue Specificity

Tissue Specificity

Tissue-specific expression from GTEX and HPA, value ranging from 0 to +Inf

Bulk RNA-seq

Data sources:

Data typeCount (tissues)DataCoverage (nr genes)
RNA expression (consensus)50Gene level RNA seq data based on 50 consensus tissues used for expression profiling and classification20162
RNA expression (HPA)40Gene level RNA seq data based on the 40 tissues in the HPA dataset20162
RNA expression (GTEx)35Gene level RNA seq data for 35 tissues based on 46 tissue subtypes in the GTEx dataset19266
RNA expression (FANTOM)46Gene level RNA seq data for 46 tissues based on 66 tissue subtypes in the FANTOM dataset18292
  • Consensus transcript expression levels summarized per gene in 50 tissues based on transcriptomics data from HPA and GTEx.
  • HPA and GTEx datasets were processed in a normalization pipeline to be combined into a consensus dataset.
  • For tissues with multiple sub-tissues (brain regions, lymphoid tissues and intestine) the maximum of all sub-tissues is used for the tissue type.
  • Consensus normalized expression (“nTPM”) value is calculated as the maximum nTPM value for each gene in the two data sources.

In the end, the data is transformed into the csv file below:

Bulk RNA-seq data in our knowledge baseBulk RNA-seq data in our knowledge base

You can find the data in the format of {tissue name} in our tool, for example, “amygdala”, shown as below:

Bulk RNA-seq data naming conventionBulk RNA-seq data naming convention

Single cell RNA-seq

Data sources:

  • Transcript expression levels summarized per gene and cluster in 30 different datasets were analyzed.
  • These datasets were retrieved from
    • Single Cell Expression Atlas
    • Human Cell Atlas
    • Gene Expression Omnibus
    • Allen Brain Map
    • European Genome-phenome Archive
Data typeCount (tissues)DescriptionCoverage (nr genes)
RNA expression (tissues)31RNA read count for genes per cell across 31 tissues20082
RNA expression (clusters)557RNA expression for genes across 557 clusters20082
RNA expression (cell type)81RNA expression levels per gene and cell type20082
  • Data generated is based on meta-analysis of literature on single cell RNA sequencing and single cell databases that include healthy human tissue.
  • Droplet-based 10X Genomics Chromium (10X) approach were processed by Cell Ranger (v6.1.2), and datasets generated by the plate-based scRNA-seq were processed by STAR (v2.7.9a).
  • Downstream analysis followed an in-house pipeline using Scanpy (v1.7.1).
  • Each of the 557 different cell type clusters were manually annotated based on an extensive survey of >500 well-known tissue and cell type-specific markers, including both markers from the original publications, and additional markers used in pathology diagnostics.

In the end, the data is transformed into the csv file below:

Single cell RNA-seq data in our knowledge baseSingle cell RNA-seq data in our knowledge base

You can find the data in the format of {tissue name}_{cluster}_{cell type} in our tool, for example, “fallopian tube_c-5_fibroblasts”, shown as below:

Single cell RNA-seq data naming conventionSingle cell RNA-seq data naming convention