Skip to main content

Genomic Elements

The IGVF Catalog contains genomic elements, which are potential functional regions for regulating gene expression. These elements are identified/tested by functional assays (e.g., MPRA), association studies (e.g., caQTLs), and computational predictions (e.g., the ENCODE-rE2G model). Data sources include: Additional genomic elements are added when studied in specific contexts, for example accessible elements used in predictive modeels or elements tested in an MPRA experiment or CRISPR screen

Genomic Elements-Genes Edges

SourceClassEdge DescriptionDatasets
ENCODEpredictionBiosample specific enhancer-gene predictions using rE2G pipelineENCODE-rE2G
IGVFobserved dataCRISPRa and CRISPRi Perturb-seq integrating a guide (sgRNA) library targeting transcription start sites in 24 genes in CD8-positive, alpha-beta memory T primary cellsIGVF Perturb-seq assays
IGVFobserved dataCRISPRi CRISPRa CRISPRko FACS screen of CCR7 and IL7R inCD8-positive, alpha-beta memory T primary cellsIGVF CRISPR FACS screen

Genomic Elements-Biosamples Edges

SourceClassEdge DescriptionDatasets
ENCODEobserved datalentiMPRA testing potential HepG2, K562, and WTC11 enhancers and promoters of all protein-coding genes.ENCODE lentiMPRA
IGVFobserved datalentiMPRA in 5 biosamples measuring effect of element (and variants) on gene expressionIGVF lentiMPRA

Variants-Genomic Elements Edges

(also on variants page)
  • External caQTL studies: variants associated with genomic elements function from external studies (PMID:34038741, PMID:34017130) and AFGR(African Functional Genomics Resource)
  • IGVF BlueSTARR: computational model predicts variants affecting the regulatory function in ENCODE cCREs (Candidate Cis-Regulatory Elements)
  • IGVF MPRA

Table Columns:

ColumnDescription
RegionGenomic coordinates of the regulatory region (clickable for more details)
Biochemical ActivityThe type of biochemical activity in this region
Biochemical Activity DescriptionDetailed description of the biochemical activity
TypeClassification of the regulatory region
SourceOrigin of the data
Source URLLink to the original data source (clickable)

Interacting with the Table:

  • Click on column headers to sort the table by that column
  • Use the search box above the table to filter results
  • Adjust the number of rows displayed using the dropdown menu
  • Navigate between pages using the “Previous” and “Next” buttons

Biosample Table

The Biosample Table provides information about biological samples associated with the regulatory regions in your selected genomic coordinates.

Table Columns:

ColumnDescription
Activity ScoreNumerical score indicating the level of activity (higher is more active)
SourceOrigin of the biosample data (clickable link)
Chromosome, Region Start, Region EndLocation of the associated regulatory region
Region Name, Region TypeIdentifiers for the regulatory region
Biochemical Activity, Activity DescriptionDetails about the biochemical activity in the region
Biosample NameName of the biological sample
Term IDUnique identifier for the biosample
DescriptionDetailed description of the biosample
Biosample SourceOrigin of the biosample
SubontologySubclassification of the biosample, if applicable

Enhancer-Gene Prediction(s)

This table reports whether the query region contains predicted enhancers from the ENCODE-rE2G model, and their target genes and cell types. Each row reports one predicted enhancer, target gene, and cell type.
ColumnDescription
Cell TypeCell type in which the enhancer is predicted to regulate the gene
Target GeneGene predicted to be regulated by the enhancer
ScoreStrength of the prediction (range: 0 to 1, higher indicates a more confident prediction)
DatasetSource dataset
ModelPredictive model. Currently: ENCODE-rE2G
Variant-Gene DistanceGenomic distance between the variant and gene body
Currently, this table includes predictions from the ENCODE-rE2G model across 1700+ ENCODE biosamples (see Gschwind et al. bioRxiv 2023) The table is initially sorted by Score in descending order, showing the strongest predictions first.