Skip to main content

Overview

The IGVF Catalog contains variant nodes primarily sourced from FAVOR.
Variants not present in FAVOR but present in the other variant collections below were also loaded.
Variants may be identified using SPDI, HGVS, or their genomic position.

Choose the data type you need

  • A row of cards now appears at the top of each variant page so you can jump straight to the evidence you want: Coding, Noncoding, Molecular Networks, Phenotypes, or Allele Frequencies.
  • Pick a card to land on the matching section of the page and scan its tables without scrolling.
  • The cards stay easy to tap on smaller screens, helping you quickly switch between coding changes, regulatory signals, network partners, phenotype associations, or population frequency summaries.

Stay oriented across pages

  • A colored coverage stripe now sits under the variant title, mirroring the home page bar so you instantly see how much IGVF evidence exists for each data type.
  • Table headers now spell out whether rows come from datasets (for example, FAVOR or MaveDB experiments) or predictions (such as ENCODE E2G), making it clear when you are viewing measured results versus model outputs.
  • SPDI and RefSeq-style variant identifiers now resolve cleanly even when they contain multiple colons—for example, NC_000023.11:154533039:G:C opens directly instead of showing an error.
Region links in the Gene Regulation section now open reliably for every chromosome label—chrX, chrY, chrM, and other reference contigs—so clicking a coordinate no longer leads to a blank page.

Non-coding Variant Edges

SourceClassEdge DescriptionDatasets
TopLDstatistical assessmentVariant-to-variant edges describing variants in linkage disequilibrium (LD).TopLD
caQTLstatistical assessmentVariant-to-genomic element edges describing variants associated with chromatin accessibility.ENCODE, AFGR
eQTLstatistical assessmentVariant-to-gene edges describing variants associated with gene expression.EBI eQTL, AFGR
sQTLstatistical assessmentVariant-to-gene edges describing variants associated with alternative splicing.AFGR
Variant-EFFECTSobserved dataVariant-to-gene edges describing causal effects of variants on endogenous gene expression from genome editing experiments.Variant-EFFECTS datasets (Jesse Engreitz, Stanford)
STARR-seqobserved dataVariant-to-biosample edges describing allele-specific regulatory activity in the K562 cell line.STARR-seq datasets (Tim Reddy, Duke)
BlueSTARRpredictionVariant-to-genomic element edges describing predicted regulatory activity in ENCODE cCRE sequences using a STARR-seq–trained model.BlueSTARR dataset (Bill Majoros, Duke)
MPRAobserved dataVariant-to-genomic element edges describing variant activity within genomic elements tested in MPRA.lentiMPRA datasets (Nadav Ahituv, UCSF)

Variant Edges

SourceEdge DescriptionDatasets
TopLDVariant-to-variant edges describing variants in linkage disequilibrium (LD).TopLD
caQTLVariant-to-genomic element edges describing variants associated with chromatin accessibility.ENCODE, AFGR
eQTLVariant-to-gene edges describing variants associated with gene expression.EBI eQTL, AFGR
sQTLVariant-to-gene edges describing variants associated with alternative splicing.AFGR
Variant-EFFECTSVariant-to-gene edges describing causal effects of variants on endogenous gene expression from genome editing experiments.Variant-EFFECTS datasets (Jesse Engreitz, Stanford)
STARR-seqVariant-to-biosample edges describing allele-specific regulatory activity in the K562 cell line.STARR-seq datasets (Tim Reddy, Duke)
BlueSTARRVariant-to-genomic element edges describing predicted regulatory activity in ENCODE cCRE sequences using a STARR-seq–trained model.BlueSTARR dataset (Bill Majoros, Duke)
MPRAVariant-to-genomic element edges describing variant activity within genomic elements tested in MPRA.lentiMPRA datasets (Nadav Ahituv, UCSF)

Enhancer-Gene Model Prediction Table

This table shows which genes are predicted to be regulated by enhancers overlapping the variant you’re viewing.
ColumnDescription
Cell TypeCell type in which the enhancer is predicted to regulate the gene
Target GeneGene predicted to be regulated by the enhancer (click for gene details)
ScoreStrength of the prediction (range: 0 to 1, higher indicates a more confident prediction)
DatasetSource dataset (click for more details)
ModelPredictive model. Currently: ENCODE-rE2G
Variant-Gene DistanceGenomic distance between the variant and gene body
Score ranges from 0 (no prediction) to 1 (confident prediction). Currently, this table includes predictions from the ENCODE-rE2G model across 1,458 ENCODE biosamples (see Gschwind et al. bioRxiv 2023) The table is initially sorted by Score in descending order, showing the strongest predictions first.

Variants in Linkage Disequilibrium

This table lists variants in linkage disequilibrium with the query variant, and summarizes functional evidence about those variants. Each row reports one variant in one ancestry. LD information is sourced from 1000 Genomes Phase 3 queried from Ensembl.
ColumnDescription
rsIDThe reference SNP ID (click for variant details)
LD (r²)Measure of linkage disequilibrium (correlation coefficient)
LD (D’)Measure of linkage disequilibrium (D prime statistic)
Cell Types w/ pred. EG linkCell types with predicted enhancer-gene links
Genes w/ pred. EG linkGenes with predicted enhancer-gene links
QTL TypesTypes of QTL associations
Genes w/ QTLsGenes with QTL associations
AncestryThe population ancestry for the LD information
Most Severe ConsequenceThe predicted functional impact
You can filter this table by ancestry using the dropdown menu above the table.

GWAS Association

Which phenotypes are associated with this variant through genome-wide association studies? GWAS Catalog. Study IDs now open directly in the GWAS Catalog or UK Biobank (when applicable) so you can read the full study record without leaving the catalog page, and the previous Source column has been removed so the table stays focused on the study details shown here.
ColumnDescription
Lead VariantLead variant identifier for the association
Study IDIdentifier for the GWAS study (click to view in GWAS Catalog or UK Biobank when provided)
TraitThe associated phenotype or trait
Effect AlleleEffect allele reported for the association
Lead Variant P-valueStatistical significance of the association
BetaEffect size and direction
95% Confidence IntervalConfidence interval for the effect size
PMIDPubMed ID for the study (click to view publication)
Author (Year)First author and publication year
Study NSample size of the study
CSV downloads now include the variant identifier in every exported row so your spreadsheets keep the context of the on-page tables.

Coding variant measurements

Experimental assays now sit in their own table so you can focus on observed protein effects (for example, VAMP-seq abundance drops or SGE functional scores) without scrolling past prediction models. Hover the info bubbles in the Source column to see quick explanations of VAMP-seq or SGE when they appear, and use the updated VAMP-seq link to open variant plots directly at https://abundance.gs.washington.edu/plots/variants.
ColumnDescription
NameName identifier for the coding variant
ReferenceReference allele
AlternateAlternate allele
Protein NameName of the affected protein
Gene NameName of the affected gene (click for gene details)
Transcript IDTranscript identifier (click for transcript details)
Amino Acid PositionPosition of the amino acid change
HGVS ProteinHGVS notation for the protein change
HGVSHGVS notation for the variant
Reference CodonReference codon sequence
Codon PositionPosition within the codon
VEST4 ScoreVEST4 pathogenicity prediction score (blank when not provided by the dataset)
REVEL ScoreREVEL pathogenicity prediction score (blank when not provided by the dataset)
MutPred2 ScoreMutPred2 probability of disrupting protein function; hover the question mark for a short method summary
BayesDel noAF ScoreBayesDel pathogenicity score without allele frequency input
ESM-1v ScoreESM-1v evolutionary constraint score; hover the question mark for a short method summary
AlphaMissense ScoreAlphaMissense pathogenicity prediction score
SourceStudy, resource, or assay behind the row (hover for quick context where available)

Coding variant predictions

Computational models now have a separate table labelled Computational models, letting you review in silico scores (MutPred2, ESM-1v, AlphaMissense, REVEL, VEST4, BayesDel) without mixing them with wet-lab measurements. Legacy columns like SIFT, PolyPhen-2, CADD Raw, VARITY, and EVE have been removed to keep the view focused on current methods. Hover the question marks on MutPred2 Score and ESM-1v Score to see what each model measures.
ColumnDescription
NameName identifier for the coding variant
ReferenceReference allele
AlternateAlternate allele
Protein NameName of the affected protein
Gene NameName of the affected gene (click for gene details)
Transcript IDTranscript identifier (click for transcript details)
Amino Acid PositionPosition of the amino acid change
HGVS ProteinHGVS notation for the protein change
HGVSHGVS notation for the variant
Reference CodonReference codon sequence
Codon PositionPosition within the codon
VEST4 ScoreVEST4 pathogenicity prediction score
REVEL ScoreREVEL pathogenicity prediction score
MutPred2 ScoreMutPred2 probability of disrupting protein function (hover for method notes)
BayesDel noAF ScoreBayesDel pathogenicity score without allele frequency input
ESM-1v ScoreESM-1v evolutionary constraint score (hover for method notes)
AlphaMissense ScoreAlphaMissense pathogenicity prediction score
SourceModel or reference resource for the prediction (click for details when linked)

Allelic effect(s) on transcription factor binding

This table shows effects of the variant on allelic binding in transcription factor (TF) ChIP-seq experiments.
ColumnDescription
ProteinTranscription factor protein (click for protein details)
GeneGene encoding the transcription factor (click for gene details)
BiosampleBiological sample or cell type
MotifTranscription factor binding motif (click for motif details)
Motif SummarySummary of motif binding effects
Motif Fold-ChangeFold change in motif binding affinity
Ref ScoreBinding score for reference allele
Alt ScoreBinding score for alternate allele
Ref FDRFalse discovery rate for reference allele
Alt FDRFalse discovery rate for alternate allele
Motif PositionPosition of the motif
Motif StrandStrand orientation of the motif
SourceSource of the binding data

Molecular Networks

A notice now appears at the top of this section reading “Perturb-seq and other CRISPR data coming soon!” so you know additional network evidence is on the way while you review the current results.

Molecular QTLs

Molecular QTLs for the variant now appear in a single, streamlined table that combines gene-level and protein-level associations. Cell type and tissue names are shown with clear, human-readable labels.
ColumnDescription
TypeType of molecular QTL (e.g., eQTL, sQTL, pQTL)
GeneGene linked to the variant through the QTL (click for gene details)
Cell type/tissueBiosample label using readable cell type or tissue names
Effect SizeMagnitude of the variant’s effect
P-value (-log10)Statistical significance of the association
SourceData source for the QTL (click for source details)
The table subtitle lists the contributing sources—GTEx, AFGR, EBI QTL Catalogue, and UK Biobank proteomics—so you can see provenance at a glance.

Associated Disease

This table shows diseases associated with this variant from ClinGen and Orphanet.
ColumnDescription
DiseaseAssociated disease or condition (click for disease details)
GeneGene associated with the disease variant (click for gene details)
AssertionClinical significance assertion
PMIDsPubMed IDs supporting the association
SourceSource of the disease association data (click for source details)

Functional assay measurement

This table shows functional assay measurements for each biosample, including AdAstra and GVATdb assay readouts for the variant.
ColumnDescription
BiosampleBiosample name (click for biosample details)
SynonymsAlternative names for the biosample
DescriptionDescription of the biosample
MethodExperimental method used
LabelEvidence label or type
Input CountsInput counts for reference and alternate alleles
Output CountsOutput counts for reference and alternate alleles
Variant SPDISPDI notation for the variant
Post. Prob. EffectPosterior probability of effect
log2 Fold ChangeLog2 fold change measurement
95% Confidence IntervalConfidence interval for the measurement
SourceSource of the experimental evidence

Interactive Visualizations

The variant page includes several interactive visualization components that provide rich insights into variant effects, population genetics, and functional predictions.

Primary Variant Visualization

An animated visualization showing the molecular mechanism of the variant using SPDI notation. Features:
  • Animated Sequence: Shows the variant change as deletion and insertion events
  • Color Coding:
    • Teal (#337788): Insertion allele
    • Coral (#CC8877): Deletion allele
  • Multi-phase Animation:
    • Initial display of deletion event
    • Fade-in of insertion allele
    • Arrow indicating the replacement direction
  • Genomic Context: Displays surrounding genomic sequence with placeholder bases
  • Responsive Design: Automatically adjusts to container width
This visualization helps users understand the molecular nature of the variant change at the nucleotide level.

Allele Frequency Distribution

An interactive bar chart displaying population-specific allele frequencies from gnomAD. Features:
  • Population Breakdown: Shows frequencies across major ancestry groups:
    • African, Amish, Ashkenazi Jewish, East Asian, Finnish
    • Native American, Non-Finnish European, Other, South Asian
  • Interactive Elements:
    • Hover over bars to see exact frequency values
    • Color changes on hover for visual feedback
    • Precise tooltips with 3 significant figures
  • Animated Rendering: Bars animate from bottom to top on initial load
  • Rotated Labels: Population labels displayed at 45-degree angle for readability
  • Responsive Scaling: Y-axis automatically scales to accommodate data range
This visualization provides crucial population genetics context for variant interpretation.

Summary Data Table with Score Visualization

An interactive table combining functional prediction scores with visual progress bars. Features:
  • Score Visualization: Each numerical score displayed with:
    • Horizontal progress bar showing relative magnitude
    • Color-coded bar (brand color) indicating score strength
    • Exact numerical value alongside visual representation
  • Sortable Interface: Click column headers to sort by score values
  • Dynamic Columns: Table adapts to show only relevant columns:
    • Functional class (when available)
    • Catalog links (when available)
    • Portal links (when available)
  • Clamped Scoring: All scores normalized to 0-1 range for consistent visualization
  • Loading States: Graceful handling of data loading with appropriate placeholders
This component allows quick visual comparison of functional prediction scores across different tools and databases.

Linkage Disequilibrium Heatmap

An interactive heatmap showing pairwise linkage disequilibrium relationships with nearby variants. Features:
  • Dual Metrics: Toggle between r² and D’ measurements using dropdown selector
  • Interactive Exploration:
    • Hover over cells to see detailed LD statistics
    • Tooltips display both variant IDs and precise r²/D’ values
    • Mouse tracking with real-time feedback
  • Color Gradient: Intensity represents LD strength (white to teal)
  • Variant Organization: Variants automatically sorted by genomic position
  • Export Functionality: Download heatmap as PNG image
  • Responsive Design: Square aspect ratio that scales with container
  • Legend: Color scale reference showing value mapping
The heatmap reveals local LD structure and helps identify variant clusters that may represent the same causal signal.

Ancestry Filtering

An animated interface for filtering LD data by population ancestry. Features:
  • Smooth Transitions: Framer Motion animations for state changes
  • Toggle Interaction:
    • Click ancestry labels to filter data
    • Selected ancestry highlighted with brand color
    • Clear button (×) to remove filter
  • Population Mapping: Displays both short codes and full population names:
    • ASJ (Ashkenazi Jewish), EAS (East Asian), AFR (African)
    • FIN (Finnish), NFE (Non-Finnish European), SAS (South Asian)
    • AMI (Amish), OTH (Other), AMR (American), EUR (European)
  • Dynamic Visibility: Only shows when multiple ancestries are available
  • Hover Effects: Scale animations on hover for visual feedback
This component enables population-specific analysis of linkage disequilibrium patterns.