Overview

The IGVF Catalog contains variant nodes primarily sourced from FAVOR.
Variants not present in FAVOR but present in the other variant collections below were also loaded.
Variants may be identified using SPDI, HGVS, or their genomic position.

Choose the data type you need

A row of cards now appears at the top of each variant page so you can jump straight to the evidence you want: Coding, Noncoding, Molecular Networks, Phenotypes, or Allele Frequencies.
Pick a card to land on the matching section of the page and scan its tables without scrolling.
The cards stay easy to tap on smaller screens, helping you quickly switch between coding changes, regulatory signals, network partners, phenotype associations, or population frequency summaries.

Stay oriented across pages

A colored coverage stripe now sits under the variant title, mirroring the home page bar so you instantly see how much IGVF evidence exists for each data type.
Table headers now spell out whether rows come from datasets (for example, FAVOR or MaveDB experiments) or predictions (such as ENCODE E2G), making it clear when you are viewing measured results versus model outputs.
SPDI and RefSeq-style variant identifiers now resolve cleanly even when they contain multiple colons—for example, NC_000023.11:154533039:G:C opens directly instead of showing an error.

Region links in the Gene Regulation section now open reliably for every chromosome label—chrX, chrY, chrM, and other reference contigs—so clicking a coordinate no longer leads to a blank page.

Non-coding Variant Edges

Source	Class	Edge Description	Datasets
TopLD	statistical assessment	Variant-to-variant edges describing variants in linkage disequilibrium (LD).	TopLD
caQTL	statistical assessment	Variant-to-genomic element edges describing variants associated with chromatin accessibility.	ENCODE, AFGR
eQTL	statistical assessment	Variant-to-gene edges describing variants associated with gene expression.	EBI eQTL, AFGR
sQTL	statistical assessment	Variant-to-gene edges describing variants associated with alternative splicing.	AFGR
Variant-EFFECTS	observed data	Variant-to-gene edges describing causal effects of variants on endogenous gene expression from genome editing experiments.	Variant-EFFECTS datasets (Jesse Engreitz, Stanford)
STARR-seq	observed data	Variant-to-biosample edges describing allele-specific regulatory activity in the K562 cell line.	STARR-seq datasets (Tim Reddy, Duke)
BlueSTARR	prediction	Variant-to-genomic element edges describing predicted regulatory activity in ENCODE cCRE sequences using a STARR-seq–trained model.	BlueSTARR dataset (Bill Majoros, Duke)
MPRA	observed data	Variant-to-genomic element edges describing variant activity within genomic elements tested in MPRA.	lentiMPRA datasets (Nadav Ahituv, UCSF)

Variant Edges

Source	Edge Description	Datasets
TopLD	Variant-to-variant edges describing variants in linkage disequilibrium (LD).	TopLD
caQTL	Variant-to-genomic element edges describing variants associated with chromatin accessibility.	ENCODE, AFGR
eQTL	Variant-to-gene edges describing variants associated with gene expression.	EBI eQTL, AFGR
sQTL	Variant-to-gene edges describing variants associated with alternative splicing.	AFGR
Variant-EFFECTS	Variant-to-gene edges describing causal effects of variants on endogenous gene expression from genome editing experiments.	Variant-EFFECTS datasets (Jesse Engreitz, Stanford)
STARR-seq	Variant-to-biosample edges describing allele-specific regulatory activity in the K562 cell line.	STARR-seq datasets (Tim Reddy, Duke)
BlueSTARR	Variant-to-genomic element edges describing predicted regulatory activity in ENCODE cCRE sequences using a STARR-seq–trained model.	BlueSTARR dataset (Bill Majoros, Duke)
MPRA	Variant-to-genomic element edges describing variant activity within genomic elements tested in MPRA.	lentiMPRA datasets (Nadav Ahituv, UCSF)

Enhancer-Gene Model Prediction Table

This table shows which genes are predicted to be regulated by enhancers overlapping the variant you’re viewing.

Column	Description
Cell Type	Cell type in which the enhancer is predicted to regulate the gene
Target Gene	Gene predicted to be regulated by the enhancer (click for gene details)
Score	Strength of the prediction (range: 0 to 1, higher indicates a more confident prediction)
Dataset	Source dataset (click for more details)
Model	Predictive model. Currently: ENCODE-rE2G
Variant-Gene Distance	Genomic distance between the variant and gene body

Score ranges from 0 (no prediction) to 1 (confident prediction). Currently, this table includes predictions from the ENCODE-rE2G model across 1,458 ENCODE biosamples (see Gschwind et al. bioRxiv 2023) The table is initially sorted by Score in descending order, showing the strongest predictions first.

Variants in Linkage Disequilibrium

This table lists variants in linkage disequilibrium with the query variant, and summarizes functional evidence about those variants. Each row reports one variant in one ancestry. LD information is sourced from 1000 Genomes Phase 3 queried from Ensembl.

Column	Description
rsID	The reference SNP ID (click for variant details)
LD (r²)	Measure of linkage disequilibrium (correlation coefficient)
LD (D’)	Measure of linkage disequilibrium (D prime statistic)
Cell Types w/ pred. EG link	Cell types with predicted enhancer-gene links
Genes w/ pred. EG link	Genes with predicted enhancer-gene links
QTL Types	Types of QTL associations
Genes w/ QTLs	Genes with QTL associations
Ancestry	The population ancestry for the LD information
Most Severe Consequence	The predicted functional impact

You can filter this table by ancestry using the dropdown menu above the table.

GWAS Association

Which phenotypes are associated with this variant through genome-wide association studies? GWAS Catalog. Study IDs now open directly in the GWAS Catalog or UK Biobank (when applicable) so you can read the full study record without leaving the catalog page, and the previous Source column has been removed so the table stays focused on the study details shown here.

Column	Description
Lead Variant	Lead variant identifier for the association
Study ID	Identifier for the GWAS study (click to view in GWAS Catalog or UK Biobank when provided)
Trait	The associated phenotype or trait
Effect Allele	Effect allele reported for the association
Lead Variant P-value	Statistical significance of the association
Beta	Effect size and direction
95% Confidence Interval	Confidence interval for the effect size
PMID	PubMed ID for the study (click to view publication)
Author (Year)	First author and publication year
Study N	Sample size of the study

CSV downloads now include the variant identifier in every exported row so your spreadsheets keep the context of the on-page tables.

Coding variant measurements

Experimental assays now sit in their own table so you can focus on observed protein effects (for example, VAMP-seq abundance drops or SGE functional scores) without scrolling past prediction models. Hover the info bubbles in the Source column to see quick explanations of VAMP-seq or SGE when they appear, and use the updated VAMP-seq link to open variant plots directly at https://abundance.gs.washington.edu/plots/variants.

Column	Description
Name	Name identifier for the coding variant
Reference	Reference allele
Alternate	Alternate allele
Protein Name	Name of the affected protein
Gene Name	Name of the affected gene (click for gene details)
Transcript ID	Transcript identifier (click for transcript details)
Amino Acid Position	Position of the amino acid change
HGVS Protein	HGVS notation for the protein change
HGVS	HGVS notation for the variant
Reference Codon	Reference codon sequence
Codon Position	Position within the codon
VEST4 Score	VEST4 pathogenicity prediction score (blank when not provided by the dataset)
REVEL Score	REVEL pathogenicity prediction score (blank when not provided by the dataset)
MutPred2 Score	MutPred2 probability of disrupting protein function; hover the question mark for a short method summary
BayesDel noAF Score	BayesDel pathogenicity score without allele frequency input
ESM-1v Score	ESM-1v evolutionary constraint score; hover the question mark for a short method summary
AlphaMissense Score	AlphaMissense pathogenicity prediction score
Source	Study, resource, or assay behind the row (hover for quick context where available)

Coding variant predictions

Computational models now have a separate table labelled Computational models, letting you review in silico scores (MutPred2, ESM-1v, AlphaMissense, REVEL, VEST4, BayesDel) without mixing them with wet-lab measurements. Legacy columns like SIFT, PolyPhen-2, CADD Raw, VARITY, and EVE have been removed to keep the view focused on current methods. Hover the question marks on MutPred2 Score and ESM-1v Score to see what each model measures.

Column	Description
Name	Name identifier for the coding variant
Reference	Reference allele
Alternate	Alternate allele
Protein Name	Name of the affected protein
Gene Name	Name of the affected gene (click for gene details)
Transcript ID	Transcript identifier (click for transcript details)
Amino Acid Position	Position of the amino acid change
HGVS Protein	HGVS notation for the protein change
HGVS	HGVS notation for the variant
Reference Codon	Reference codon sequence
Codon Position	Position within the codon
VEST4 Score	VEST4 pathogenicity prediction score
REVEL Score	REVEL pathogenicity prediction score
MutPred2 Score	MutPred2 probability of disrupting protein function (hover for method notes)
BayesDel noAF Score	BayesDel pathogenicity score without allele frequency input
ESM-1v Score	ESM-1v evolutionary constraint score (hover for method notes)
AlphaMissense Score	AlphaMissense pathogenicity prediction score
Source	Model or reference resource for the prediction (click for details when linked)

Allelic effect(s) on transcription factor binding

This table shows effects of the variant on allelic binding in transcription factor (TF) ChIP-seq experiments.

Column	Description
Protein	Transcription factor protein (click for protein details)
Gene	Gene encoding the transcription factor (click for gene details)
Biosample	Biological sample or cell type
Motif	Transcription factor binding motif (click for motif details)
Motif Summary	Summary of motif binding effects
Motif Fold-Change	Fold change in motif binding affinity
Ref Score	Binding score for reference allele
Alt Score	Binding score for alternate allele
Ref FDR	False discovery rate for reference allele
Alt FDR	False discovery rate for alternate allele
Motif Position	Position of the motif
Motif Strand	Strand orientation of the motif
Source	Source of the binding data

Molecular Networks

A notice now appears at the top of this section reading “Perturb-seq and other CRISPR data coming soon!” so you know additional network evidence is on the way while you review the current results.

Molecular QTLs

Molecular QTLs for the variant now appear in a single, streamlined table that combines gene-level and protein-level associations. Cell type and tissue names are shown with clear, human-readable labels.

Column	Description
Type	Type of molecular QTL (e.g., eQTL, sQTL, pQTL)
Gene	Gene linked to the variant through the QTL (click for gene details)
Cell type/tissue	Biosample label using readable cell type or tissue names
Effect Size	Magnitude of the variant’s effect
P-value (-log10)	Statistical significance of the association
Source	Data source for the QTL (click for source details)

The table subtitle lists the contributing sources—GTEx, AFGR, EBI QTL Catalogue, and UK Biobank proteomics—so you can see provenance at a glance.

Associated Disease

This table shows diseases associated with this variant from ClinGen and Orphanet.

Column	Description
Disease	Associated disease or condition (click for disease details)
Gene	Gene associated with the disease variant (click for gene details)
Assertion	Clinical significance assertion
PMIDs	PubMed IDs supporting the association
Source	Source of the disease association data (click for source details)

Functional assay measurement

This table shows functional assay measurements for each biosample, including AdAstra and GVATdb assay readouts for the variant.

Column	Description
Biosample	Biosample name (click for biosample details)
Synonyms	Alternative names for the biosample
Description	Description of the biosample
Method	Experimental method used
Label	Evidence label or type
Input Counts	Input counts for reference and alternate alleles
Output Counts	Output counts for reference and alternate alleles
Variant SPDI	SPDI notation for the variant
Post. Prob. Effect	Posterior probability of effect
log2 Fold Change	Log2 fold change measurement
95% Confidence Interval	Confidence interval for the measurement
Source	Source of the experimental evidence

Interactive Visualizations

The variant page includes several interactive visualization components that provide rich insights into variant effects, population genetics, and functional predictions.

Primary Variant Visualization

An animated visualization showing the molecular mechanism of the variant using SPDI notation. Features:

Animated Sequence: Shows the variant change as deletion and insertion events
Color Coding:
- Teal (#337788): Insertion allele
- Coral (#CC8877): Deletion allele
Multi-phase Animation:
- Initial display of deletion event
- Fade-in of insertion allele
- Arrow indicating the replacement direction
Genomic Context: Displays surrounding genomic sequence with placeholder bases
Responsive Design: Automatically adjusts to container width

This visualization helps users understand the molecular nature of the variant change at the nucleotide level.

Allele Frequency Distribution

An interactive bar chart displaying population-specific allele frequencies from gnomAD. Features:

Population Breakdown: Shows frequencies across major ancestry groups:
- African, Amish, Ashkenazi Jewish, East Asian, Finnish
- Native American, Non-Finnish European, Other, South Asian
Interactive Elements:
- Hover over bars to see exact frequency values
- Color changes on hover for visual feedback
- Precise tooltips with 3 significant figures
Animated Rendering: Bars animate from bottom to top on initial load
Rotated Labels: Population labels displayed at 45-degree angle for readability
Responsive Scaling: Y-axis automatically scales to accommodate data range

This visualization provides crucial population genetics context for variant interpretation.

Summary Data Table with Score Visualization

An interactive table combining functional prediction scores with visual progress bars. Features:

Score Visualization: Each numerical score displayed with:
- Horizontal progress bar showing relative magnitude
- Color-coded bar (brand color) indicating score strength
- Exact numerical value alongside visual representation
Sortable Interface: Click column headers to sort by score values
Dynamic Columns: Table adapts to show only relevant columns:
- Functional class (when available)
- Catalog links (when available)
- Portal links (when available)
Clamped Scoring: All scores normalized to 0-1 range for consistent visualization
Loading States: Graceful handling of data loading with appropriate placeholders

This component allows quick visual comparison of functional prediction scores across different tools and databases.

Linkage Disequilibrium Heatmap

An interactive heatmap showing pairwise linkage disequilibrium relationships with nearby variants. Features:

Dual Metrics: Toggle between r² and D’ measurements using dropdown selector
Interactive Exploration:
- Hover over cells to see detailed LD statistics
- Tooltips display both variant IDs and precise r²/D’ values
- Mouse tracking with real-time feedback
Color Gradient: Intensity represents LD strength (white to teal)
Variant Organization: Variants automatically sorted by genomic position
Export Functionality: Download heatmap as PNG image
Responsive Design: Square aspect ratio that scales with container
Legend: Color scale reference showing value mapping

The heatmap reveals local LD structure and helps identify variant clusters that may represent the same causal signal.

Ancestry Filtering

An animated interface for filtering LD data by population ancestry. Features:

Smooth Transitions: Framer Motion animations for state changes
Toggle Interaction:
- Click ancestry labels to filter data
- Selected ancestry highlighted with brand color
- Clear button (×) to remove filter
Population Mapping: Displays both short codes and full population names:
- ASJ (Ashkenazi Jewish), EAS (East Asian), AFR (African)
- FIN (Finnish), NFE (Non-Finnish European), SAS (South Asian)
- AMI (Amish), OTH (Other), AMR (American), EUR (European)
Dynamic Visibility: Only shows when multiple ancestries are available
Hover Effects: Scale animations on hover for visual feedback

This component enables population-specific analysis of linkage disequilibrium patterns.

Get Started

Nodes

Regions

Catalog Pages

Variant

Overview

Choose the data type you need

Stay oriented across pages

Non-coding Variant Edges

Variant Edges

Enhancer-Gene Model Prediction Table

Variants in Linkage Disequilibrium

GWAS Association

Coding variant measurements

Coding variant predictions

Allelic effect(s) on transcription factor binding

Molecular Networks

Molecular QTLs

Associated Disease

Functional assay measurement

Interactive Visualizations

Primary Variant Visualization

Allele Frequency Distribution

Summary Data Table with Score Visualization

Linkage Disequilibrium Heatmap

Ancestry Filtering

Get Started

Nodes

Regions

Catalog Pages

​Overview

​Choose the data type you need

​Stay oriented across pages

​Non-coding Variant Edges

​Variant Edges

​Enhancer-Gene Model Prediction Table

​Variants in Linkage Disequilibrium

​GWAS Association

​Coding variant measurements

​Coding variant predictions

​Allelic effect(s) on transcription factor binding

​Molecular Networks

​Molecular QTLs

​Associated Disease

​Functional assay measurement

​Interactive Visualizations

​Primary Variant Visualization

​Allele Frequency Distribution

​Summary Data Table with Score Visualization

​Linkage Disequilibrium Heatmap

​Ancestry Filtering

Overview

Choose the data type you need

Stay oriented across pages

Non-coding Variant Edges

Variant Edges

Enhancer-Gene Model Prediction Table

Variants in Linkage Disequilibrium

GWAS Association

Coding variant measurements

Coding variant predictions

Allelic effect(s) on transcription factor binding

Molecular Networks

Molecular QTLs

Associated Disease

Functional assay measurement

Interactive Visualizations

Primary Variant Visualization

Allele Frequency Distribution

Summary Data Table with Score Visualization

Linkage Disequilibrium Heatmap

Ancestry Filtering