GeneGaze screens 3,500+ clinically significant variants across 10 constellations by synthesizing evidence from fourteen globally recognized sources. Pathogenicity predictions incorporate Google DeepMind's AlphaMissense. Every condition carries an ICD-10 code. Medication response predictions map to 96 CPIC Level A gene-drug pairs. Everything runs on your device. Nothing is uploaded.

Core clinical sources

๐Ÿ—„๏ธ
ClinVar โ†—
Primary clinical classification

NCBI's public archive of variants and clinical significance interpretations. Every GeneGaze variant with a ClinVar VCV accession shows its current clinical significance, review status, and submitter evidence.

~100% of pathogenic / likely-pathogenic variants

๐Ÿงฌ
OMIM โ†—
Mendelian disease catalog

Online Mendelian Inheritance in Man, the authoritative catalog of human genes and genetic disorders. Every gene in the GeneGaze database is cross-referenced with its OMIM entry and associated Mendelian phenotypes.

98.5% OMIM gene coverage across catalog

๐Ÿ“Š
gnomAD v4 โ†—
Population allele frequencies

The Genome Aggregation Database from the Broad Institute. Global and ancestry-specific allele frequencies from over 800,000 exomes and genomes. Used to filter common benign variants and surface genuinely rare risk alleles.

Global + African + European + East Asian + South Asian + Latino

๐Ÿ’Š
PharmGKB โ†—
Pharmacogenomics

The Pharmacogenomics Knowledgebase. Clinically annotated gene-drug interactions with evidence levels 1A through 4. GeneGaze's Medications tab pulls directly from PharmGKB annotations and CPIC guidelines.

96 CPIC Level A gene-drug pairs (ASCPT 2025)

๐Ÿ”ฌ
GWAS Catalog โ†—
Population-scale associations

EBI's curated catalog of genome-wide association studies. Supplies effect sizes, p-values, and replicated associations for complex-trait variants that aren't in ClinVar.

400,000+ associations across 6,500+ traits

โœจ
AlphaMissense โ†—
AI pathogenicity prediction

Google DeepMind's 2023 release scoring 71 million possible missense variants for pathogenicity, 89% classifiable as likely benign or likely pathogenic. Fills the interpretive gap for rare missense variants not yet in ClinVar. Cited as GeneGaze's single largest gap-filler.

71 million missense scores, all human protein-coding variation

Enrichment & interpretation

๐Ÿงช
UniProt / UniVar โ†—
Protein-level functional annotation

Variant impact at the protein sequence, domain, and functional site level. Every GeneGaze gene links to its UniProt record for canonical sequence, post-translational modifications, and domain architecture.

20,000+ reviewed human protein records

๐Ÿ“š
LOVD โ†—
Locus-specific variant databases

Leiden Open Variation Databases, community-curated, gene-specific variation archives maintained by disease experts. Often contains rare Mendelian variants before they're submitted to ClinVar.

Gene-specific archives for rare Mendelian variants

๐Ÿฉบ
MalaCards โ†—
Integrated disease-gene associations

The Weizmann Institute's integrated human disease compendium. Aggregates disease-gene-drug-pathway associations across 75+ sources into disease-centric cards. Used for cross-referencing condition synonyms and disease ontology.

22,000+ diseases across 75+ aggregated sources

๐Ÿ“–
Mastermind โ†—
Literature variant mining

Genomenon's AI-powered genomic literature database. Used to capture variants with published primary literature that haven't yet made it into ClinVar, critical for rare disease curation.

9M+ literature-mined variant-article mappings

๐Ÿงฉ
SpliceAI โ†—
Splicing impact prediction

Broad Institute deep-learning tool predicting splice-altering potential of any variant (0.0โ€“1.0 delta score). GeneGaze flags splice region variants with SpliceAI ฮด โ‰ฅ 0.5 as likely splice-disrupting.

Splice-altering predictions across all human genes

๐Ÿ”–
dbSNP โ†—
Canonical variant identifiers

NCBI's reference registry for rsIDs. Ensures every GeneGaze variant uses the current canonical rsID. Deprecated and merged entries are rejected at curation time (for example, rs1982073 resolves to rs1800470).

1 billion+ canonical rsID reference mappings

๐Ÿฅ
ICD-10 / ICD-11 โ†—
Standard disease coding

WHO's International Classification of Diseases. Every variant-linked condition carries its current ICD-10 (and where relevant ICD-11) code for interoperability with medical records.

Standard disease coding for EHR interoperability

๐Ÿ’Š
CPIC โ†—
Pharmacogenomics guidelines

The Clinical Pharmacogenetics Implementation Consortium. Published prescribing guidelines for Level A gene-drug pairs. Used alongside PharmGKB for the Medications tab.

Level A prescribing guidelines for 96 gene-drug pairs

How we curate

Our curation standard

Every variant in GeneGaze is hand-curated by a clinical-genetics-trained editor following a fixed eight-step protocol:

  1. Verify the canonical rsID on dbSNP
  2. Pull the ClinVar VCV accession and clinical significance
  3. Cross-reference OMIM for gene and phenotype links
  4. Confirm gnomAD v4 population frequency across ancestries
  5. Add the AlphaMissense score for missense variants
  6. Attach UniProt and MalaCards links where relevant
  7. Include the current ICD-10 (and ICD-11 where available) code
  8. Cite at least four independently verifiable primary sources