Williwaw Biosciences

Molecular Subtyping from Signals Others Discard

We have moved beyond the genome-wide association study (GWAS) paradigm entirely. While the field debates common versus rare variants, we have found that the most valuable information is hidden in what everyone discards.

Platform

Our patent-pending computational platform extracts molecular subtypes from genomic signals that conventional pipelines discard, applying machine learning and family-based inheritance analysis to identify biologically distinct disease subtypes from existing datasets.

How It Works

  1. Family-Based Data

    Analyze trio and family genomic data to capture inheritance patterns unavailable in case-only analyses.

  2. Signal Extraction

    Apply proprietary algorithms to extract subtype-defining signals from genomic patterns typically filtered out as noise — including probe intensity anomalies, copy number variants, and non-Mendelian inheritance signatures.

  3. Machine Learning Integration

    Unsupervised learning methods group individuals by molecular signal profiles, revealing biologically coherent subtypes without requiring predefined labels.

  4. Pathway Analysis

    Map extracted signals to biological pathways and molecular mechanisms, generating mechanistically interpretable subtype definitions.


Technology Overview

  • Patent-pending computational methods for molecular subtype discovery
  • Family trio-based analysis captures inheritance patterns unavailable in case-only designs
  • Machine learning integration identifies subtype-defining signals across biological pathways
  • Applicable to any complex disease with existing SNP array or sequencing data

Our Approach

Conventional genetic analysis is built around filters designed to reduce noise. Those same filters discard rare variants, structural signals, and inheritance patterns that carry biologically meaningful information.

Conventional Approaches Williwaw Platform
GWAS identifies common variants of small effect Detects rare variants from accessible SNP arrays
Rare variant detection requires expensive NGS Extracts signal from probe intensity anomalies
Single-sample analysis misses inheritance patterns Leverages family-based genomic patterns
Copy number gains treated as technical artifacts Copy number gains reveal structural variants
Subtypes invisible to standard pipelines Reveals molecular subtypes with pathway resolution

Large-scale genetic databases already contain the subtyping information we need — it has been systematically filtered out by conventional pipelines. We do not require new data collection; we extract previously invisible biological signals from existing datasets.

100k+
Family trios in public databases
50+
Conditions with available data
1M+
Samples across biobanks

Published Research: Autism Spectrum Disorder

Autism was our initial application because it represents one of the most genetically complex conditions in medicine: highly heritable yet extremely heterogeneous, with no reliable biomarkers and limited treatment options. Our findings are published in Frontiers in Molecular Neuroscience and HGG Advances.

Why Autism: A Demanding Genetic Architecture

Autism spectrum disorder encompasses the full complexity of human genetic architecture:

  • Extreme heterogeneity: Thousands of genes contribute to risk, with no single dominant cause
  • Polygenic and monogenic mechanisms: Both common variants and rare de novo mutations play roles
  • Variable expressivity: The same genetic changes produce vastly different clinical outcomes
  • Diagnostic challenges: Purely behavioral diagnosis with no molecular biomarkers
1,811
Individuals analyzed
6
Distinct molecular subtypes
10⁻⁴⁰
Statistical significance
8
Biological pathways identified
Six biologically distinct autism subtypes

Unsupervised machine learning identifies six biologically distinct autism subtypes with different underlying mechanisms (n=1,811).

Autism Molecular Subtypes

Each subtype has distinct biological mechanisms, clinical characteristics, and therapeutic implications.

High Genetic Burden

n=494 (27%)
  • Polygenic architecture with highest variant load
  • Disruption across multiple biological pathways simultaneously
  • No single dominant mechanism; includes more Simplex than Multiplex cases

Therapeutic implication: Requires comprehensive, multi-system approach rather than single-pathway intervention.

Glutamate Dysfunction

n=267 (15%)
  • Variants in glutamatergic signaling genes; excitatory/inhibitory imbalance
  • Sensory sensitivities; seizure susceptibility

Therapeutic implication: Harbors variants in glutamate receptors and ion channels with FDA-approved therapies for other conditions — an immediate drug repurposing candidate.

Synaptic & Immune Regulation

n=267 (15%)
  • Disruption of synaptic organization genes; abnormal synapse formation and pruning
  • Complement system involvement

Therapeutic implication: FDA-approved drug targets include cholesterol receptors, GABA-B receptors, and neurotrophin pathways. Anti-inflammatory strategies may be relevant.

Cholesterol & Steroid Metabolism

n=251 (14%)
  • Dominated by steroid synthesis pathways in the brain
  • May represent multiple rare subtypes within this cluster

Therapeutic implication: Potentially responsive to cholesterol-modulating drugs.

Cytoskeletal Organization & Neurodevelopment

n=273 (15%)
  • Microtubule dysfunction; affected neuronal migration and morphology
  • Axon guidance pathway genes; motor coordination difficulties

Therapeutic implication: Early motor interventions; occupational and physical therapy focused on motor development.

Presynaptic Function

n=259 (14%)
  • Serotonin receptor dysfunction; presynaptic scaffolding disruption
  • Limbic system involvement; mood/anxiety vulnerabilities; sleep disturbances

Therapeutic implication: Strong candidate for SSRI response. May respond to dietary tryptophan modulation.


Biological Pathways Identified

Synaptic Function

Neurotransmission, vesicle recycling, and synaptic plasticity

Ion Channel Activity

Excitability regulation, E/I balance, action potential generation

Cell Adhesion

Cell-cell contacts, dendritic spine formation, synapse stability

Cytoskeletal Structure

Microtubule organization, axon guidance, neuronal polarity

Vesicle Trafficking

Intracellular transport, endosomal sorting, membrane dynamics

Immune / Complement

Synaptic pruning, maternal immune activation, inflammatory responses

Epigenetic Regulation

DNA methylation, RNA modification, chromatin remodeling

RNA Processing

mRNA degradation, translation control, post-transcriptional regulation

Rare Disease Applications

Our platform identifies molecular subtypes in any genetically complex condition — including diseases traditionally considered simple Mendelian disorders. Even single-gene diseases show unexpected molecular diversity.

Molecular Heterogeneity in Rare Disease

Even single-gene diseases are molecularly heterogeneous. Standard genetic testing identifies the primary mutation but cannot predict severity or treatment response. Characterizing the modifier landscape that determines clinical outcomes is where precision medicine for rare disease begins.

Hereditary Hemochromatosis

Iron overload disorder

The Clinical Question

Patients with identical HFE C282Y/C282Y genotypes show 100-fold variation in outcomes — some develop severe cirrhosis by age 40, others remain asymptomatic for life. Penetrance of HFE mutations is approximately 1%, and no clinical factors reliably explain this variation.

Hemochromatosis Missing Heritability

What We Found

HFE (C282Y, H63D) Common variants identified by standard testing — necessary but not sufficient to predict severity.
MTF1 Rare SNP (MAF=0.04 in Europeans). Metal transcription factor that regulates HAMP in response to zinc.
SLC39A12 Rare SNP (MAF=0.01 in Europeans). Zinc transporter; cryptic deletion detectable only via NMI signal.
Epistasis All four mutations required for severe phenotype: iron + zinc dysregulation acting in combination.

Novel Biology

Both HFE (iron sensor) and MTF1 (zinc sensor) regulate HAMP expression. Zinc transporter disruption via SLC39A12 compounds the dysregulation, representing a previously undescribed iron-zinc epistatic interaction in hemochromatosis. Both NMI loci are eQTLs regulating gene expression.

Clinical Implications

  • Risk stratification: Identify which C282Y homozygotes need aggressive phlebotomy vs. monitoring
  • Personalized therapy: Zinc modulation for patients with MTF1/SLC39A12 variants
  • Prognostic testing: Predict severity at diagnosis rather than years later
  • Novel drug targets: Zinc pathway modulators for treatment-resistant cases
Hemochromatosis Iron-Zinc Pathway

Iron-zinc regulatory pathway showing multiple hemochromatosis types. HFE (Type 1) requires modifier genes for severe phenotype. Rare variants in SLC39A12 (zinc transporter) and MTF1 (zinc-responsive transcription factor) regulate HAMP expression, creating epistatic interaction with HFE mutations. Other hemochromatosis genes shown: HJV (Type 2A), HAMP (Type 2B), TFR2 (Type 3), SLC40A (Type 4), FTH1 (Type 5), plus iron metabolism modifiers (TMPRSS6, BMP2, BMP6).

Niemann-Pick Type C Disease

Severe progressive neurodegenerative ataxia

Reclassifying the Disease Mechanism

The same NPC1 mutations cause infantile vs. adult-onset disease (decades apart), with variable progression from rapid neurodegeneration to slow decline. The prevailing cholesterol-storage model has driven therapies with limited efficacy. Our analysis points to iron-dependent ferroptosis as the primary driver, opening different therapeutic avenues.

What We Found

NPC1 Primary lysosomal mutation. Traditional focus on cholesterol accumulation; triggers iron overload.
Ferroptosis genes COX10, ALOX5, PSMB8, GCLC modulate lipid peroxidation sensitivity.
NRF2 pathway Antioxidant defense system; protective variants slow progression. Target for FDA-approved drugs.
Glutathione system SLC7A11, SLC3A2 (System xc-) import cysteine for GSH synthesis.

Novel Biology

  • Iron accumulation: NPC1 defect causes Fe²⁺ buildup in mitochondria
  • Lipid peroxidation: Iron catalyzes PUFA oxidation via the arachidonic acid pathway
  • GSH depletion: System xc- dysfunction prevents antioxidant synthesis
  • NRF2 failure: Insufficient stress response in severe cases

Clinical Implications

  • Disease reclassification: Ferroptosis disorder enables repurposing of FDA-approved drugs
  • Omaveloxolone (Skyclarys®): NRF2 activator already approved for Friedreich's ataxia
  • Combination therapy: Iron chelation + NRF2 activation + lipid peroxidation inhibition
  • Clinical trial design: Enrich for patients with druggable NRF2 pathway variants
Niemann-Pick Ferroptosis Pathway with Modifiers

Complete ferroptosis pathway with modifier genes and FDA-approved drug targets. NPC1 oxidation triggers iron accumulation → lipid peroxidation → ferroptotic cell death. Modifier genes affect the cholesterol pathway (NPC1, NPC2, MALL, COX10, GCLC), sphingomyelin pathway (SMPD1, AGMO, PAH, NPC2), and lipid peroxidase pathway (ALOX5, AGMO, PSMB8). NRF2 activation by omaveloxolone targets multiple pathway components including ALOX5, PSMB8, GCLC, STING, and SLC40A1.

Why Traditional Approaches Miss This

Approach Limitation
Standard genetic testing Identifies the primary mutation but ignores modifier variants that determine severity
Whole genome sequencing Generates millions of variants but lacks the framework to identify which modifiers matter
Our platform Extracts subtype-defining signals from family data and integrates regulatory variants to reveal functional subtypes

Platform Applications

Our molecular subtyping platform enables precision medicine across diverse therapeutic areas and stakeholders.

Genetic Testing Companies

  • Molecular subtype classification for any condition
  • Pathway-based interpretation of variants
  • Actionable insights beyond standard VCF analysis
  • Clinical trial matching opportunities

Healthcare Systems

  • Subtype-specific treatment recommendations
  • Risk stratification for comorbidities
  • Prognostic information
  • Integration with existing EHR workflows

Pharmaceutical Companies

  • Identify mechanistically defined patient populations
  • Enrich clinical trials for responders
  • Reduce trial failure rates
  • Enable precision medicine drug development

Research Institutions

  • Identify novel disease subtypes
  • Generate mechanistic hypotheses
  • Prioritize therapeutic targets
  • Enable population-scale studies

Broader Applications

The methods validated in autism apply to any complex condition with a genetic component and available family-based or population genomic data. Potential applications include: psychiatric disorders (schizophrenia, bipolar disorder, ADHD), neurodevelopmental conditions (intellectual disability, epilepsy), cancer molecular subtypes, cardiovascular disease stratification, autoimmune conditions, rare disease characterization, pharmacogenomic response prediction, and complex trait architecture.

Latest Insights

Short, evidence-driven perspectives on what's working — and what's not — in precision medicine.

February 2026

Precision Medicine Is Avoiding Its Hardest Problem

Precision medicine promises healthcare tailored to the individual. But there's an uncomfortable truth: the "precision" part is still struggling — especially in genetics.

Polygenic risk scores (PRS) are often presented as a breakthrough. They combine thousands of tiny genetic signals to predict disease risk. Yet for many real-world clinical decisions, the improvements are small, inconsistent across populations, and rarely definitive for individual care.

Even in human height, one of the most heritable and best-studied traits, the largest genetic studies show a ceiling. Tens of thousands of common variants explain only part of the biology, leaving substantial variation outside the model.

Instead of closing those biological gaps, the industry is pivoting toward:

  • AI models
  • Wearables
  • Electronic health records
  • Behavioral and lifestyle data

These tools are valuable — but they are increasingly compensating for a core limitation: our genetic measurements remain incomplete.

PRS rely mostly on common SNPs, which are easy to measure but capture only a slice of genomic complexity. They largely miss structural variation, gene interactions, and regulatory effects that often drive real disease mechanisms.

The industry is building increasingly sophisticated analytics on top of a simplified view of genetics.

That's not precision. That's approximation.

If precision medicine is truly about the individual, we can't keep skimming the surface of the one dataset that is fundamentally individual: the genome itself. Until we confront that, AI will keep getting smarter while biology remains under-measured.

Partnership Opportunities

We are seeking strategic partners to apply our molecular subtyping platform across therapeutic areas.

We are open to collaborations with academic medical centers, pharmaceutical and biotechnology companies, genetic testing providers, and rare disease foundations. Our platform can be applied to existing datasets, making partnerships straightforward to initiate.

Get In Touch

Leadership

MG

Michael R. Garvin, Ph.D.

Founder & Chief Scientific Officer | Research Professor, University of New Mexico

Dr. Garvin is a molecular geneticist with over 20 years of experience in precision medicine and computational biology. He holds a Ph.D. from the University of Alaska Fairbanks and a Certificate in Personalized & Genomic Medicine from the University of Colorado Denver. His career spans Oak Ridge National Laboratory, Oregon State University, and biotechnology companies including Tularik (Amgen) and CV Therapeutics (Gilead).

His research focuses on identifying the genetic basis of complex diseases through novel computational approaches. With over 37 peer-reviewed publications in journals including Genome Biology, eLife, and PLoS Genetics, and over $5 million in NIH and DOE funding, Dr. Garvin has established a record in precision medicine research.

His COVID-19 research became the 2nd most viewed article in eLife history with over 154,000 views. His autism subtyping work is published in Frontiers in Molecular Neuroscience and HGG Advances. Dr. Garvin founded Williwaw Biosciences after discovering that family-based genomic data contains rich subtyping information systematically discarded by conventional analysis pipelines.


BG

Ben Garvin

Co-Founder & Chief Business Officer

Ben Garvin brings extensive experience in technology commercialization and strategic partnerships, with a track record of selling advanced algorithmic and data products to large enterprises.

As Chief Business Officer, Ben leads business development, strategic partnerships, and commercialization efforts for Williwaw Biosciences.

Contact

Interested in applying our platform to your therapeutic area or exploring partnership opportunities?

mike@williwawbio.com

For clinical partnerships, licensing inquiries, or investment opportunities, please reach out directly.