Try it → evaluate variant pathogenicity with interpretable, residue-aware probabilities.
bio3d.cs.huji.ac.il/webserver/ra...
Try it → evaluate variant pathogenicity with interpretable, residue-aware probabilities.
bio3d.cs.huji.ac.il/webserver/ra...
Mis-calibrated probabilities = inconsistent evidence for clinical classification (and you may also improve AUROC).
Mis-calibrated probabilities = inconsistent evidence for clinical classification (and you may also improve AUROC).
RaCoon improves ESM1b substantially:
• ClinVar AUROC: 0.930 → 0.941
• ProteinGym AUROC: 0.912 → 0.924
• Per-protein AUROC improves too
• Calibration error (ECE/MCE) drops across all subgroups
RaCoon improves ESM1b substantially:
• ClinVar AUROC: 0.930 → 0.941
• ProteinGym AUROC: 0.912 → 0.924
• Per-protein AUROC improves too
• Calibration error (ECE/MCE) drops across all subgroups
1. Split variants by key residue properties
2. Fit GMMs to benign vs. pathogenic scores in each subgroup
3. Convert raw ESM1b LLRs into calibrated probabilities
4. No direct label exposure → labels only used for prior estimation
1. Split variants by key residue properties
2. Fit GMMs to benign vs. pathogenic scores in each subgroup
3. Convert raw ESM1b LLRs into calibrated probabilities
4. No direct label exposure → labels only used for prior estimation
(It calibrates like a raccoon sorts trash: by categories. 😉)
(It calibrates like a raccoon sorts trash: by categories. 😉)