Andrew Carroll
acarroll.bsky.social
Andrew Carroll
@acarroll.bsky.social
Product lead Genomics Google Research
Reposted by Andrew Carroll
🐧We researched one of the world’s rarest #penguins. The yellow‑eyed penguin (aka hoiho/takaraka) isn’t one homogeneous species after all!

www.biorxiv.org/content/10.1...

#hoiho #conservation #genomics #birds #nzwildlife #endangered #wildlife #nature
October 28, 2025 at 8:08 PM
Reposted by Andrew Carroll
Accurate somatic small variant discovery for multiple sequencing technologies with DeepSomatic www.nature.com/articles/s41... (read free: rdcu.be/eLny0) github.com/google/deeps...
October 16, 2025 at 7:29 PM
Reposted by Andrew Carroll
Delighted to finally announce a preprint describing the Q100 project! “A complete diploid human genome benchmark for personalized genomics” For which we finished HG002 to near-perfect accuracy: www.biorxiv.org/content/10.1... 🧵[1/14]
A complete diploid human genome benchmark for personalized genomics
Human genome resequencing typically involves mapping reads to a reference genome to call variants; however, this approach suffers from both technical and reference biases, leaving many duplicated and ...
www.biorxiv.org
September 22, 2025 at 5:01 PM
I'll be speaking in this webinar (go.roche.com/sbx-d) on September 10, where I'll share our benchmarks and observations for Roche's SBX sequencing instrument, as well as models developed by our team for SBX data.
Germline Small Variant Calling Workflow for SBX Duplex Data
Wednesday, September 10, 2025 at 12:00 PM Eastern Daylight Time.
go.roche.com
August 21, 2025 at 12:18 AM
Release of DeepVariant and DeepSomatic v1.9

DV: Now train on HG002 T2T-Q100. Error reduction of 12% for Illumina and 30% for PacBio on this truth set. 25% faster. DeepTrio is 5x faster (20h -> 4h).

DS: New models FFPE_TUMOR_ONLY for {WGS, WES}. Much improved WGS models.

github.com/google/deepv...
Release DeepVariant 1.9.0 · google/deepvariant
DeepVariant: In this version we have updated our training scheme for the HG002 sample with the newly released HG002-T2T truth set which improves accuracy against that truth set. Our labeling metho...
github.com
May 13, 2025 at 8:23 PM
Reposted by Andrew Carroll
Incredibly moving Justin Trudeau remarks:

"We have fought and died alongside you....During your darkest hours...we were always there. Standing with you, grieving with you, the American people....Canadians are a little perplexed as to why our closest friends and neighbors are choosing to target us."
February 2, 2025 at 3:39 AM
Release of DeepVariant 1.8. Large speed improvement (~67% faster) via small model for easy sites. New Pangenome-aware option. Reduces error by ~30% for vg-mapped WGS, ~10% for BWA WGS, ~5% BWA exome. New config for custom model users, see release notes.

(github.com/google/deepv...)
December 5, 2024 at 5:57 PM
Reposted by Andrew Carroll
How do we make a pangenome maximally relevant for the study of a new sample? www.biorxiv.org/content/10.1...
Personalized Pangenome References
bioRxiv - the preprint server for biology, operated by Cold Spring Harbor Laboratory, a research and educational institution
www.biorxiv.org
December 15, 2023 at 3:34 PM
Release of DeepVariant v1.6.

Support for haploid regions, chrX/Y.
Workflow for Pangenome FASTQ-to-VCF.
Major DeepTrio improvements for de novo variants.
Models for CompleteGenomics T7, G400
Add NovaSeqX to training data

Release by Kishwar Shafin

github.com/google/deepv...
Release DeepVariant 1.6.0 · google/deepvariant
Improved support for haploid regions, chrX and chY. Users can specify haploid regions with a flag. Updated case studies show usage and metrics. Added pangenome workflow (FASTQ-to-VCF mapping with V...
github.com
October 26, 2023 at 4:32 PM
Reposted by Andrew Carroll
Proud of Ayse Keskus and Asher Bryant in my group for making this happen! This work is a collaboration with Children's Mercy, UCSC and Google Health - who are also releasing the first version of DeepSomatic today: github.com/google/deeps...
October 24, 2023 at 4:57 PM
Initial release of DeepSomatic, which identifies subclonal variants when given tumor and normal BAM files. Pre-trained models and case studies available for Illumina and PacBio. Development led by Kishwar Shafin which built off a framework by Pi-Chuan Chang.

(github.com/google/deeps...)
GitHub - google/deepsomatic
Contribute to google/deepsomatic development by creating an account on GitHub.
github.com
October 24, 2023 at 4:53 PM
Reposted by Andrew Carroll
Best resource for getting extracellular domain localization from Ensembl gene/protein IDs?

I tried the subcellular locations in HPA but the membrane annotation is mostly fully intracellular proteins.

🧪🧬🖥️🔬
September 14, 2023 at 1:41 AM