Yusuf Roohani
yusufroohani.bsky.social
Machine Learning & Systems Biology. ML Group Leader @arcinstitute. PhD @StanfordAILab

http://www.yusufroohani.com
We're hiring! Come join the team and scale new heights with us! 🏔️

arcinstitute.org/jobs
February 25, 2025 at 2:35 PM
scBaseCamp is released as part of the Arc Virtual Cell Atlas!

Great work by Nick Youngblut, Chris Carpenter, Alex Dobin, Dave Burke, @genophoria.bsky.social and team

📢Announcement: arcinstitute.org/news/news/ar...

🔗Data access: github.com/ArcInstitute...

📄Report: arcinstitute.org/manuscripts/...
Arc Virtual Cell Atlas launches, combining data from over 300 million cells | Arc Institute
Arc Institute today launched the Arc Virtual Cell Atlas, a growing resource for computation-ready single-cell measurements, starting with data from over 300 million cells. The initial release of the A...
arcinstitute.org
February 25, 2025 at 2:35 PM
Uniform processing lowers technical variation between scBaseCamp datasets.

Technical factors such as library chemistry and suspension type (single-cell vs single-nucleus) exhibited comparable or lower silhouette scores than biologically meaningful categories like tissue type
February 25, 2025 at 2:35 PM
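
For readers unfamiliar with the metric, here is a minimal sketch of how a silhouette comparison like the one above could be run. It is not the scBaseCamp analysis itself: the data is synthetic, and the `mean_silhouette` helper, the `tissue` and `chemistry` labels, and the embedding `X` are all hypothetical. A label whose groups separate cleanly in the embedding scores close to 1, while a label that explains little of the structure scores near 0.

```python
import numpy as np

def mean_silhouette(X, labels):
    """Mean silhouette score of the rows of X grouped by `labels` (Euclidean)."""
    X = np.asarray(X, dtype=float)
    labels = np.asarray(labels)
    # Pairwise Euclidean distances between all points.
    dist = np.linalg.norm(X[:, None, :] - X[None, :, :], axis=-1)
    groups = np.unique(labels)
    scores = np.zeros(len(X))
    for i in range(len(X)):
        same = labels == labels[i]
        n_same = same.sum()
        if n_same == 1:
            continue  # singleton group: score is defined as 0
        # Mean distance to the other members of the point's own group
        # (dist[i, i] is 0, so dividing by n_same - 1 excludes the point itself).
        a = dist[i, same].sum() / (n_same - 1)
        # Smallest mean distance to any other group.
        b = min(dist[i, labels == g].mean() for g in groups if g != labels[i])
        scores[i] = (b - a) / max(a, b)
    return scores.mean()

# Synthetic embedding (assumption): 100 "cells" in 5 dimensions,
# shifted by tissue but not by library chemistry.
rng = np.random.default_rng(0)
tissue = np.repeat(["liver", "brain"], 50)
chemistry = np.tile(["v2", "v3"], 50)
X = rng.normal(size=(100, 5))
X[tissue == "brain"] += 4.0

tissue_score = mean_silhouette(X, tissue)
chemistry_score = mean_silhouette(X, chemistry)
print(f"tissue: {tissue_score:.2f}  chemistry: {chemistry_score:.2f}")
```

In this toy setup the tissue label scores well above the chemistry label, which is the pattern the post describes: a lower score for a technical factor suggests it drives less of the structure in the data.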
scBaseCamp is the first large biological data repository curated by an AI agent

We built a hierarchical agentic workflow (SRAgent) to automate discovery, metadata extraction & data processing

It is consistent, easily scalable and automatically updates when new data is available
February 25, 2025 at 2:35 PM
scBaseCamp was built by directly mining all publicly accessible 10x Genomics scRNA-seq data from the Sequence Read Archive (SRA)

With over 230M cells drawn from 21 species and 72 tissues, scBaseCamp is significantly larger and more diverse than existing single-cell data repositories
February 25, 2025 at 2:35 PM
Why do you want to switch?
December 4, 2024 at 9:01 PM
very easy to do this in PyCharm
December 4, 2024 at 8:43 PM