- MUSET can do 618 GB in <10 hours and <20GB RAM (with filtering of rows).
- 2-3 orders of magnitude matrix size reduction.
- Fast, low memory (but high disk).
- Direct abundance matrix generation.
- MUSET can do 618 GB in <10 hours and <20GB RAM (with filtering of rows).
- 2-3 orders of magnitude matrix size reduction.
- Fast, low memory (but high disk).
- Direct abundance matrix generation.
MUSET's pipeline is straightforward yet powerful 🛠️
- Uses kmtricks for efficient k-mer counting.
- Applies intelligent filtering based on their abundance in the samples.
- Generates unitigs using GGCAT.
- Constructs compact abundance matrices relying on SSHash.
MUSET's pipeline is straightforward yet powerful 🛠️
- Uses kmtricks for efficient k-mer counting.
- Applies intelligent filtering based on their abundance in the samples.
- Generates unitigs using GGCAT.
- Constructs compact abundance matrices relying on SSHash.
💡 Key innovation? We track not just presence/absence but actual sequence abundance across samples. 📈
💡 Key innovation? We track not just presence/absence but actual sequence abundance across samples. 📈