The absolute scale of reported abundances is more or less arbitrary. For DIA data, what is typically reported is area below the XIC peak. Deal with that, the question becomes whether you integrate over seconds ore minutes. Makes an absolute difference of 60x but the choice is arbitrary.
August 19, 2025 at 11:03 AM
The absolute scale of reported abundances is more or less arbitrary. For DIA data, what is typically reported is area below the XIC peak. Deal with that, the question becomes whether you integrate over seconds ore minutes. Makes an absolute difference of 60x but the choice is arbitrary.
For the name "Kuiper" I wanted to continue the space-theme we have going in Spectronaut. Kuiper was good fit for the name as like the Immuno-Class 1 search space, the Kuiper belt consists of millions of small objects.
June 27, 2025 at 9:57 AM
For the name "Kuiper" I wanted to continue the space-theme we have going in Spectronaut. Kuiper was good fit for the name as like the Immuno-Class 1 search space, the Kuiper belt consists of millions of small objects.
The focus for Kuiper was on tackling unspecific searches. The open-search aspect was an unplanned side effect of how the unspecific engine was designed and fell into place almost by itself.
June 25, 2025 at 4:24 AM
The focus for Kuiper was on tackling unspecific searches. The open-search aspect was an unplanned side effect of how the unspecific engine was designed and fell into place almost by itself.
That is incorrect. Kuiper at its core is an open search engine. We simply chose to only report those matches we can afterwards resolve against existing modifications rather then reporting a list delta masses. Calling it "not truly open" is a technicality at this point.
June 25, 2025 at 4:14 AM
That is incorrect. Kuiper at its core is an open search engine. We simply chose to only report those matches we can afterwards resolve against existing modifications rather then reporting a list delta masses. Calling it "not truly open" is a technicality at this point.
One important part I forgot to mention, but Kuiper is completely data driven (purely an algorithmic solution) that is not dependent on any pre-trained HLA predictors. You don't need any prior knowledge of your samples to do a library-free, directDIA search using SN20 and Kuiper.
June 6, 2025 at 7:25 AM
One important part I forgot to mention, but Kuiper is completely data driven (purely an algorithmic solution) that is not dependent on any pre-trained HLA predictors. You don't need any prior knowledge of your samples to do a library-free, directDIA search using SN20 and Kuiper.
Working on Kuiper was a lot of fun. From the first prototype developed roughly a year ago, to the first integrated version in October to countless improvements and optimizations all the way to what was released now in Spectronaut 20. Looking forward to all of your feedback.
Oli
June 6, 2025 at 7:21 AM
Working on Kuiper was a lot of fun. From the first prototype developed roughly a year ago, to the first integrated version in October to countless improvements and optimizations all the way to what was released now in Spectronaut 20. Looking forward to all of your feedback.
But one of the coolest things about Kuiper, I already teased in an earlier post. Kuiper is so powerful, that you can do an unspecific open modification search on a full 7-52 length cell-lysate search space with only a minor increase in search time compared to a closed search.
June 6, 2025 at 7:21 AM
But one of the coolest things about Kuiper, I already teased in an earlier post. Kuiper is so powerful, that you can do an unspecific open modification search on a full 7-52 length cell-lysate search space with only a minor increase in search time compared to a closed search.
We obviously spent a lot of time making sure that the results you get are trustworthy. Not only from a statistical point of view but also using biological result validation. e.g.: we could show that the overlapping peptides between individual patients correspond to their shared HLA allele.
June 6, 2025 at 7:21 AM
We obviously spent a lot of time making sure that the results you get are trustworthy. Not only from a statistical point of view but also using biological result validation. e.g.: we could show that the overlapping peptides between individual patients correspond to their shared HLA allele.
The runtime improvements are what is most impressive in my eyes. We can now do unspecific class-1 and class-2 in ~10 minutes per DIA raw file (library free, directDIA including all steps). That marks a 78% and 86% reduction in search time compared to SN19.
June 6, 2025 at 7:21 AM
The runtime improvements are what is most impressive in my eyes. We can now do unspecific class-1 and class-2 in ~10 minutes per DIA raw file (library free, directDIA including all steps). That marks a 78% and 86% reduction in search time compared to SN19.
Our focus was primarily on immunopeptidomics (class 1 and 2) which is where you will see the biggest total gains in terms of IDs (+75% on average for class-1 and +18% for class-2).
June 6, 2025 at 7:21 AM
Our focus was primarily on immunopeptidomics (class 1 and 2) which is where you will see the biggest total gains in terms of IDs (+75% on average for class-1 and +18% for class-2).
Kuiper is a novel search-engine unlike anything you have seen before. It uses a new approach for MS2 indexing that is incredibly light-weight (full unspecific MS2 index of a 7-52 peptide length search space only occupies about 5GB in memory). Together with Pulsar, it can process DIA as well as DDA.
June 6, 2025 at 7:21 AM
Kuiper is a novel search-engine unlike anything you have seen before. It uses a new approach for MS2 indexing that is incredibly light-weight (full unspecific MS2 index of a 7-52 peptide length search space only occupies about 5GB in memory). Together with Pulsar, it can process DIA as well as DDA.
Most search-engines today start their development focus on the tryptic search case and are then expanded to other use-cases. We thought "how would a modern search engine look like when you start out from the unspecific use-case". What we came up with, is Kuiper.
June 6, 2025 at 7:21 AM
Most search-engines today start their development focus on the tryptic search case and are then expanded to other use-cases. We thought "how would a modern search engine look like when you start out from the unspecific use-case". What we came up with, is Kuiper.
Compared to a fully specific tryptic search-space, unspecific search-spaces (which produce every possible sub-sequence of the specified length-range), very quickly explode in search-space complexity. A fully unspecific search-space (length 7-52) is almost 900x larger then the tryptic equivalent.
June 6, 2025 at 7:21 AM
Compared to a fully specific tryptic search-space, unspecific search-spaces (which produce every possible sub-sequence of the specified length-range), very quickly explode in search-space complexity. A fully unspecific search-space (length 7-52) is almost 900x larger then the tryptic equivalent.
We have been looking into it for some selected datasets and also more systematically for our AI training data. So far the conclusion was that, while we did not see a lot of it, we were discussing adding a detection and flagging in the final results. But that won't be in the 20.0 release yet.
May 31, 2025 at 4:32 AM
We have been looking into it for some selected datasets and also more systematically for our AI training data. So far the conclusion was that, while we did not see a lot of it, we were discussing adding a detection and flagging in the final results. But that won't be in the 20.0 release yet.
I will be presenting Kuiper (and especially what Spectronaut 20 can do for Immunopeptidomics) at @biognosys.bsky.social breakfast seminar on Tuesday and also at my poster (TP 183)
May 30, 2025 at 12:47 PM
I will be presenting Kuiper (and especially what Spectronaut 20 can do for Immunopeptidomics) at @biognosys.bsky.social breakfast seminar on Tuesday and also at my poster (TP 183)
When I joined Biognosys in 2011 as part of my master thesis to develop a software for DIA / SWATH data analysis. Never done any mass-spec before that and my proteomics knowledge was primarily down to what we did on 3D structures during my studies.
March 21, 2025 at 12:33 PM
When I joined Biognosys in 2011 as part of my master thesis to develop a software for DIA / SWATH data analysis. Never done any mass-spec before that and my proteomics knowledge was primarily down to what we did on 3D structures during my studies.