Sebastian Paez
banner
jspaezp.bsky.social
Sebastian Paez
@jspaezp.bsky.social
Data scientist @ Talus Bio.
Kind of into data, proteomics, open source and biology.
I feel like that question is a trap ...
November 26, 2025 at 7:38 PM
How do you read from the wiff directly? (I am not super familiar with the state of SDKs for wiff/wiff2, thanks beforehand)
May 13, 2025 at 11:24 PM
I think @ypriverol.bsky.social might have some hard data on this one
May 6, 2025 at 6:42 PM
In the past I have written stuff to delete things more than X years old that are not raw file and that seems to already give a pretty good compromise. pdresults, pep.xmls, maxquant .peaks and that kind of stuff are massive files. (If I recall my past life that was taking up ~75% of the space)
March 5, 2025 at 11:39 PM
Well that is false ... I calculated it for 120 TB, not 380 ... so it would actually be ~ 475 USD/month
March 5, 2025 at 6:34 AM
Couldnt help myself from crunching the numbers of how much this would be in the cloud ... turns out its ~ 155 bucks/month cloud.google.com/products/cal...
Google Cloud Pricing Calculator
Create your own Custom Price Quote for the products offered through Google Cloud based on number, usage, and power of servers
cloud.google.com
March 5, 2025 at 6:33 AM
It also depends on the tool/acquisition method. Some can be understood as missing at random and some cannot (or at least different ratios of the two)
January 30, 2025 at 10:01 AM
bsky is descending into degeneracy D:
bsky.app/profile/ucdp...
@michaellazear.bsky.social & @jspaezp.bsky.social I'm Running Sage on a dual Epyc 128 thread box w/ 2TB memory (I think) . Searching 19 ddaPASF files directly with built in timsRUST!! Dang this is screaming fast !!
January 22, 2025 at 10:42 PM
@swillems.bsky.social do you have any insights on this one ? From the data I've seen the pro data is a bit larger than the ultra series ... Maybe you used compression on the pro and not the ultra ?
January 20, 2025 at 4:30 PM
Just fyi ... Lfq is something that is not supported on the releases yet but we are thinking on how to have a good implementation for it. (We have an experimental implementation .. DM me if you want to try it out)
January 16, 2025 at 4:16 PM
Why did the project name have a space ? What kind of savagery is that? :P
December 31, 2024 at 5:15 PM
I think it is very interesting but I was wondering why your approach does not deal explicitly with missing values (more accurately, missing values are excluded from the CV calculation). Is the assumption that all missing values are missing at random here?
November 25, 2024 at 7:10 PM
Some of us like writting the software a lot more than any of those :P
November 15, 2024 at 10:14 PM