淡古聯盟 if you want to search for it.
The upside is, after 3 days of downtime you can just truncate the likes table to free up space 😅
Will probably start working in a day or so.
Going to bite the bullet and rebuild the likes index. Feed will probably be offline for some period of time after the current cache expires. Could be hours, but my day is your night, right?
This is what happens when you let an LLM label clusters based on alt tags, 0.001% of the time 🧐
www.transparent.se/image-cluste...
The alt tag is almost always "IEMBot Image TBD" so.... 🤡
Unrelated, I see an email alert that your checking account is overdrawn by ($736).
That makes sense, right?
pg tables get large but it could probably fit on an 8gb vm
It'll be a surprise when it goes live...
Maybe 30% label reuse, I had a substantially larger set this week vs last week. Must have missed some.
(Just like how players said for years they explicitly wanted a Switch Pro)
www.resetera.com/threads/11...
Any day now…
(I’ve been seeing this for about 2 weeks)
rb_jaccard_dist is DISTANCE not SIMILARITY, so "most similar" would be least distant. And.... it looks like I flipped this to DESC by mistake.
I bet I'll find more mistakes if I keep going 😅
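For anyone following along, here is a minimal sketch of the distance-vs-similarity mix-up in plain Python. It is not the actual query: `jaccard_dist`, `most_similar`, and the sample like sets are hypothetical stand-ins, and I'm assuming rb_jaccard_dist behaves like the standard Jaccard distance (1 minus intersection over union).

```python
def jaccard_dist(a: set, b: set) -> float:
    """Standard Jaccard distance: 1 - |A & B| / |A | B|. 0.0 means identical sets."""
    union = a | b
    if not union:
        return 0.0  # convention picked for this sketch when both sets are empty
    return 1.0 - len(a & b) / len(union)


def most_similar(target: set, candidates: dict, k: int = 3) -> list:
    """Return the k candidate names closest to target."""
    # Smallest distance first (ascending), i.e. the ORDER BY ... ASC direction.
    # Sorting descending here would return the LEAST similar candidates.
    ranked = sorted(candidates.items(), key=lambda kv: jaccard_dist(target, kv[1]))
    return [name for name, _ in ranked[:k]]


# Hypothetical like sets keyed by user (post ids are made up).
likes = {
    "alice": {1, 2, 3, 4},
    "bob": {1, 2, 3, 5},
    "carol": {7, 8, 9},
}
me = {1, 2, 3, 4}
print(most_similar(me, likes, k=2))
```

With the sort flipped to descending, "carol" (no overlap at all) would come out on top, which is the kind of "a bit off" result the DESC mistake produces.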
Jetstream -> Postgres -> Python (ML)
www.transparent.se/image-cluste...
Good news is that it now actually works without any errors or bugs, so I can probably productionalize it and run it on a GPU next week instead.
I don’t know why this is so funny
I think I get a sticker when he gets back 🤷
In theory it could be great. You like dog photos? We can semantically find dog photos!
Accidentally snuck a correlated subquery into a query I almost never ran and only invoked manually. Was fine at first.
So it went unnoticed.
Other things in comparison for scale
Debt accumulation to pay for Holiday spending, then paying off some of that debt with tax refunds.
This last week I've been thinking "Huh, some of these posts in the feed are a bit off"
Then I checked scoring. User<->user<->creator similarity scores are very good. zScore decay for engagement metrics is good... I must be imagining things....
That's like 57m records..... 😅😅😅
It'll catch up in about an hour. COPY FROM is fast.
Import worker only failed once this month but I guess I have to watch and restart it, lol...
So now they can be merged together, and more importantly, you can keep assigning posts to them right after calculating embeddings, without re-clustering.
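One way the "assign without re-clustering" step could look, as a rough sketch only. I'm assuming each cluster is summarized by a centroid vector and that a new post goes to the nearest centroid by cosine similarity; the names `cosine_sim`, `assign_cluster`, the `min_sim` cutoff, and the toy 2-d vectors are all hypothetical and not taken from the real pipeline.

```python
import math


def cosine_sim(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(u, v))
    norm = math.sqrt(sum(x * x for x in u)) * math.sqrt(sum(y * y for y in v))
    return dot / norm if norm else 0.0


def assign_cluster(embedding, centroids, min_sim=0.5):
    """Return the label of the most similar centroid, or None if none reaches min_sim."""
    best_label, best_sim = None, min_sim
    for label, centroid in centroids.items():
        sim = cosine_sim(embedding, centroid)
        if sim >= best_sim:
            best_label, best_sim = label, sim
    return best_label


# Toy centroids; real embeddings would have hundreds of dimensions.
centroids = {"dogs": [1.0, 0.0], "weather maps": [0.0, 1.0]}
print(assign_cluster([0.9, 0.1], centroids))    # close to the "dogs" centroid
print(assign_cluster([-1.0, -1.0], centroids))  # nothing close enough, stays unassigned
```

Posts that come back as None could be held for the next full clustering run instead of being forced into a bad label.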
See users who repeatedly post the same post text, or groups of users who post the same exact post text (1 week timeframe)
www.transparent.se/copypasta.html
#dataviz #bluesky
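Roughly how the two groupings could be computed, as a sketch under stated assumptions rather than what the page actually runs: posts are taken to be (author, text, created_at) tuples, matching is exact after stripping whitespace, and `find_copypasta` and its parameters are hypothetical names.

```python
from collections import defaultdict
from datetime import datetime, timedelta


def find_copypasta(posts, now, window=timedelta(days=7), min_repeats=2):
    """posts: iterable of (author, text, created_at) tuples.

    Returns (repeat_posters, shared_texts):
      repeat_posters maps (author, text) -> count, for authors who posted
        the same text at least min_repeats times inside the window;
      shared_texts maps text -> sorted list of authors, for texts posted
        by two or more distinct authors inside the window.
    """
    authors_by_text = defaultdict(list)
    for author, text, created_at in posts:
        if now - created_at <= window:  # keep only the last `window` of posts
            authors_by_text[text.strip()].append(author)

    repeat_posters, shared_texts = {}, {}
    for text, authors in authors_by_text.items():
        counts = defaultdict(int)
        for author in authors:
            counts[author] += 1
        for author, n in counts.items():
            if n >= min_repeats:
                repeat_posters[(author, text)] = n
        if len(counts) >= 2:
            shared_texts[text] = sorted(counts)
    return repeat_posters, shared_texts
```

Exact matching is the simplest version; near-duplicate text (one changed emoji, a different link) would need some normalization or fuzzy hashing on top.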