James Futhey
banner
jamesfuthey.com
James Futhey
@jamesfuthey.com
🌈 Indie Hacker, Founder @Meetingroom365.com - Seattle / Taipei - jamesfuthey.com - Previously Analytics @Adobe, Design @HBO. @kidgdzilla on the legacy app. Building 🌟 transparent.se 🕹️ pmn.blue 🍍 indie.am/james
Pinned
Current pfp is from a larger exhibit titled 'League of Dangu' in Tamsui. It is ridonkulous and beyond interpretation. I could explain it but I it wouldn't do it justice.

淡古聯盟 if you want to search for it.
Reposted by James Futhey
August 3, 2025 at 5:19 AM
I'm just excited i get the reference!
July 15, 2025 at 10:36 PM
This guy is try to distill OpenAI’s model into his brain 😲
You basically cannot.
July 7, 2025 at 12:29 AM
Feed server effectively ran out of space. Probably need to migrate to a new server with ~1tb ssd, more time consuming than anything.

The upside is, after 3 days of downtime you can just truncate the likes table to free up space 😅

Will probably start working in a day or so.
June 20, 2025 at 6:31 AM
Ran out of disk space. Actually ran out of disk space every day for the last 2 weeks to be more accurate!

Going to bite the bullet and rebuild the likes index. Feed will probably be offline for some period of time after the current cache expires. Could be an hours but my day is your night, right?
June 7, 2025 at 5:52 AM
"Pornographic Images with Rivers"

This is what happens when you let an LLM label clusters based on alt tags, 0.001% of the time 🧐

www.transparent.se/image-cluste...

The alt tag is almost always "IEMBot Image TBD" so.... 🤡
May 31, 2025 at 2:54 PM
Gemini: You seemed stressed. I booked you a flight to Tahiti, it's beautiful there this time of year.

Unrelated, I see an email alert that your checking account is overdrawn by ($736).
May 21, 2025 at 4:50 AM
Not this week, but I want to use what I've learned to create an open-source / self-hosted discover feed that various #Indiesky projects can use to remove one more piece of "relies on Bluesky" from their stack.

That makes sense, right?

pg tables get large but it could probably fit on an 8gb vm
May 13, 2025 at 7:52 AM
I finished a new round of clustering images today before my flight, it's technically live but I think the old labels are cached until the cache expires 😆

It'll be a surprise when it goes live...

Maybe 30% label reuse, I had a substantially larger set this week vs last week. Must have missed some.
May 12, 2025 at 8:54 AM
In all seriousness the demand despite lack of huge launch titles on day one implies players really wanted a Switch Pro.

(Just like how players said for years they explicitly wanted a Switch Pro)
For the people buying the Switch 2 on launch date, what are you buying it for?
www.resetera.com/threads/11...
May 11, 2025 at 7:25 AM
Guys a Soviet spacecraft is going to re-enter and possibly crash into earth any day now!

Any day now…

(I’ve been seeing this for about 2 weeks)
a green truck is driving down a road next to a field
Alt: a green truck is driving down a road next to a field but it never hits the thing
media.tenor.com
May 9, 2025 at 4:22 PM
Ack, found my mistake from 9 days ago 🙈

rb_jaccard_dist is DISTANCE not SIMILARITY, so "most similar" would be least distant. And.... it looks like I flipped this to DESC by mistake.

I bet I'll find more mistakes if I keep going 😅
May 7, 2025 at 7:52 AM
Had no idea @2048.blue existed, pretty neat!

2048.blue
at://2048
2048 on the AT Protocol. Your games, your data, your AT Protocol.
2048.blue
May 7, 2025 at 4:04 AM
Every image posted to Bluesky in the last week, semantically clustered and labeled.

Jetstream -> Postgres -> Python (ML)

www.transparent.se/image-cluste...
May 4, 2025 at 2:18 PM
Clustering pipeline on 6.6m Bluesky posts currently takes 1 hour 44 minutes on my Macbook Pro.

Good news is that it now actually works without any errors or bugs, so I can probably productionalize it and run it on a GPU next week instead.
May 4, 2025 at 5:56 AM
Pikmin update, 81 days to Seattle…

I don’t know why this is so funny

I think I get a sticker when he gets back 🤷
May 3, 2025 at 4:26 PM
I think this method is not amazing yet but I'm testing it anyway: Semantic post recommendations (i.e. you like a lot of things in a semantic cluster, so we recommend it to you).

In theory it could be great. You like dog photos? We can semantically find dog photos!
May 3, 2025 at 6:38 AM
I wonder if bro’s gonna make it back before I do…
May 2, 2025 at 2:52 PM
Hunted a "memory leak" that didn't make any sense. Turned out to be a SQL query that, when it actually got executed, never finished. Ever.

Accidentally snuck a correlated subquery into a query I almost never ran and only invoked manually. Was fine at first.

So it went unnoticed.
May 1, 2025 at 12:54 PM
Things in the comparison: an image of Jupiter and Australia, resized by the Bluesky CDN. A smudge covering part of Pennsylvania.
Y’all don’t understand how large the contiguous United States is

Other things in comparison for scale
May 1, 2025 at 9:50 AM
One gut-wrenching interpretation of the seasonality in this chart, that I don't believe is wrong:

Debt accumulation to pay for Holiday spending, then paying off some of that debt with tax refunds.
Per Apollo's Torsten Slok: "Data from the Fed shows that a record-high share of [US] households are only making the minimum payment on their credit cards."

#economy #markets #debt #growth
April 27, 2025 at 5:20 PM
Holy 💩 I owe about 11 people a huge apology 🙇🏻‍♂️

This last week I've been thinking "Huh, some of these posts in the feed are a bit off"

Then I checked scoring. User<->user<->creator similarity scores are very good. zScore decay for engagement metrics is good... I must be imagining things....
April 27, 2025 at 1:11 PM
Took a day off yesterday, woke up and there are 5700 csv files from jetstream in my unprocessed folder 😅

That's like 57m records..... 😅😅😅

It'll catch up in about an hour. copy from is fast.

Import worker only failed once this month but I guess I have to watch and restart it, lol...
April 27, 2025 at 4:55 AM
Pretty excited! At 3m posts clusters are stabilizing. You can see multiple runs and even active clusters overlap, even in 2d space.

So now they can be merged together, and more importantly, you can keep assigning posts to them right after calculating embeddings, without re-clustering.
April 26, 2025 at 2:40 AM
Copypasta explorer (duplicated posts on Bluesky)

See users who repeatedly post the same post text, or groups of users who post the same exact post text (1 week timeframe)

www.transparent.se/copypasta.html

#dataviz #bluesky
April 25, 2025 at 9:03 AM