Samhan
samhan.bsky.social
Reposted by Samhan
might be time
September 20, 2025 at 10:35 PM
Reposted by Samhan
Some notes on the new DeepSeek-R1-0528 - a completely different model from the R1 they released in January, despite having a very similar name

Terrible LLM naming has managed to infect the Chinese AI labs too

simonwillison.net/2025/May/31/...
deepseek-ai/DeepSeek-R1-0528
Sadly the trend for *terrible naming* of models has infested the Chinese AI labs as well. DeepSeek-R1-0528 is a brand new and much improved open weights reasoning model from DeepSeek, …
simonwillison.net
May 31, 2025 at 10:02 PM
Reposted by Samhan
In case it interests anyone, I managed to set up a demo of GRPO RL training in Colab. It’s an adaptation of Will Brown's instant classic for math reasoning. It swaps Llama 1B for Qwen 0.5B and uses vLLM for inference. Full training takes about 2 hours.

colab.research.google.com/drive/1bfhs1...
February 2, 2025 at 1:49 PM
Reposted by Samhan
“The drop in demand for IT professions could signal long-term shifts due to technological changes, particularly artificial intelligence.”

www.swissinfo.ch/eng/workplac...
January 23, 2025 at 5:55 AM
Reposted by Samhan
Painting of a group of hatchling Tyrannosaurus rex, showcasing the fact that even these massive predators started out as fluffy little youngsters! #paleoart #dinosaurart #trex
October 30, 2024 at 6:10 PM
Reposted by Samhan
Reid Hoffman’s Financial Times op-ed captures Silicon Valley VCs’ hopes and fears about Trump. They hope for eased scrutiny on tech acquisitions, relaxed crypto/AI regulations, and nuclear energy support.

Their fear? Elon Musk as a power broker picking winners and losers. Both outcomes seem likely.
November 23, 2024 at 6:35 PM
Reposted by Samhan
Many people are saying. ;) hdsr.mitpress.mit.edu/pub/8dqgwqiu...
November 22, 2024 at 8:41 PM
Reposted by Samhan
I did it! I wrote up my Posting Middle Classes Theory, and tried to explain why I think Threads failed where Bluesky succeeded: youngvulgarian.substack.com/p/down-in-th... (free to read)
November 22, 2024 at 11:38 AM
Reposted by Samhan
Daily paper:
GPT-4V(ision) is a Generalist Web Agent, if Grounded
arxiv.org/abs/2401.01614
The recent development on large multimodal models (LMMs), especially GPT-4V(ision) and Gemini, has been quickly expanding the capability boundaries of multimodal models beyond traditional tasks like i...
arxiv.org
November 20, 2024 at 9:22 PM
Reposted by Samhan
Alibaba has their own version of GPT-o1. This might be the best description of “o1-type” systems so far arxiv.org/abs/2411.14405
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
Currently OpenAI o1 has sparked a surge of interest in the study of large reasoning models (LRM). Building on this momentum, Marco-o1 not only focuses on disciplines with standard answers, such as mat...
arxiv.org
November 22, 2024 at 12:18 PM
Reposted by Samhan
Winter is here in Switzerland!
November 22, 2024 at 6:56 AM
Reposted by Samhan
If you're new to Bluesky, or just want to understand how it's profoundly different from platforms like X, I wrote this article last year about federated social media -- Bluesky and Mastodon specifically -- and how they fit into the US history of federalism. www.theatlantic.com/technology/a...
Ben Franklin Would Have Loved Bluesky
Facebook and Twitter seem less relevant by the day. They may be replaced by new “federated” platforms.
www.theatlantic.com
November 18, 2024 at 3:02 AM
Reposted by Samhan
The #1 overall free app on the App Store (iOS) and Google Play (Android) is... Bluesky??

This is pretty incredible from an independent team, with a fraction of the budget of most other apps on these charts.

(This represents current downloads, but it's still huge. Congrats Bluesky team!)
November 17, 2024 at 8:43 PM
Reposted by Samhan
How cool is this: this is from the Bluesky firehose!

What's so cool about this is:

1. The throwback style

2. That *any* dev can do this! Thanks to there being an API for this (no other social network allows this AFAIK)

This API is a big reason to be bullish on Bluesky

firehose3d.theo.io
November 17, 2024 at 9:40 AM
Reposted by Samhan
Some folks look to be using AI as deflection—I don’t have to deal with my own fear/boundaries/control issues/blaming because AI will make programmers so efficient that there will be no conflicts to trigger my shit.

Nope
November 17, 2024 at 6:29 PM
Reposted by Samhan
Untitled, 150 cm x 45 cm, acrylic on glass. These abstracts often start with a single gesture, in this case the sort of squid-looking figure on the left. The rest fills itself in around that.
November 18, 2024 at 12:02 AM
Reposted by Samhan
In case it's useful for you as well, Word2Vec converted in a sane binary format. huggingface.co/antirez/word...
antirez/word2vec-simple-parsing · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
huggingface.co
November 17, 2024 at 12:35 PM
Reposted by Samhan
Bluesky has never felt as active as it has in the last few days. It's great to be here!

(I'm here for software engineering and tech topics - both following these, and sharing more about them. I write The Pragmatic Engineer and host the podcast with the same name. E.g. a recent article)
November 17, 2024 at 2:16 PM
Reposted by Samhan
Nice, @theo.io made exactly what I was looking for. It works perfectly, found a ton of Bluesky people I didn't know were here!
Bluesky Network Analyzer
Find accounts that you don't follow (yet) but are followed by lots of accounts that you do follow.
bsky-follow-finder.theo.io
November 15, 2024 at 6:09 PM
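
The idea described in the Bluesky Network Analyzer card above (accounts you don't follow yet that many of your follows do follow) can be sketched in a few lines. This is only an illustration of the counting idea, not how the actual tool is implemented. The function name `suggest_accounts`, the `follows` mapping, and the `.example` handles are all hypothetical, and the data is a hand-made dictionary rather than anything fetched from Bluesky.

```python
from collections import Counter


def suggest_accounts(me, follows):
    """Rank accounts `me` does not follow by how many of `me`'s follows follow them.

    `follows` is a hypothetical mapping: handle -> set of handles it follows.
    """
    mine = follows.get(me, set())
    counts = Counter()
    for friend in mine:
        for candidate in follows.get(friend, set()):
            # Skip myself and anyone I already follow.
            if candidate != me and candidate not in mine:
                counts[candidate] += 1
    # Highest count first; ties broken alphabetically so the order is stable.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


# Made-up example graph.
follows = {
    "me.example": {"a.example", "b.example", "c.example"},
    "a.example": {"x.example", "y.example", "me.example"},
    "b.example": {"x.example", "c.example"},
    "c.example": {"x.example", "y.example"},
}

print(suggest_accounts("me.example", follows))
# [('x.example', 3), ('y.example', 2)]
```

A real version would presumably page through follow lists via the network's API and cap the number of accounts examined; both are left out here.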
Reposted by Samhan
what are good starter packs for: AI researchers, AI Systems people, GenAI hackers, LLM enthusiasts?
November 16, 2024 at 1:49 AM
Reposted by Samhan
What do you call a sheep who can sing and dance?

Lady Ba Ba.

(It’s not her fault. She was shorn that way.)
November 15, 2024 at 12:42 AM
Bluesky seems to be the first decentralised social network to truly take off. Just as the Bitcoin protocol proved influential, so will the AT Protocol.
November 15, 2024 at 4:15 PM
Reposted by Samhan
if only 16yo me could have known how often adult me would need to solve some graphics programming problem with the ditty my maths teacher Mr Pickles taught us

🎵 twiddly dum, twiddly dee
🎶 around the moon is pi times d
🎵 but if a hole you want repaired
🎶 then you must use pi r squared

#mrpickles
November 15, 2024 at 3:54 PM
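
For anyone who wants the ditty above in code form, here is a minimal sketch of the two formulas it encodes: circumference as pi times diameter, and area as pi times radius squared. The function names are made up for illustration.

```python
import math


def circumference(diameter):
    """'Around the moon is pi times d'."""
    return math.pi * diameter


def area(radius):
    """'Then you must use pi r squared'."""
    return math.pi * radius ** 2


# A circle of radius 1 has diameter 2.
print(circumference(2.0))
print(area(1.0))
```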