Adi Mukherjee
adim.in
Adi Mukherjee
@adim.in
Software Engineer, currently on sabbatical in Japan. Prev: Apple SRE. Working on something new.
Reposted by Adi Mukherjee
fuckin cool
Direct hit on the Sunset Fire in Hollywood. 🎯

Nice work, @lafd.bsky.social #LAFD c/o @nbcla.bsky.social 👨🏼‍🚒
January 9, 2025 at 4:54 AM
Really informative episode with SemiAnalysis’ Dylan Patel: share.snipd.com/episode/add3...
AI Semiconductor Landscape feat. Dylan Patel | BG2 w/ Bill Gurley & Brad Gerstner
AI Semiconductor Landscape feat. Dylan Patel | BG2 w/ Bill Gurley & Brad Gerstner
share.snipd.com
January 8, 2025 at 5:42 AM
Interesting video about building isochromic maps: youtu.be/rC2VQ-oyDG0?...
I made maps that show time instead of space
YouTube video by Václav Volhejn
youtu.be
January 2, 2025 at 1:10 AM
Reposted by Adi Mukherjee
All of Randall Munroe's books are GOAT for kids' non-fiction.
December 26, 2024 at 5:10 PM
Great blog covering the progress this year.
“Asking o1 to complete proofs in creative ways is effectively asking it to be a research colleague. The model doesn't have to get proofs right to be useful, it just has to help us be better researchers.”
Good example of utility that evals fail to capture.
Bluesky can be a fraught place to post about AI but it is worth noting that the buzz over o1 (& now o3) is not “hype.” We know o1 can actually do some very hard tasks (see my post) & o3 appears to represent a big further leap.

They aren’t AGI, but will matter. www.oneusefulthing.org/p/what-just-...
What just happened
A transformative month rewrites the capabilities of AI
www.oneusefulthing.org
December 25, 2024 at 1:27 AM
Reposted by Adi Mukherjee
Benchmarks are flawed but a way to trace AI over the last year is GPQA Diamond. This is a Google-proof question set that experts get 81% right in their fields & highly skilled non-experts with 30 minutes per question and Google use get 22%

GPT-4 got 37% at the start of 2024. o1 got 78%. o3 is 87.7%
December 24, 2024 at 10:58 AM
Reposted by Adi Mukherjee
I wish people would post more links to interesting things

I feel like Twitter and LinkedIn and Instagram and TikTok have pushed a lot of people out of the habit of doing that, by penalizing shared links in the various "algorithms"

Bluesky doesn't have that misfeature, thankfully!
December 22, 2024 at 12:40 AM
Comparing NotebookLM audio overviews to @elevenlabsio.bsky.social’s GenFM podcasts: I’m still blown away by the naturalness of NotebookLM’s conversation, but prefer GenFM’s level of detail, even though it’s a more stilted conversation
December 22, 2024 at 9:29 AM
OpenAI released its 2nd gen reasoning model, o3 (yeah, even they admitted they suck at names).
The evals are perhaps the final nail in the coffin for the scaling wall hypothesis, showing that AI models aren’t hitting a plateau in capabilities.
arcprize.org/blog/oai-o3-...
OpenAI o3 Breakthrough High Score on ARC-AGI-Pub
OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.
arcprize.org
December 22, 2024 at 7:24 AM
Lots of apps have had text-to-speech for years, but ElevenLabs voices really stand out to me for naturalness of enunciation. I use it a lot for listening to articles.
elevenlabs.io/blog/introdu...
ElevenLabs — Introducing the ElevenLabs Reader App | ElevenLabs
The ElevenLabs Reader App lets you listen to any text content, with ElevenLabs voices, on the go
elevenlabs.io
December 22, 2024 at 7:15 AM