Lightnews — Scholar-powered news

Reposted by Adi Mukherjee

"fart"

@jonhendren.com

fuckin cool

Bryan Boatman @bryanboatman.bsky.social · Jan 9

Direct hit on the Sunset Fire in Hollywood. 🎯

Nice work, @lafd.bsky.social #LAFD c/o @nbcla.bsky.social 👨🏼‍🚒

January 9, 2025 at 4:54 AM

Adi Mukherjee

@adim.in

Really informative episode with SemiAnalysis’ Dylan Patel: share.snipd.com/episode/add3...

AI Semiconductor Landscape feat. Dylan Patel | BG2 w/ Bill Gurley & Brad Gerstner

share.snipd.com

January 8, 2025 at 5:42 AM

Adi Mukherjee

@adim.in

Interesting video about building isochromic maps: youtu.be/rC2VQ-oyDG0?...

I made maps that show time instead of space

YouTube video by Václav Volhejn

youtu.be

January 2, 2025 at 1:10 AM

Reposted by Adi Mukherjee

Ryan Moulton

@moultano.bsky.social

All of Randall Munroe's books are GOAT for kids' non-fiction.

December 26, 2024 at 5:10 PM

Adi Mukherjee

@adim.in

Great blog covering the progress this year.
“Asking o1 to complete proofs in creative ways is effectively asking it to be a research colleague. The model doesn't have to get proofs right to be useful, it just has to help us be better researchers.”
Good example of utility that evals fail to capture.

Ethan Mollick @emollick.bsky.social · Dec 22

Bluesky can be a fraught place to post about AI but it is worth noting that the buzz over o1 (& now o3) is not “hype.” We know o1 can actually do some very hard tasks (see my post) & o3 appears to represent a big further leap.

They aren’t AGI, but will matter. www.oneusefulthing.org/p/what-just-...

What just happened

A transformative month rewrites the capabilities of AI

www.oneusefulthing.org

December 25, 2024 at 1:27 AM

Reposted by Adi Mukherjee

Ethan Mollick

@emollick.bsky.social

Benchmarks are flawed but a way to trace AI over the last year is GPQA Diamond. This is a Google-proof question set that experts get 81% right in their fields & highly skilled non-experts with 30 minutes per question and Google use get 22%

GPT-4 got 37% at the start of 2024. o1 got 78%. o3 is 87.7%

December 24, 2024 at 10:58 AM

Reposted by Adi Mukherjee

Justin Cormack

@justincormack.bsky.social

Tools for your LLM in containers? Yes please! www.docker.com/blog/the-mod...

The Model Context Protocol: Simplifying Building AI apps with Anthropic Claude Desktop and Docker

Discover how the Model Context Protocol (MCP) simplifies building AI applications by seamlessly integrating Anthropic Claude with Docker Desktop, enhancing developer productivity and workflow efficien...

www.docker.com

December 24, 2024 at 11:03 AM

Reposted by Adi Mukherjee

Simon Willison

@simonwillison.net

I wish people would post more links to interesting things

I feel like Twitter and LinkedIn and Instagram and TikTok have pushed a lot of people out of the habit of doing that, by penalizing shared links in the various "algorithms"

Bluesky doesn't have that misfeature, thankfully!

December 22, 2024 at 12:40 AM

Adi Mukherjee

@adim.in

Comparing NotebookLM audio overviews to @elevenlabsio.bsky.social’s GenFM podcasts: I’m still blown away by the naturalness of NotebookLM’s conversation, but prefer GenFM’s level of detail, even though it’s a more stilted conversation

December 22, 2024 at 9:29 AM

Adi Mukherjee

@adim.in

OpenAI released its 2nd gen reasoning model, o3 (yeah, even they admitted they suck at names).
The evals are perhaps the final nail in the coffin for the scaling wall hypothesis, showing that AI models aren’t hitting a plateau in capabilities.
arcprize.org/blog/oai-o3-...

OpenAI o3 Breakthrough High Score on ARC-AGI-Pub

OpenAI o3 scores 75.7% on ARC-AGI public leaderboard.

arcprize.org

December 22, 2024 at 7:24 AM

Adi Mukherjee

@adim.in

Lots of apps have had text-to-speech for years, but ElevenLabs voices really stand out to me for naturalness of enunciation. I use it a lot for listening to articles.
elevenlabs.io/blog/introdu...

ElevenLabs — Introducing the ElevenLabs Reader App | ElevenLabs

The ElevenLabs Reader App lets you listen to any text content, with ElevenLabs voices, on the go

elevenlabs.io

December 22, 2024 at 7:15 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news