Lightnews — Scholar-powered news

Alex Makelov

@amakelov.bsky.social

Mechanistic interpretability
Creator of https://github.com/amakelov/mandala
prev. Harvard/MIT
machine learning, theoretical computer science, competition math.

Posts Replies Media Videos

Reposted by Alex Makelov

David Bau

@davidbau.bsky.social

Today we launch a new open research community

It is called ARBOR:
arborproject.github.io/

please join us.
bsky.app/profile/ajy...

February 20, 2025 at 10:15 PM

Reposted by Alex Makelov

Martin Wattenberg

@wattenberg.bsky.social

The math benchmarks I want:
1. OopsBench: given a faulty proof with numbered steps, which step contains an unfixable logical flaw?
2. DunnoMath: half the problems are taken from FrontierMath, half are almost certainly unsolvable. Major points off for guessing an answer to an unsolvable problem.

December 23, 2024 at 2:21 PM

Reposted by Alex Makelov

NDIF Team

@ndif-team.bsky.social

Large language models show fascinating changes in capability with scaling parameters, but scaling also vastly increases the resources required for experimentation on model internals. NDIF is currently hosting the largest open sourced model, Llama 405b, for YOU to run research on!

December 20, 2024 at 7:49 PM

Alex Makelov

@amakelov.bsky.social

Talk is cheap. Show me the CoT

December 20, 2024 at 7:46 PM

Alex Makelov

@amakelov.bsky.social

SaaS (Santa as a Service)

December 12, 2024 at 6:18 PM

Alex Makelov

@amakelov.bsky.social

Some fun with o1 from OpenAI: there's a math problem I often give to "reasoning" AIs to try them out. It's basically to prove that there's a number less than 1 billion that you can write in 1000 different ways as a sum of 3 squares (precise statement in the pic).

December 5, 2024 at 9:15 PM

Alex Makelov

@amakelov.bsky.social

yes, this is what mechanistic interpretability research looks like

Cat sitting on a chair in front of a parked black car with its rear wheel removed and a hydraulic jack supporting it

November 24, 2024 at 7:51 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news