Lightnews — Scholar-powered news

Cem Anil

@cemanil.bsky.social

1.7K followers 150 following 0 posts

Research scientist at Anthropic.
PhD in machine learning from the University of Toronto and Vector Institute.
Prev: NVIDIA, Google

Posts Replies Media Videos

Reposted by Cem Anil

Sander Dieleman

@sedielem.bsky.social

Here's another oldie from 2020: an entire blog post about the weirdness of high-dimensional probability distributions.

With generative modelling having taken off as it has, understanding typicality and its implications is all the more important, but it still isn't talked about much in ML circles!

Musings on typicality

A summary of my current thoughts on typicality, and its relevance to likelihood-based generative models.

sander.ai

November 22, 2024 at 12:25 AM

Reposted by Cem Anil

Laura

@lauraruis.bsky.social

How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this:

Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢

🧵⬇️

November 20, 2024 at 4:35 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news