Lightnews — Scholar-powered news

Sweta Karlekar

@swetakar.bsky.social

2.6K followers 1.2K following 31 posts

Machine learning PhD student @ Blei Lab in Columbia University

Working in mechanistic interpretability, NLP, causal inference, and probabilistic modeling!

Previously at Meta for ~3 years on the Bayesian Modeling & Generative AI teams.

🔗 www.sweta.dev

Sweta Karlekar

@swetakar.bsky.social

Sorry John, that isn’t my area of expertise!

November 25, 2024 at 12:44 AM

Sweta Karlekar

@swetakar.bsky.social

This is very interesting! Do you have any intuition as to whether or not this phenomenon happens only with very simple “reasoning” steps? Does relying on retrieval increase as you progress from simple math to more advanced prompts like GSM8K or adversarially designed prompts (like adding noise)?

November 24, 2024 at 4:29 PM

Sweta Karlekar

@swetakar.bsky.social

Learning doesn’t have to mean explicit weight changes; ICL can be viewed as temporary implicit finetuning (arxiv.org/abs/2212.10559) or as a “state” change to the model rather than a weight change, akin to how learning happens in fast RL vs. slow RL (www.cell.com/trends/cogni...).

Why Can GPT Learn In-Context? Language Models Implicitly Perform Gradient Descent as Meta-Optimizers

Large pretrained language models have shown surprising in-context learning (ICL) ability. With a few demonstration input-label pairs, they can predict the label for an unseen input without parameter u...

arxiv.org

November 22, 2024 at 10:31 PM

Sweta Karlekar

@swetakar.bsky.social

Hi! Our lab does Bayesian stuff :) Could you add Dave Blei's lab to this pack as well if it's not already full? @bleilab.bsky.social

November 20, 2024 at 3:38 PM

Sweta Karlekar

@swetakar.bsky.social

Could you add Dave Blei's lab to this pack as well if it's not already full? @bleilab.bsky.social

November 20, 2024 at 3:37 PM

Sweta Karlekar

@swetakar.bsky.social

Could you add Dave Blei's lab to this pack as well if it's not already full? @bleilab.bsky.social

November 20, 2024 at 3:36 PM

Sweta Karlekar

@swetakar.bsky.social

Could you add Dave Blei's lab to this pack as well if it's not already full? @bleilab.bsky.social

November 20, 2024 at 3:36 PM

Sweta Karlekar

@swetakar.bsky.social

Oh, I’ve been meaning to check out that YouTube series—thanks! Also sadly, there's no class website, but I can share the "super quick intro to mech interp" presentation I made. It’s somewhat rough, but hopefully, it gets the main points across! sweta.dev/files/intro_...

sweta.dev

November 20, 2024 at 3:08 PM

Reposted by Sweta Karlekar

Martyn Plummer

@martynplummer.bsky.social

📢 Post-Bayesian online seminar series coming!📢
To stay posted, sign up at
tinyurl.com/postBayes
We'll discuss cutting-edge methods for posteriors that no longer rely on Bayes' theorem.
(e.g., PAC-Bayes, generalised Bayes, Martingale posteriors, ...)
Pls circulate widely!