Lightnews — Scholar-powered news

Lei M. Zhang

@l32zhang.bsky.social

1.9K followers 490 following 3 posts

Research Scientist @ Google DeepMind. Previously @ OpenAI. Building AGI. 🤖

Posts Replies Media Videos

Reposted by Lei M. Zhang

Seamus Blackley

@seamus.bsky.social

Reader, I do not know the demons that haunt your dreams, the frustrations and insults of life that have brought you to your knees, the sadness you battle in your heart.

But know this: there is pure goodness in this world. It exists. Sometime we see a glimpse of it.

Today, I got such a peek.

The lucky author holds a single, roasted cacao bean with 1/2 the shell removed so we can see the pure chocolate inside. Behind his hand is a roasting tray filled with beans.

The smell. THE SMELL.

This photo shows an aluminum roasting tray filled with a layer of cacao beans. When roasting, the aroma of the fermentation suddenly gives way to the overwhelming joy of fresh brownies. Yes, there is a spirit watching over us all.

November 17, 2024 at 5:45 PM

Lei M. Zhang

@l32zhang.bsky.social

We tried something similar in adaptive agents arxiv.org/abs/2301.07608 (multiturn adaptation + RL). This was with transformers but not LLMs. Someone should try this with LLMs!

Human-Timescale Adaptation in an Open-Ended Task Space

Foundation models have shown impressive adaptation and scalability in supervised and self-supervised learning problems, but so far these successes have not fully translated to reinforcement learning (...

arxiv.org

November 21, 2024 at 2:20 AM

Reposted by Lei M. Zhang

kyunghyuncho.bsky.social

@kyunghyuncho.bsky.social

now, if we think of p(output | prompt, a few examples) as a predictive distribution p(y|x, D) ... it looks very much like learning to me :)

see e.g. my slide deck on drive.google.com/file/d/1B-Ka...

November 21, 2024 at 12:43 AM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news