Web: https://mlcollective.org/
Twitter: @ml_collective
present his paper at DLCT!
The Geometry of Self-Verification in a Task-Specific Reasoning Model
arxiv.org/abs/2504.14379
Zoom below 👇
present his paper at DLCT!
The Geometry of Self-Verification in a Task-Specific Reasoning Model
arxiv.org/abs/2504.14379
Zoom below 👇
@nsaphra.bsky.social presents at DLCT:
Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs
Angelica Chen, Ravid Shwartz-Ziv, Kyunghyun Cho, Matthew L. Leavitt, Naomi Saphra
arxiv.org/abs/2309.07311
Zoom 👇
@nsaphra.bsky.social presents at DLCT:
Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs
Angelica Chen, Ravid Shwartz-Ziv, Kyunghyun Cho, Matthew L. Leavitt, Naomi Saphra
arxiv.org/abs/2309.07311
Zoom 👇
Paper: transformer-circuits.pub/2025/attribu...
Come for the chain of thought, stay for the rabbits and habbits.
Zoom info below 👇
Paper: transformer-circuits.pub/2025/attribu...
Come for the chain of thought, stay for the rabbits and habbits.
Zoom info below 👇
Zoom info at mlcollective.org/events/resea...
Zoom info at mlcollective.org/events/resea...
Up first: Subhash Kantamneni will show how LLMs represent numbers on a helix and use this representation to add!
Join Friday at 10am PT, zoom/etc here: mlcollective.org/dlct/
Up first: Subhash Kantamneni will show how LLMs represent numbers on a helix and use this representation to add!
Join Friday at 10am PT, zoom/etc here: mlcollective.org/dlct/
Asynchronous Perception Machine for Efficient Test Time Training
Rajat Modi · Yogesh Rawat
West Ballroom A-D #6304
Asynchronous Perception Machine for Efficient Test Time Training
Rajat Modi · Yogesh Rawat
West Ballroom A-D #6304