Strategic Hypothesis Testing
by
@yatongchen.bsky.social
Watch here: youtu.be/VcKpRuUi4cQ
Strategic Hypothesis Testing
by
@yatongchen.bsky.social
Watch here: youtu.be/VcKpRuUi4cQ
These positions offer the exciting possibility of co-appointments with the @mpi-is.bsky.social and the @tuebingen-ai.bsky.social .
📌 Apply here: institute-tue.ellis.eu/en/jobs/PI-c...
These positions offer the exciting possibility of co-appointments with the @mpi-is.bsky.social and the @tuebingen-ai.bsky.social .
📌 Apply here: institute-tue.ellis.eu/en/jobs/PI-c...
The Curse of Depth in Large Language Models
by
@shiweiliu.bsky.social
Watch here: youtu.be/knVOH3oM_-I
The Curse of Depth in Large Language Models
by
@shiweiliu.bsky.social
Watch here: youtu.be/knVOH3oM_-I
I conned somebody into giving me a faculty job!
I’m starting as a W1 Tenure-Track Professor at Goethe University Frankfurt in a week (lol), in the Faculty of CS and Math
and I'm recruiting PhD students 🤗
I conned somebody into giving me a faculty job!
I’m starting as a W1 Tenure-Track Professor at Goethe University Frankfurt in a week (lol), in the Faculty of CS and Math
and I'm recruiting PhD students 🤗
AI Safety and Alignment
by
@maksym-andr.bsky.social
Watch here: youtu.be/7WRW8MDQ8bk
AI Safety and Alignment
by
@maksym-andr.bsky.social
Watch here: youtu.be/7WRW8MDQ8bk
Why LLM Benchmarks are Broken and How to Fix It?
by
Guanhua Zhang
Watch here: youtu.be/X820NwnHu-c
Why LLM Benchmarks are Broken and How to Fix It?
by
Guanhua Zhang
Watch here: youtu.be/X820NwnHu-c
arxiv.org/abs/2508.21620
I've written a self-contained introductory monograph on this!
arxiv.org/abs/2508.21620
I've written a self-contained introductory monograph on this!
How much can we forget about Data Contamination?
by
@sbordt.bsky.social
Watch here: youtu.be/T9Y5-rngOLg
How much can we forget about Data Contamination?
by
@sbordt.bsky.social
Watch here: youtu.be/T9Y5-rngOLg
Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics
by
@chs20.bsky.social & Naman Deep Singh
Watch here: www.youtube.com/watch?v=sK9Y...
Adversarially Robust CLIP Models Can Induce Better (Robust) Perceptual Metrics
by
@chs20.bsky.social & Naman Deep Singh
Watch here: www.youtube.com/watch?v=sK9Y...
Large Language Models Are Zero-Shot Problem Solvers—Just Like Modern Computers
by
Tim Z. Xiao
Watch here: www.youtube.com/watch?v=ySHu...
Large Language Models Are Zero-Shot Problem Solvers—Just Like Modern Computers
by
Tim Z. Xiao
Watch here: www.youtube.com/watch?v=ySHu...
Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models
by
@vetterj.bsky.social & Manuel Gloeckler from @mackelab.bsky.social
Watch here: youtube.com/watch?v=Wx2p...
Effortless, Simulation-Efficient Bayesian Inference using Tabular Foundation Models
by
@vetterj.bsky.social & Manuel Gloeckler from @mackelab.bsky.social
Watch here: youtube.com/watch?v=Wx2p...
www.rnd.de/politik/bund...
A short thread 🧵
In RNNs with N units with ReLU(x-b) activations the phase space is partioned in 2^N regions by hyperplanes at x=b 1/7
A short thread 🧵
In RNNs with N units with ReLU(x-b) activations the phase space is partioned in 2^N regions by hyperplanes at x=b 1/7
Paper: openreview.net/forum?id=0cg...
Code: github.com/mackelab/sou...
(1/8)
Paper: openreview.net/forum?id=0cg...
Code: github.com/mackelab/sou...
(1/8)
📃 tinyurl.com/22rvzc4f
💻 github.com/AKuzina/dvp_...
📃 tinyurl.com/22rvzc4f
💻 github.com/AKuzina/dvp_...