DARK lab
ucldark.com
DARK lab
@ucldark.com
UCL Deciding, Acting, and Reasoning with Knowledge (DARK) Lab at UCL led by @rockt.ai, @egrefen.bsky.social,
@jparkerholder.bsky.social, and Roberta Raileanu.

ucldark.com
Reposted by DARK lab
@ucl-dark.bsky.social entered the stage! Thanks @lauraruis.bsky.social :)
November 25, 2024 at 2:36 PM
Check out Tim's start pack for Open-Endedness on Bluesky!
Now that @jeffclune.bsky.social and @joelbot3000.bsky.social are here, time for an Open-Endedness starter pack.

go.bsky.app/MdVxrtD
November 25, 2024 at 2:16 PM
Reposted by DARK lab
How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this:

Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢

🧵⬇️
November 20, 2024 at 4:35 PM
Reposted by DARK lab
The LLM parrot analogy is dead. Fantastic work by UCL DARK's @lauraruis.bsky.social on rigorously investigating whether LLMs learn reasoning from procedural knowledge during pretraining.
How do LLMs learn to reason from data? Are they ~retrieving the answers from parametric knowledge🦜? In our new preprint, we look at the pretraining data and find evidence against this:

Procedural knowledge in pretraining drives LLM reasoning ⚙️🔢

🧵⬇️
November 22, 2024 at 11:12 AM
Reposted by DARK lab
Excited to announce "BALROG: a Benchmark for Agentic LLM and VLM Reasoning On Games" led b UCL DARK's @dpaglieri.bsky.social! Douwe Kiela plot below is maybe the scariest for AI progress — LLM benchmarks are saturating at an accelerating rate. BALROG to the rescue. This will keep us busy for years.
November 22, 2024 at 11:27 AM