andrewperrault.bsky.social
andrewperrault.bsky.social
@andrewperrault.bsky.social
Reposted by andrewperrault.bsky.social
Should LLMs be used to review papers? AAAI is piloting LLM-generated reviews this year. I wrote a blog post arguing that using LLMs as reviewers can have bad downstream consequences for science by centralizing judgments about what constitutes good research.

bryanwilder.github.io/files/llmrev...
Equilibrium effects of LLM reviewing
Equilibrium effects of LLM reviewing
bryanwilder.github.io
May 26, 2025 at 6:20 PM
Reposted by andrewperrault.bsky.social
Steering language models by directly intervening on internal activations is appealing–but does it generalize?

We study 3 popular steering methods with 36 models from 14 families (1.5-70B), exposing brittle performance and fundamental flaws in underlying assumptions
🧵👇
(1/10)
April 8, 2025 at 11:34 AM