Lightnews — Scholar-powered news

Will Kurt

@willkurt.bsky.social

"The idea of an environment scarcely makes any sense since you can never draw a boundary line that would distinguish an organism from what surrounds it." - Bruno Latour

Posts Replies Media Videos

Will Kurt

@willkurt.bsky.social

The current messiness around LLM evaluations is ultimately caught up in the limits of working under conditions of pure empericism.

We’ll never dig ourselves entirely out of this hole until theory starts to catch up with practice.

Paper after paper overreaches and attempts impossible general claims

November 25, 2024 at 7:14 AM

Reposted by Will Kurt

saganite.bsky.social

@saganite.bsky.social

LLM observation of the day: I think that guided/constrained generation gets a bad rap. There was one paper making the rounds about how guided generation harms reasoning ability that everyone took as gospel.

November 13, 2024 at 11:06 PM

Reposted by Will Kurt

.txt

@dottxtai.bsky.social

A new paper, "Let Me Speak Freely" has been spreading rumors that structured generation hurts LLM evaluation performance.

Well, we've taken a look and found serious issue in this paper, and shown, once again, that structured generation *improves* evaluation performance!

November 21, 2024 at 6:33 PM

Reposted by Will Kurt

.txt

@dottxtai.bsky.social

Our new blog post is out!

@willkurt.bsky.social provides a rebuttal for a reasonably well known paper which concluded that structured generation with LLMs always resulted in worse performance.

We do not find the same thing.

blog.dottxt.co/say-what-you...

A graph showing that structured generation performs better than unstructured generation.

November 21, 2024 at 6:23 PM

Will Kurt

@willkurt.bsky.social

First post! Created this account awhile ago, but things seem to be picking up and it has a very nice "old Twitter" feel to it here!

November 12, 2024 at 7:59 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news