Lightnews — Scholar-powered news

Sondre Wold

@erdnosw.bsky.social

27 followers 76 following 3 posts

PhD candidate in ML/NLP at the University of Oslo, researching compositionality and generalization in language models.

Posts Replies Media Videos

Sondre Wold

@erdnosw.bsky.social

Yeah, I see a bunch of stuff submitted to a range of conferences now, seems like the feed is including everything from everywhere.

March 5, 2025 at 7:01 PM

Sondre Wold

@erdnosw.bsky.social

I'm also added. I also see deanonymized submissions to ARR in my Activities tab, I think. Maybe a big error in OpenReview?

March 5, 2025 at 6:56 PM

Sondre Wold

@erdnosw.bsky.social

Slike tester er først og fremst markedsføring og er ofte tuklet med. Det finnes flere eksempler på oppgaver som er banale for folk, men hvor dagens modeller ikke får til stort, som f.eks den nylig avsluttede ARC AGI konurransen, hvor o1 får ~21% , v.s 85% for mennesker. arcprize.org/blog/openai-...

OpenAI o1 Results on ARC-AGI-Pub

How far are the o1 preview and mini models from AGI?

arcprize.org

November 20, 2024 at 7:46 PM

Add to Home Screen

Light up
your news

Add to Home Screen

Light upyour news

Sign in to Lightnews

Sign up to start reading

Connect Bluesky

Connect with Bluesky

Light up
your news