Sondre Wold
erdnosw.bsky.social
Sondre Wold
@erdnosw.bsky.social
PhD candidate in ML/NLP at the University of Oslo, researching compositionality and generalization in language models.
Yeah, I see a bunch of stuff submitted to a range of conferences now, seems like the feed is including everything from everywhere.
March 5, 2025 at 7:01 PM
I'm also added. I also see deanonymized submissions to ARR in my Activities tab, I think. Maybe a big error in OpenReview?
March 5, 2025 at 6:56 PM
Slike tester er først og fremst markedsføring og er ofte tuklet med. Det finnes flere eksempler på oppgaver som er banale for folk, men hvor dagens modeller ikke får til stort, som f.eks den nylig avsluttede ARC AGI konurransen, hvor o1 får ~21% , v.s 85% for mennesker. arcprize.org/blog/openai-...
OpenAI o1 Results on ARC-AGI-Pub
How far are the o1 preview and mini models from AGI?
arcprize.org
November 20, 2024 at 7:46 PM