anpaure
@anpau.re
research
How does the new Qwen model compare to other LLMs on coding tasks?
It's impressive, but rushed.
I ran it against other SOTA models on 6 competitive programming problems of varying difficulties.
Here are the results!
November 28, 2024 at 1:57 PM
when her replies get shorter and colder
November 26, 2024 at 3:27 PM
How smart is the new DeepSeek model at coding problems?
Almost o1 level actually.
Today I sat down and ran a couple of competitive programming problems of varying difficulty on leading LLMs, like o1, 4o, Sonnet 3.6 and DeepSeek R1.
These are the preliminary results on 6 problems!
November 21, 2024 at 4:24 PM
linguistics question here: is there a minimal basis of words that is sufficient to define all other words? how many words are enough?
November 19, 2024 at 11:37 PM
November 18, 2024 at 8:54 AM