Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. 🥇
Now integrating thinking capabilities, 2.5 Pro Experimental is our most performant Gemini model yet. It’s #1 on the LM Arena leaderboard. 🥇
by @talschuster.bsky.social et al.
Converts pre-trained transformers to a more efficient version by turning blocks of layers into a single layer which is iterated. Lots of interesting tricks!
arxiv.org/abs/2410.20672
by @talschuster.bsky.social et al.
Converts pre-trained transformers to a more efficient version by turning blocks of layers into a single layer which is iterated. Lots of interesting tricks!
arxiv.org/abs/2410.20672
Happy one year anniversary Gemini team!
Happy one year anniversary Gemini team!
I upgraded my llm-gemini plugin to support it and then got the best result yet for my "Generate an SVG of a pelican riding a bicycle" benchmark
simonwillison.net/2024/Dec/6/g...
I upgraded my llm-gemini plugin to support it and then got the best result yet for my "Generate an SVG of a pelican riding a bicycle" benchmark
simonwillison.net/2024/Dec/6/g...
I would like to view:
70% new ML and LLM research and cool results.
10% funny videos with cute animals.
5% sports (but no spoilers if I'm planning to watch the reply later).
5% travel and life hacks.
5% general tech.
5% random.
Regards,
I would like to view:
70% new ML and LLM research and cool results.
10% funny videos with cute animals.
5% sports (but no spoilers if I'm planning to watch the reply later).
5% travel and life hacks.
5% general tech.
5% random.
Regards,