More at https://maciej.gryka.net/
you can run them locally (1B!) and they work as well as much bigger generic models; SLMs FTW!
you can run them locally (1B!) and they work as well as much bigger generic models; SLMs FTW!
Come say hi if Elixir and/or AI is your jam. I'll show how to use distillation (featuring distil labs ofc) to make exmeralda.chat run on a smaller model.
#myelixirstatus
Come say hi if Elixir and/or AI is your jam. I'll show how to use distillation (featuring distil labs ofc) to make exmeralda.chat run on a smaller model.
#myelixirstatus
As a fun demo, we trained a model to help you remember git commands and it's only 3B so you can run it locally!
www.distillabs.ai/blog/gitara-...
As a fun demo, we trained a model to help you remember git commands and it's only 3B so you can run it locally!
www.distillabs.ai/blog/gitara-...
📅 Hope to see you there! 💜
#ElixirLang
www.meetup.com/elixir-berli...
📅 Hope to see you there! 💜
#ElixirLang
www.meetup.com/elixir-berli...
simonwillison.net/2025/Jun/23/...
simonwillison.net/2025/Jun/23/...
May 26th 10am: x.com/xuandongzhao... "Learning to Reason without External Rewards: LLMs can learn complex reasoning without access to ground-truth answers, simply by optimizing their own internal sense of confidence."
May 26th 10am: x.com/xuandongzhao... "Learning to Reason without External Rewards: LLMs can learn complex reasoning without access to ground-truth answers, simply by optimizing their own internal sense of confidence."
My first thought was "ROBOT FIGHT", but actually they're cordial and take each other's feedback well.
My first thought was "ROBOT FIGHT", but actually they're cordial and take each other's feedback well.
I'm not sure exactly why that is, but somehow it feels more reliable/trustworthy?
I'm not sure exactly why that is, but somehow it feels more reliable/trustworthy?
Claude Code needs guidance from a senior domain expert to work well IMO:
- Accidentally proposed an n^2 implementation
- Didn't realize it was correct about something else
- Made a very bad assumption I had to correct
- Corrective plan doubled memory use
It's good. Implemented a real feature, tests, benchmarks, understood memory sensitivity too. Roughly how I would have written this myself.
NOTE: this has not undergone code review, likely missing some subtlety. But a great first step
github.com/honeycombio/...
nc3rs.org.uk/3rs-resource...
The actual story of the shackled people being deported without even being told where they were being taken — maybe to countries where they’d face torture or death — while their terrified children cried.😢
The White House posted that pic with “ASMR” tag.
www.nytimes.com/2025/02/18/w...
The actual story of the shackled people being deported without even being told where they were being taken — maybe to countries where they’d face torture or death — while their terrified children cried.😢
The White House posted that pic with “ASMR” tag.
www.nytimes.com/2025/02/18/w...