arxiv.org/abs/2406.16838
A higher-level summary of the key decisions along the way: scoping a problem, choosing a base model, picking an optimization algorithm, etc. (+ some thoughts on OpenAI's RL Finetuning).
https://buff.ly/3ZpY5IR
Look at the @neuripsconf.bsky.social tutorials in 2024!
neurips.cc/virtual/2024...
14 tutorials; 6 have "LLM" in the title; 4 more cover foundation models, with large NLP coverage. That's > 70% 😲
🧵>>
www.reddit.com/r/MachineLea...
As I'm building systems, the most common questions (and review comments) I get are about the LL(M)M I'm using and not the systems and the problems they're solving ..
youtu.be/vRTcE19M-KE?...
For instance, something as simple as self-repairs in the same utterance confuses the model .. hmm ..