radar.dk/artikel/flyv...
radar.dk/artikel/flyv...
Benchmarking with vLLM against Llama 3.1 8B for long contexts shows:
🔹 2.5x throughput improvement
🔹 2x lower latency
Repo: github.com/foundation-m...
Benchmarking with vLLM against Llama 3.1 8B for long contexts shows:
🔹 2.5x throughput improvement
🔹 2x lower latency
Repo: github.com/foundation-m...
There are great alternatives like Unsloth, Axolotl, and AutoTrain. But if you want a daily drive that does experimentation to production, it's TRL.
🧵 these community notebooks guide you through TRL's core:
There are great alternatives like Unsloth, Axolotl, and AutoTrain. But if you want a daily drive that does experimentation to production, it's TRL.
🧵 these community notebooks guide you through TRL's core:
Qwen QwQ 32B is in preview, does reasoning like o1 and Deepseek R1 but y'know, on your local machine!
🔗 Demo: https://thursdai.news/qwq-32b-prev
🌐 Model: https://thursdai.news/O9d9hi0
📃 Blog: https://thursdai.news/qwq-32b-pre
Qwen QwQ 32B is in preview, does reasoning like o1 and Deepseek R1 but y'know, on your local machine!
🔗 Demo: https://thursdai.news/qwq-32b-prev
🌐 Model: https://thursdai.news/O9d9hi0
📃 Blog: https://thursdai.news/qwq-32b-pre
#leadership
#leadership
Don't worry, I will be back to covering the normal topics and the war in Ukraine next time.
Don't worry, I will be back to covering the normal topics and the war in Ukraine next time.