Kayvane
kayvane.bsky.social
Kayvane
@kayvane.bsky.social
ML Engineer
Deploying vLLM’s openai compatible endpoint (you interact with it in the same way as you would the openai API) is a single line of code! You can set it up on serverless GPUs on Modal or another cloud provider. OSS AI in 5 lines of code. Experiment w/ Sampling Params, beam search, structured gen etc.
March 30, 2025 at 11:31 AM
C’est vrai que ecogiquement, cette branche de progres pourra cree exponentiellement plus de problemes que les modèles avant la série O-n puisqu’on utilise bcp plus d’electricité a chaque fois qu’on utilise le model 😳.
December 22, 2024 at 10:43 AM
Are you also working on a 7-8b model 👀👀
December 18, 2024 at 8:28 AM
Llama 3.1 8b right? I thought I missed a drop for second 🙃

Congrats on Pleias results!
December 18, 2024 at 7:56 AM
I’ve always wondered what programme you make these on?
November 28, 2024 at 7:48 AM
Would you use it for code documentation, is it easy to deploy to github pages?
November 24, 2024 at 4:22 PM
Spacy always write the most intuitive code, it always looks so elegant and easy to build with. Excited to give this a go 🙌🏼
November 24, 2024 at 3:49 PM
So if you work in a startup, complete all goals for the year in the first 6 weeks… Sounds about right 🥲
November 16, 2024 at 9:38 AM