http://mubasharaakhtar.com
Come by our poster in West Ballroom #5706 about the @mlcommons.org Croissant metadata format.
Paper: arxiv.org/pdf/2403.19546
Croissant format: github.com/mlcommons/cr...
Come by our poster in West Ballroom #5706 about the @mlcommons.org Croissant metadata format.
Paper: arxiv.org/pdf/2403.19546
Croissant format: github.com/mlcommons/cr...
Outperforms all models at similar GPU RAM usage and tokens throughputs
Blog post: huggingface.co/blog/smolvlm
Outperforms all models at similar GPU RAM usage and tokens throughputs
Blog post: huggingface.co/blog/smolvlm
We invented new methods for fine-tuning language models with RL and built upon best practices to scale synthetic instruction and preference data.
Demo, GitHub, paper, and models 👇
We invented new methods for fine-tuning language models with RL and built upon best practices to scale synthetic instruction and preference data.
Demo, GitHub, paper, and models 👇
go.bsky.app/TENRRBb
go.bsky.app/TENRRBb
go.bsky.app/JgneRQk
go.bsky.app/JgneRQk
Would love to hear any other tips if you have them!
This proved very popular on another (more evil) social media platform, so sharing here also 🙂
My 10 tips:
Would love to hear any other tips if you have them!
This proved very popular on another (more evil) social media platform, so sharing here also 🙂
My 10 tips:
bsky.app/starter-pack...
bsky.app/starter-pack...