PhD student @ CMU with Zico Kolter and Zack Lipton | Founding Member @datologyai.com | Prev. Comp Sc @iitdelhi
http://pratyushmaini.github.io/
(I don't belong in this list, but I don't know how to remove myself from this pack 😂)
go.bsky.app/9APVxQQ
arxiv.org/abs/2411.18674
Smol models are all the rage these days & knowledge distillation (KD) is key for model compression!
We show how data curation can effectively distill to yield SoTA FLOP-efficient {C/Sig}LIPs!!
🧵👇
“I tested the idea we discussed last time. Here are some results. It does not work. (… awkward silence)”
Such conversations happen so often when meeting with students. How do we move forward?
You need …
We argue that this shift poses new risks including financial incentives & eval bias.
w/ @hbxnov.bsky.social
📝: pratyushmaini.github.io/blog/2024/ri... 🧵
arxiv.org/abs/2411.04118
Case in point: #EMNLP2024 ’s Best Paper Award.
I & @iamgroot42.bsky.social wrote a blog on what went wrong: www.anshumansuri.com/blog/2024/ca... 🧵
Techvember Ep 2: How we made the #1 LLM Pre-training Data Recipe.
Blog: 👉 tinyurl.com/best-llm-data 🧵