Shamik Bose
shamikbose.bsky.social
Shamik Bose
@shamikbose.bsky.social
AI Researcher working on making black-box systems less opaque, in order to give a better life to my future dogs. He/him
If previous Phi models are anything to go by, they will be great at benchmarks, but not for real applications. There are probably better alternatives for production ready SLMs
December 13, 2024 at 3:26 PM
Completely agreed. I would argue that they are an incredibly tangled box of wires that is open, but really hard to make sense of. Incidentally, I was also just reading a Mech Interp paper by your team which demonstrates just how difficult this is to do
December 10, 2024 at 5:53 PM
Once, it was being renovated. Two signs kept making me (and a bunch of other people) go round in circles. I finally had to ask the staff at duty free after getting thoroughly confused.
December 3, 2024 at 11:36 PM
Have to disagree. It’s CDG and it’s not even a competition
December 3, 2024 at 11:33 PM
Wonder if this is related to their sudden site today 🤔
December 3, 2024 at 2:48 PM
There seems to be a very strong anti-AI bias among this platform’s users. They say AI, but what they actually mean is LLMs
December 2, 2024 at 8:39 AM
There was Inspect, by the AI Safety Institute github.com/UKGovernment... and there was also LangSmith docs.smith.langchain.com
GitHub - UKGovernmentBEIS/inspect_ai: Inspect: A framework for large language model evaluations
Inspect: A framework for large language model evaluations - UKGovernmentBEIS/inspect_ai
github.com
November 18, 2024 at 2:03 PM
LMStudio has been pretty decent for me. However, not all models are supported by the mlx community yet. It’s growing fast, though
November 15, 2024 at 7:43 PM