Independent ML researcher consulting on LMs + data.
Previously: Salesforce Research, MetaMind, CommonCrawl, Harvard. 🇦🇺 in SF. He/him.
Personal blog: https://state.smerity.com
"Five amazing secrets that gradient optimizers don't want you to know! 🧵 1/9"
bsky.app/profile/smer...
There's value and art in a tweet compressing long-form information. This can be done by anyone, not just the original author.
The feed becomes a high-level "skim reader", progressing to depth when interest is piqued.
I think that's as much protective delusion as practical defense.
news.ycombinator.com/item?id=4226...
A contentious issue regardless ¯\_(ツ)_/¯
I appreciated the bridge app, but it was definitely slow, had many false positives I had to filter manually, and won't catch those who move over after I migrated.