🤖 AI, LLMs, GenAI, NLP
🐍 Python Dev
🚀 Indie Hacker
🎮 Game Dev, ProcGen, Unity, C#
🏎️ F1 Fan
🇬🇧 UK Based
🦣 mastodonapp.uk/@StuartGray
✖️ x.com/StuartGray (inactive)
However, if you strongly disagree with a post to the point that you're unable to refrain from insults or rude or unthinking replies, then please save us both a lot of time and block me now, because I will block you.
cyberplace.social/@GossiTheDog...
From late May – seems to be, in part at least, another tedious consequence of the Online Safety Act github.com/mysociety/wh... + github.com/mysociety/al...
#FOI #openweb
www.damiencharlotin.com/hallucinatio...
The UK tech minister has said a VPN ban is on the table.
They are literally anti-algorithm now.
And yet unironically posting about it on an app only possible because of algorithms, on a software-powered device!
🤦
The main age-verification lobbyist — a man who largely believes porn should be outlawed — admitted the state-level bills he pushed for won't work and were really a predicate for federal action.
He wants the DOJ to seize domains.
🤯
(From Byron Tau’s Means of Control, which you should read)
#talkaboutSurveillanceCapitalism
President Trump has granted a pardon to a slew of Trump world figures, including Rudy Giuliani, Mark Meadows, and Sidney Powell, for their efforts to overturn the 2020 election.
We introduce Oolong, a dataset of simple-to-verify information aggregation questions over long inputs. No model achieves >50% accuracy at 128K on Oolong!
she's in marketing, and leverages AI in ridiculously powerful ways
i like her phrasing — "Ops" is the superpower, not AI. AI is merely the tool
www.appliedaiformops.com/p/why-ops-sk...
The "Penrose Effect" seems to be a real thing - hypothesised in the 1930s and re-tested in the last decade or so:
Where you reduce your inpatient psychiatric provision, you'll see a correlated rise, within 10 years, in the number of seriously mentally ill prisoners.
But giving them access to an LLM for guidance significantly closes the gap. mgcuna.github.io/website/JMP_...
We split MMLU in two parts (leaked/clean) and show that almost all models tend to perform better on leaked samples
These websites can then be found in CommonCrawl dumps that are generally used for pretraining data curation...
For instance, the fraction of MMLU questions that are leaked in pretraining has gone from ~1% to 24% between OLMo-1 and 2 😬
We trained 3 models - 1.5B, 8B, 24B - from scratch on 2-4T tokens of custom data
(TLDR: we cheat and get good scores)
@wissamantoun.bsky.social @rachelbawden.bsky.social @bensagot.bsky.social @zehavoc.bsky.social