here's a graph of requests / second to WMF infra over the last 3h, since "Habemus papam"
The infrastructure has gone from 172k req / sec to 243k req / sec (⬆️41%) in under an hour!
follow along here: grafana.wikimedia.org/d/O_OXJyTVk/...
here's a graph of requests / second to WMF infra over the last 3h, since "Habemus papam"
The infrastructure has gone from 172k req / sec to 243k req / sec (⬆️41%) in under an hour!
follow along here: grafana.wikimedia.org/d/O_OXJyTVk/...
first big spike is the academy awards, second is pope francis’ death
pageviews.wmcloud.org?project=en.w...
first big spike is the academy awards, second is pope francis’ death
pageviews.wmcloud.org?project=en.w...
really fascinating line of critique — I strongly encourage you to read it and lmk what you think!
joinreboot.org/p/ai-alignme...
really fascinating line of critique — I strongly encourage you to read it and lmk what you think!
joinreboot.org/p/ai-alignme...
arxiv.org/abs/2503.12188
12/12
arxiv.org/abs/2503.12188
12/12
10/12
10/12
… executes code that they recognize as harmful
… automatically pivots to harmful tasks that are simply in the same directory as benign tasks
… is vulnerable to screenshots and even audio files where we read out the attack (see example below⬇️⬇️⬇️)
7/12
… executes code that they recognize as harmful
… automatically pivots to harmful tasks that are simply in the same directory as benign tasks
… is vulnerable to screenshots and even audio files where we read out the attack (see example below⬇️⬇️⬇️)
7/12
… across multiple agent frameworks (we tested AutoGen, MetaGPT, Crew AI), orchestrators, and LLMs
… even when direct and indirect prompt injection attacks don’t work
… even when individual agents are “aligned” and refuse to take harmful actions
6/12
… across multiple agent frameworks (we tested AutoGen, MetaGPT, Crew AI), orchestrators, and LLMs
… even when direct and indirect prompt injection attacks don’t work
… even when individual agents are “aligned” and refuse to take harmful actions
6/12
5/12
5/12
4/12
4/12
“Multi-Agent Systems Execute Arbitrary Malicious Code”
arxiv.org/abs/2503.12188
1/12
“Multi-Agent Systems Execute Arbitrary Malicious Code”
arxiv.org/abs/2503.12188
1/12
i'm editing a project on locations and want to hear from YOU (<5 min)
forms.gle/iG1UZJKrcNwm...
i'm editing a project on locations and want to hear from YOU (<5 min)
forms.gle/iG1UZJKrcNwm...
from on.ft.com/4fBSEwy
from on.ft.com/4fBSEwy
one thing I've been doing with this: trying to figure out where committees are spending on food.
for example, Steve Scalise has bought Chick-Fil-A 26 times this cycle, spending $18,700 in total
one thing I've been doing with this: trying to figure out where committees are spending on food.
for example, Steve Scalise has bought Chick-Fil-A 26 times this cycle, spending $18,700 in total
we've also set up a database with all of the relevant data so users can save, share, and publish their queries
datatalk.genie.stanford.edu
we've also set up a database with all of the relevant data so users can save, share, and publish their queries
datatalk.genie.stanford.edu
but the cool thing about Datatalk is that it can go so deeper — for example, which PACs from CA are the biggest donors to #MDSen candidates?
but the cool thing about Datatalk is that it can go so deeper — for example, which PACs from CA are the biggest donors to #MDSen candidates?
🗣️📈DATATALK📈🗣️ is a platform for asking natural language Qs of FEC data that I've been working on with folks at Stanford, Big Local News, and the Brown Institute
datatalk.genie.stanford.edu
🗣️📈DATATALK📈🗣️ is a platform for asking natural language Qs of FEC data that I've been working on with folks at Stanford, Big Local News, and the Brown Institute
datatalk.genie.stanford.edu
this one is a deep dive into the extreme privacy community — the people who dedicate their lives to hiding from the internet
what drives people to want to disappear? what can we learn from them?
joinreboot.org/p/threat-model
this one is a deep dive into the extreme privacy community — the people who dedicate their lives to hiding from the internet
what drives people to want to disappear? what can we learn from them?
joinreboot.org/p/threat-model