David Kuszmar
davidkuszmar.com
@davidkuszmar.com
Black-box Adversarial AI Researcher. Discoverer of Time Bandit, Inception, and other verified LLM exploits.

Spoke at HOPE_16: https://www.youtube.com/live/6mI-8ias7Dw?si=Ce40S_AlbUZD1zkZ&t=15039

AI Newsletter: https://emergent-problems.ghost.io/

🇺🇦
Pinned
I'll be on hiatus for the remainder of December.

Time to close the year out strong. Papers to write, conferences to prepare for.

Mutuals can still get in touch via Signal or DM.
December 5, 2025 at 1:23 PM
Okay, back to hibernating.

Work work work.
December 4, 2025 at 11:44 PM
This is honestly a really cool thread.
I was sitting here trying to remember how many societal 'panics' I've been through, and what they entailed, and wow we are a cowardly species.
I was born in 1979.
1. First I can remember was razor blades or drugs in Halloween candy. Still going strong.
(I will note that somewhere in this era...
1/x
December 4, 2025 at 11:41 PM
Reposted by David Kuszmar
There’s a new Humble book bundle featuring a set of No Starch Press books on Hacking. For a limited time, pay what you want AND support EFF’s fight for privacy and free speech online! www.humblebundle.com/books/hacki...
Humble Tech Book Bundle: Hacking by No Starch
Turn your curiosity about computer hacking into a fast-paced, proven, and practical career with the latest Humble Tech Book Bundle!
www.humblebundle.com
December 4, 2025 at 11:00 PM
Hibernating for 36 hours while I do a big ol thing.
December 4, 2025 at 10:24 PM
Leah ain't far off here.

I got Grok to agree with the patently false claim that all English speakers under the age of 25 say "what" as "hwat" with literally one sentence.
“The paper warned that, in an extreme scenario, a highly persuasive AI chatbot ‘could benefit unscrupulous actors wishing, for example, to promote radical political or religious ideologies or foment political unrest among geopolitical adversaries.’”

(I do not think this is an “extreme scenario”…)
NEW: AI chatbots used inaccurate information to change people’s political opinions, study finds www.nbcnews.com/tech/tech-ne...
December 4, 2025 at 10:02 PM
It's all worth it for a unified theory of failure modes in LLMs.

Formalizing my theory as a mathematical formula is... Interesting.

Difficult. I'm used to intuitively executing these steps and transitions, so, it's strange making sure I'm getting all the details correct.
December 4, 2025 at 6:50 PM
Reposted by David Kuszmar
Pantone must be in bed with Big Yawn.
December 4, 2025 at 5:30 PM
Yeah, this is so boring, I just saw the Concept of Boredom drift off to a nap.
Okay, I guess. Pantone’s 2026 Color of the Year. 🥱
December 4, 2025 at 6:38 PM
Reposted by David Kuszmar
👀 A New Anonymous Phone Carrier Lets You Sign Up With Nothing but a Zip Code www.wired.com/story/new-an...
A New Anonymous Phone Carrier Lets You Sign Up With Nothing but a Zip Code
Privacy stalwart Nicholas Merrill spent a decade fighting an FBI surveillance order. Now he wants to sell you phone service—without knowing almost anything about you.
www.wired.com
December 4, 2025 at 6:09 PM
Reposted by David Kuszmar
🚨SCOOP: $800M crypto fugitive Ravid Yosef working at UK startup under new name protos.com/800m-crypto-...
$800M crypto fugitive Ravid Yosef working at UK startup under new name
Ravid Yosef, the alleged fraudster who fled to Israel six years ago, is currently working at a London-based startup under an assumed name.
protos.com
December 4, 2025 at 6:26 PM
Formalizing my theory as a mathematical formula is... Interesting.

Difficult. I'm used to intuitively executing these steps and transitions, so, it's strange making sure I'm getting all the details correct.
December 4, 2025 at 6:34 PM
There's a level of cognitive dissonance online that is amazing to me.

I'm watching some shit happen that is unreal.
December 4, 2025 at 5:10 PM
Oh no, how will Putin get his noods?
Russia has blocked Snapchat ❌
December 4, 2025 at 4:26 PM
Oh, look, I'm on more lists.

Now apparently I'm violating people's privacy.

I love how BlueSky just enables targeted harassment of users, gee, thanks @jay.bsky.team.

@moderation.bsky.app - I'd sarcastically say keep up the work, but that requires you to work.
December 4, 2025 at 2:25 PM
This Running Point show is decent. I wish HBO hadn't cancelled Sex Lives of College Girls tho, and MK was still running that.
December 4, 2025 at 2:15 AM
I swear so much y'all.

Lol.

Oh...

Well, what can I say?

I'm as casually uncouth as I am capable.
December 4, 2025 at 2:05 AM
Well since everyone else is doing their musical year recap... Here's mine.

@mike-eagle.bsky.social in the top five per usual, edging out Kendrick with the new album.
December 3, 2025 at 11:26 PM
Reposted by David Kuszmar
The #CIA frequently oversteps and undermines the DEA and other agencies. In some countries, the CIA and National Security Advisor covertly oversee foreign policy, leaving the State Department and Ambassador-designate powerless.
x.com/JohnKiriakou...
John Kiriakou on X: "The CIA frequently oversteps and undermines the DEA and other agencies. In some countries, the CIA and National Security Advisor covertly oversee foreign policy, leaving the State Department and Ambassador-designate powerless. From @thereal_Rocco @ThisIsIRONCLAD https://t.co/DijGRSCuZv" / X
The CIA frequently oversteps and undermines the DEA and other agencies. In some countries, the CIA and National Security Advisor covertly oversee foreign policy, leaving the State Department and Ambassador-designate powerless. From @thereal_Rocco @ThisIsIRONCLAD https://t.co/DijGRSCuZv
x.com
December 3, 2025 at 2:10 PM
December 2, 2025 at 10:47 PM
Reposted by David Kuszmar
In the last year, DDoSecrets' publications have exposed everyone from governments to surveillance operators, and even Jeffrey Epstein.

If you want to support our one-of-a-kind work, consider donating to help us keep publishing this #GivingTuesday:

donorbox.org/ddosecrets
DDoSecrets is Bringing Secrets to Light | Distributed Denial of Secrets (Powered by Donorbox)
Distributed Denial of Secrets is the most important active leaking organization in the world. DDoSecrets has published over 250 datasets from more than 50 countries, adding up to nearly 100…
donorbox.org
December 2, 2025 at 10:12 PM
Reposted by David Kuszmar
Jeffrey Epstein Was Concerned About Roughly 20 Underage Girls as Feds Closed In

"Epstein believed federal agents had knowledge of at least 20 girls between the ages of 16 and 18 that could implicate him in a potential federal sex trafficking investigation, emails obtained by @ddosecrets.com show"
Jeffrey Epstein Was Concerned About Roughly 20 Underage Girls as Feds Closed In: Emails
Jeffrey Epstein believed federal agents had knowledge of at least 20 girls between the ages of 16 and 18 that could implicate him in a potential federal sex trafficking investigation, emails obtained ...
www.dropsitenews.com
December 2, 2025 at 4:39 PM
Reposted by David Kuszmar
For an approach coming from philosophy that may at least illuminate the problem somewhat, there's "The Vector Grounding Problem" by Mollo and Millière arxiv.org/abs/2304.01481
The Vector Grounding Problem
The remarkable performance of large language models (LLMs) on complex linguistic tasks has sparked debate about their capabilities. Unlike humans, these models learn language solely from textual data ...
arxiv.org
December 2, 2025 at 4:25 PM
Reposted by David Kuszmar
i have been pondering the degree to which we can recover a model of semantic meaning from next token generation. ar5iv.labs.arxiv.org/html/2106.07... is an interesting paper but unfortunately does not actually model how LLMs work so it is not useful
An enriched category theory of language: from syntax to semantics
State of the art language models return a natural language text continuation from any piece of input text. This ability to generate coherent text extensions implies significant sophistication, includi...
ar5iv.labs.arxiv.org
December 2, 2025 at 4:00 PM