Dmitrijs Trizna
dtrizna.bsky.social
Dmitrijs Trizna
@dtrizna.bsky.social
Cyber-security and AI research | ex-Microsoft | Founding AI Researcher @ Stealth | Agentic AI and adversarial ML for cyber-threat detection
Not discussing this research direction enough, we miss so many TTPs that motivated and malevolent adversaries may utilize against us in the next few years.

Refs:
[1] skylightcyber.com/2019/07/18/c...
[2] open.spotify.com/episode/2xRS...
November 20, 2024 at 10:05 AM
It would be beneficial if the discussion around "AI Red Teaming" could evolve to cover these broader, yet equally critical, aspects, rather than teams of "AI Red Teaming" experts being just narrowly focused on the security of LLM-powered projects.
November 20, 2024 at 10:05 AM
I'd say this is a vibrant topic itself, but not discussed under AI Red Teaming umbrella. But even beyond, consider:

2. Disrupting AI/ML-based defenses: This includes techniques like applying adversarial ML in conventional evasion chains [1] or poisoning defensive models [2], and so much more...
November 20, 2024 at 10:05 AM
There are at least several other potential development directions that should be included under the AI Red Teaming umbrella:

1. Using #AI / #ML as tools for conventional Red Teaming needs: For example, LLMs as co-operators or semi-autonomous agents.
November 20, 2024 at 10:05 AM
Having countless discussions at #BlackHat US this summer, I feel that many security experts, especially classical Red Teamers, expressed disappointment that such a broad concept is often reduced to discussions on text-based attacks like prompt injections.
November 20, 2024 at 10:05 AM