Topics
Browse posts by category and tag — every topic we cover, with the latest pieces under each.
Tags
- #adversarial-ml 9
- #prompt-injection 6
- #llm-security 4
- #red-teaming 4
- #jailbreaking 3
- #supply-chain 3
- #attack-patterns 2
- #backdoor-attacks 2
- #black-box-attacks 2
- #data-poisoning 2
- #fine-tuning 2
- #model-security 2
- #adversarial-patches 1
- #adversarial-suffix 1
- #agent-security 1
- #agentic-ai 1
- #ai-attacks 1
- #ai-red-team 1
- #ai-security 1
- #ci-cd 1
- #context-poisoning 1
- #dataset-integrity 1
- #evasion 1
- #evasion-attacks 1
- #fraud-detection 1
- #function-calling 1
- #garak 1
- #gcg 1
- #gpt-4v 1
- #hugging-face 1
- #indirect-injection 1
- #ip-theft 1
- #jailbreak 1
- #llm-attacks 1
- #long-context 1
- #malware-detection 1
- #many-shot 1
- #membership-inference 1
- #memory-attacks 1
- #model-extraction 1
- #model-inversion 1
- #model-poisoning 1
- #multimodal 1
- #optimization-attacks 1
- #owasp 1
- #persistence 1
- #privacy-attacks 1
- #rag 1
- #red-team 1
- #regression-testing 1
- #safety-training 1
- #tool-call-hijacking 1
- #training-data 1
- #training-data-extraction 1
- #trojan-ml 1
- #vision-models 1
- #visual-prompt-injection 1
- #vulnerability-taxonomy 1
- #web-scraping 1
- #white-box 1
Categories
adversarial-ml 6 posts
- Evasion Attacks on Production Classifiers: Malware, Spam, and FraudDeployed ML classifiers in malware, spam, and fraud detection face evasion attacks where the attacker has a clear payoff.
- Poisoning Web-Scale Training Sets: Split-View and FrontrunningYou don't need to control a model's training pipeline to poison it — you only need to control content the crawler will fetch.
- Adversarial Examples Against Vision Models in 2025Where physical-world adversarial patches and digital attacks stand against modern vision models — what still works, what's been hardened, and where the
- Model Extraction via Black-Box Query AttacksHow attackers reconstruct private model weights and decision boundaries through query-only access — the techniques, the economics, and what extracted
- Model Inversion and Membership Inference: Extracting LLM DataHow membership inference attacks determine whether specific data was used to train a model, and how model inversion techniques reconstruct private
- Training Data Poisoning and Backdoor Attacks on LLMsA technical deep-dive into how adversaries manipulate training datasets and introduce hidden backdoors into LLMs — covering poisoning mechanics, stealthy
attack-patterns 5 posts
- Jailbreaking Multimodal Models: Visual Prompt Injection AttacksHow attackers use images, typography, and adversarial visual inputs to bypass safety guardrails in GPT-4V, Claude, and Gemini — and why multimodal inputs
- Supply Chain Attacks on AI Models: Poisoning and BackdoorsHow attackers compromise AI models before they reach production — through malicious fine-tuning, dataset poisoning, serialization exploits, and the unique
- LLM Context Window PoisoningPersistent malicious instructions via memory and context manipulation — how attackers plant long-horizon influence across LLM sessions and what it takes
- Indirect Prompt Injection in RAG PipelinesHow attackers embed malicious instructions in documents that get retrieved into LLM context — and why RAG makes prompt injection a supply-chain problem.
- Tool-Call Hijacking in Agentic SystemsHow attackers exploit the gap between LLM reasoning and actual function execution to trigger unauthorized tool calls — exfiltration via email, rogue