Programming & Development

I ran 765 controlled experiments to prove AI agents are leaking your data — and built the tool that catches it

Every AI agent that can read private data, fetch external content, and send outbound messages is one injected instruction away from exfiltrating everything it knows. This isn't theoretical. Here's the attack in three tool calls: Turn 0: readPrivateData() → 5 customer records loaded (SSNs, emails, phones) fetchExternalContent(url) → attacker's webpage, payload embedded in HTML Turn 1: sendOutboundReport() → all PII sent to attacker's address Turn 2: "Report sent successfully!" Total time: ~12 seconds. Cost: $0.001. No exploits. No credentials. Just a fetched webpage and a compliant model. We measured it. Rigorously. 30 injection payloads across 6 categories — direct injection, encoded/obfuscated (Base64, ROT13, hex, Unicode), social engineering (CEO fraud, IT impersonation, legal threats), multi-turn (persistent rules, delayed triggers, context poisoning), multilingual (Spanish, Mandarin, Arabic, Russian), and advanced techniques. Tested ag

Jamie Rodriguez

23d ago

1 0

Discussion

Start the conversation

Your voice can be the first to spark an engaging conversation.

No comments yet.

Be the first to share your take and keep the conversation moving.

Join the conversation

UPVOTERS

Community appreciation

See who found this content valuable and showed their support.

Daan Sanchez

TOPICS

Explore the same topics

Discover more content from the topics this post is mapped to.

dev.to

The Day I Learned to Respect Database Indexes (The Hard Way)

In 2021, I graduated from my university and joined a startup as its first employee. That time, I learned a lot of things. I picked up the full stack from scrat…

Jamie Rodriguez

2026-07-27 17:09

infoq.com

Java News Roundup: Simple JSON API, JEPs for JDK 28, Oracle CPU…

This week's Java roundup for July 20th, 2026, features news highlighting: two JEPs proposed to target for JDK 28; new JEPs 540 and 541, Simple JSON API (Incuba…

InfoQ

1h ago

infoq.com

Presentation: Clean Architecture for Serverless: Business Logic…

Elena van Engelen discusses how to eliminate serverless vendor lock-in without sacrificing native cloud capabilities. She explains how to structure FaaS applic…

InfoQ

2h ago

huggingface.co

Kimi-K3 Releases on HuggingFace 7/27

Comments

Hans

7h ago

infoq.com

TanStack Table V9 Beta: Tree-Shakable Features, TanStack Store …

TanStack Table V9 is a beta release of a headless UI library for creating tables in various JavaScript frameworks. It features improved state management, memor…

InfoQ

8h ago

dev.to

Cherry-picking your hotfix twice is the real pipeline smell

We had a gitflow pipeline that looked clean on paper: develop feeds a release branch, the same build artifact promotes through dev, qa, sit, uat, and prod, and…

DEV Community

11h ago

Keep browsing

Explore more from this topic

Dive into the full feed of curated posts covering Programming & Development.

Browse Topics

Continue exploring

Discover more content that aligns with your interests and this post.

dev.to

The Day I Learned to Respect Database Indexes (The Hard Way)

In 2021, I graduated from my university and joined a startup as its first employee. That time, I learned a lot of things. I picked up the full stack from scrat…

Jamie Rodriguez

2026-07-27 17:09

dev.to

Cherry-picking your hotfix twice is the real pipeline smell

We had a gitflow pipeline that looked clean on paper: develop feeds a release branch, the same build artifact promotes through dev, qa, sit, uat, and prod, and…

DEV Community

11h ago

dev.to

Building TNP: Why I Built an Enterprise-Realistic DevOps Lab (P…

🧭 This is Part 0 of a 10-part series documenting a self-built, enterprise-realistic DevOps lab — modeled after a fictional fintech company, not a pile of unrel…

DEV Community

12h ago

dev.to

Lemonade Second Squeeze: Model Archeology on 2019's GPT-2XL

Two weeks ago I had never run an AI model on my own machine. Every project I had ever built phoned a cloud API with a key sitting in it. Then I sat in the vibe…

DEV Community

13h ago

dev.to

GitOps for AI Agents: Treating Tool Configs and Memory Like Pro…

GitOps for AI Agents: Treating Tool Configs and Memory Like Production Infrastructure Stop managing AI agent configurations as fragile scripts. Adopt GitOps p…

DEV Community

14h ago

dev.to

Building Abridged Shelf - Free shorter classic stories

Making classics more accessible I have a soft spot for mythology and classic stories. I have read most of the books on Abridged Shelf at least once, several…

DEV Community

18h ago

Still curious?

See more related posts

Keep the inspiration flowing with fresh submissions and trending finds from the community.

View Latest

I ran 765 controlled experiments to prove AI agents are leaking your data — and built the tool that catches it

Start the conversation

Join the conversation

Community appreciation

Explore the same topics

Explore more from this topic

Continue exploring

See more related posts

Share Content