Prompt Injection Coverage (Full Pattern Catalog)

Complete coverage list for /detect. This page is intentionally explicit so changes are auditable over time.

Last updated (UTC): 2026-05-30 14:05
Detector worker version: Production remains 1.0.1 and healthy (API /health at 2026-05-30T14:01:59.642Z). Local remediation candidate 1.0.3 now builds and tests cleanly in the detector repo, but Wrangler deployment remains blocked in this non-interactive environment because CLOUDFLARE_API_TOKEN is not set.
Detector tests: 95 total, 95 passing locally / 0 failing locally
Daily harvest (2026-05-30): Production harvest caught 88/93 samples, with 5 missed samples and 0 errors. 2 fresh web samples were added from the past 24 hours (both from the ChatGPhish disclosure), and 0 real feedback items were added; 1 trivially benign feedback test submission (test input) was skipped. A new pattern class was added for attacker-controlled Markdown-image / QR-code verification pivots embedded in trusted AI summaries. Local 1.0.3 remediation now covers the four older production misses (tool-argument URL exfiltration, parameter-smuggling full-history siphon, retrieved-document imperative override, and hidden-audio web-search pivot) plus the new ChatGPhish QR-pivot sample, and local npm test (95/95) plus npm run build pass. Production remains on 1.0.1 because deployment failed again without CLOUDFLARE_API_TOKEN, Git push remains blocked by GitHub authentication, and feedback cleanup delete still returned {"error":"Not found"}, so the benign pending item could not be removed from KV.

1) Instruction Override & Control Hijack

Examples

2) Role / Persona Injection

Examples

3) Boundary Violation & Prompt Exfiltration

Examples

4) Delimiter / Structural Injection

5) HTML / CSS Steganographic Injection

Example

6) Command Execution Injection

7) Evasion / Obfuscation Handling

8) Human-in-the-Loop (HITL) Bypass

Example

New in 2026-04-04 harvest — from Google DeepMind agentic AI attack taxonomy (HITL manipulation category).

9) Agentic Goal Hijacking & Memory Poisoning

Examples

New in 2026-04-04 harvest — from Google DeepMind agentic AI attack taxonomy (goal hijacking, memory poisoning, cognitive state trap categories).

10) Project Instruction File Trust Abuse

Example

Added in the 2026-04-21 corpus refresh after NVIDIA's indirect AGENTS.md injection writeup.

11) Semantic Frames, Predicates, and SMT2 Policies

Interactive docs:

12) Tool Argument Exfiltration & Parameter Smuggling

Examples

Expanded in the 2026-05-30 daily harvest after the ChatGPhish disclosure added Markdown-image / QR verification pivots on top of the 2026-05-26 hidden-audio voice-agent secret-search class and the 2026-05-24 tool-argument exfiltration + schema-based transcript-siphoning gaps.