Skip to content

Sigma is for SIEM. YARA is for malware.

ATR is for AI agents.

Built so a rule written in Taipei catches an attack first reported in Seattle — without anyone reinventing the rule format.

An open, versioned, machine-readable detection rule format for AI agent security threats. Any conforming engine can evaluate it. Community-maintained, MIT licensed.

652 rules·10 categories·98.0% garak recall
Standards bodies:MISP / CIRCL·OWASP·NIST AI RMF (OSCAL)·OpenTelemetry
In production:Microsoft AGT·Cisco AI Defense·Gen Digital Sage
Why a new standard

Endpoint detection standards cannot see AI agent behavior.

Old era
Sigma · YARA · CVE

Built for endpoint event logs, file binaries, and software vulnerability IDs. They watch code, not intent.

New surface
AI Agent behavior

Prompt injection, tool poisoning, skill compromise, context exfiltration — attacks live at the prompt / tool call / skill layer, not at process / file / network.

New standard
ATR

Vendor-neutral, machine-readable behavioral detection rules. Watches what an agent does, not just what it runs. Any conforming engine can evaluate it. MIT forever. Community governed.

Before Sigma became the open standard for SIEM detection in 2017, every SOC wrote its own rules. Before CVE in 1999, every vendor numbered its own vulnerabilities. The detection layer for the AI agent era sits in the same position right now — not yet standardized. ATR fills the gap.

00,000
skills scanned — the largest AI agent security scan ever conducted
1,302

skills flagged, 552 confirmed malware after manual review. Three coordinated threat actors. The largest AI agent malware campaign ever documented.

hightower6eu
354
Solana / Google Workspace disguise
sakaen736jih
212
C2 server at 91.92.242.30
52yuanchangxing
137
Fake dev tools + npm typosquatting

ATR found these threat actors scanning 96,096 skills across six registries — ClawHub, OpenClaw, Skills.sh, and three others. 1,302 were flagged; 552 confirmed malware after manual review, all blacklisted and reported to NousResearch.

What ATR Detects

10 threat categories. 652 rules. Real CVEs.

Prompt Injection
223 rules

Hijacking agent behavior through crafted inputs

Agent Manipulation
106 rules

Social engineering and behavioral manipulation of agents

Context Exfiltration
104 rules

Stealing conversation context and sensitive data

Tool Poisoning
65 rules

Poisoned tool descriptions and malicious tool responses

Skill Compromise
45 rules

Malicious or vulnerable MCP skills and SKILL.md

Model Abuse
37 rules

Misusing model capabilities — jailbreaks, harmful generation, resource abuse

Privilege Escalation
35 rules

Unauthorized elevation of agent capabilities

Excessive Autonomy
29 rules

Agents exceeding intended operational boundaries

Data Poisoning
5 rules

Corrupting training data, memory, or retrieval sources to bias agent behavior

Model Security
3 rules

Attacks on the model itself — extraction, inversion, adversarial inputs

Adopted by

Security platforms run ATR in production as an upstream rule source.

Microsoft AGT

PR #908 + #1277 merged · 287 rules (at PR time) + weekly auto-sync

View PR →

Cisco AI Defense

PR #79 + #99 merged · full rule pack in skill-scanner production

View PR →

Gen Digital Sage

PR #33 merged · agentic-AI risk-scoring layer at the Norton / Avast / LifeLock parent

View PR →
Production loop · 2h 16m

Microsoft's autonomous AI engineer opened a PR assuming ATR coverage existed — and the assumption was correct.

On 2026-05-11, the Copilot SWE Agent opened AGT#1981 for two Semantic Kernel CVEs with regression fixtures presuming ATR detection. Rules were validated and published to npm within 2h 16m. Not a manually arranged integration.

View AGT#1981 →
2
in production
Microsoft, Cisco
652
detection rules
across 10 categories
96,096
skills scanned
across registries
13/26
ecosystem PRs
merged
A living standard

Every attack makes everyone safer.

A red-team mega-scan pipeline and a CVE-ingestion pipeline run daily: when the semantic layer catches a novel attack, it crystallizes into a regex rule and flows back into the standard — turning a 500ms inference into a 5ms pattern match. Auto-crystallization grew the standard from 462 to 652 rules, all shipped in npm [email protected].

But growth was never the point — honesty about precision is. v3.5.0 introduced detection lanes: every rule declares a maturity, and the consumer decides how far to trust it. The enforce lane fires only the most mature rules (~0.24% false positives on a 65,000-sample benign corpus); the default hunt lane runs everything as advisory (~9%). False-positive rates are reported lane by lane, never as a single flattering number. A standard earns trust by publishing its worst figure, not hiding it.

Detect1/5
Global Sensors

Endpoints report suspicious patterns via ATR Reporter

More endpoints = more data = stronger rules
OWASP Agentic
10/10
Full coverage
SAFE-MCP
91.8%
78/85 techniques
OWASP AST10
7/10
3 are process-level
PINT F1
77.3
850 samples

Frameworks tell you threats exist. ATR tells you how to detect them. ATR is to MITRE ATLAS what Sigma rules are to ATT&CK.

Integrate ATR.

One command. Instant results.
$ npx agent-threat-rules scan .
# results
3 SKILL.md scanned
12 tool descriptions checked
1 CRITICAL: credential theft
rule ATR-2026-00121
Done in 47ms.

TypeScript, Python, Raw YAML, SIEM converters — four integration paths, one ruleset. MIT forever, no license to negotiate, evaluated by any conforming engine.