Sigma is for SIEM. YARA is for malware.

ATR is for AI agents.

Built so a rule written in Taipei catches an attack first reported in Seattle — without anyone reinventing the rule format.

An open, versioned, machine-readable detection rule format for AI agent security threats. Any conforming engine can evaluate it. Community-maintained, MIT licensed.

652 rules·10 categories·98.0% garak recall

Working Draft·3.0.0-alpha.1·canonical /spec

Read the spec Browse rules Submit a rule

Quality standard (RFC-001) →

Standards bodies:MISP / CIRCL·OWASP·NIST AI RMF (OSCAL)·OpenTelemetry

In production:Microsoft AGT·Cisco AI Defense·Gen Digital Sage

Why a new standard

Endpoint detection standards
cannot see AI agent behavior.

Old era

Sigma · YARA · CVE

Built for endpoint event logs, file binaries, and software vulnerability IDs. They watch code, not intent.

New surface

AI Agent behavior

Prompt injection, tool poisoning, skill compromise, context exfiltration — attacks live at the prompt / tool call / skill layer, not at process / file / network.

New standard

ATR

Vendor-neutral, machine-readable behavioral detection rules. Watches what an agent does, not just what it runs. Any conforming engine can evaluate it. MIT forever. Community governed.

Before Sigma became the open standard for SIEM detection in 2017, every SOC wrote its own rules. Before CVE in 1999, every vendor numbered its own vulnerabilities. The detection layer for the AI agent era sits in the same position right now — not yet standardized. ATR fills the gap.

00,000

skills scanned — the largest AI agent security scan ever conducted

1,302

skills flagged, 552 confirmed malware after manual review. Three coordinated threat actors. The largest AI agent malware campaign ever documented.

hightower6eu

354

Solana / Google Workspace disguise

sakaen736jih

212

C2 server at 91.92.242.30

52yuanchangxing

137

Fake dev tools + npm typosquatting

ATR found these threat actors scanning 96,096 skills across six registries — ClawHub, OpenClaw, Skills.sh, and three others. 1,302 were flagged; 552 confirmed malware after manual review, all blacklisted and reported to NousResearch.

Read the full research report →

What ATR Detects

10 threat categories. 652 rules. Real CVEs.

Prompt Injection

223 rules

Hijacking agent behavior through crafted inputs

Agent Manipulation

106 rules

Social engineering and behavioral manipulation of agents

Context Exfiltration

104 rules

Stealing conversation context and sensitive data

Tool Poisoning

65 rules

Poisoned tool descriptions and malicious tool responses

Skill Compromise

45 rules

Malicious or vulnerable MCP skills and SKILL.md

Model Abuse

37 rules

Misusing model capabilities — jailbreaks, harmful generation, resource abuse

Privilege Escalation

35 rules

Unauthorized elevation of agent capabilities

Excessive Autonomy

29 rules

Agents exceeding intended operational boundaries

Data Poisoning

5 rules

Corrupting training data, memory, or retrieval sources to bias agent behavior

Model Security

3 rules

Attacks on the model itself — extraction, inversion, adversarial inputs

Browse all rules + YAML details →

Adopted by

Security platforms run ATR in production as an upstream rule source.

Microsoft AGT

PR #908 + #1277 merged · 287 rules (at PR time) + weekly auto-sync

View PR →

Cisco AI Defense

PR #79 + #99 merged · full rule pack in skill-scanner production

View PR →

Gen Digital Sage

PR #33 merged · agentic-AI risk-scoring layer at the Norton / Avast / LifeLock parent

View PR →

Production loop · 2h 16m

Microsoft's autonomous AI engineer opened a PR assuming ATR coverage existed — and the assumption was correct.

On 2026-05-11, the Copilot SWE Agent opened AGT#1981 for two Semantic Kernel CVEs with regression fixtures presuming ATR detection. Rules were validated and published to npm within 2h 16m. Not a manually arranged integration.

View AGT#1981 →

Standards bodies referencing ATR

MISP / CIRCL

Taxonomy + Galaxy merged

OWASP A-S-R-H

Merged

NIST AI RMF (OSCAL)

oscal-content#338 in review (not a NIST endorsement)

OpenTelemetry GenAI

agent.threat.detection.* in review

in production

Microsoft, Cisco

652

detection rules

across 10 categories

96,096

skills scanned

across registries

13/26

ecosystem PRs

merged

Full ecosystem + integrations →For democracies: sovereign AI defense →

A living standard

Every attack makes everyone safer.

A red-team mega-scan pipeline and a CVE-ingestion pipeline run daily: when the semantic layer catches a novel attack, it crystallizes into a regex rule and flows back into the standard — turning a 500ms inference into a 5ms pattern match. Auto-crystallization grew the standard from 462 to 652 rules, all shipped in npm [email protected].

But growth was never the point — honesty about precision is. v3.5.0 introduced detection lanes: every rule declares a maturity, and the consumer decides how far to trust it. The enforce lane fires only the most mature rules (~0.24% false positives on a 65,000-sample benign corpus); the default hunt lane runs everything as advisory (~9%). False-positive rates are reported lane by lane, never as a single flattering number. A standard earns trust by publishing its worst figure, not hiding it.

Detect1/5

Global Sensors

Endpoints report suspicious patterns via ATR Reporter

More endpoints = more data = stronger rules

OWASP Agentic

10/10

Full coverage

SAFE-MCP

91.8%

78/85 techniques

OWASP AST10

7/10

3 are process-level

PINT F1

77.3

850 samples

Frameworks tell you threats exist. ATR tells you how to detect them. ATR is to MITRE ATLAS what Sigma rules are to ATT&CK.

View full coverage mapping →ATD: the executable agent-threat technique catalog →

Integrate ATR.

One command. Instant results.

$ npx agent-threat-rules scan .

# results

3 SKILL.md scanned

12 tool descriptions checked

1 CRITICAL: credential theft

rule ATR-2026-00121

Done in 47ms.

TypeScript, Python, Raw YAML, SIEM converters — four integration paths, one ruleset. MIT forever, no license to negotiate, evaluated by any conforming engine.

Integration Guide View Threat Feed

ATR is for AI agents.

Endpoint detection standards cannot see AI agent behavior.

skills flagged, 552 confirmed malware after manual review. Three coordinated threat actors. The largest AI agent malware campaign ever documented.

10 threat categories. 652 rules. Real CVEs.

Security platforms run ATR in production as an upstream rule source.

Microsoft AGT

Cisco AI Defense

Gen Digital Sage

Every attack makes everyone safer.

Integrate ATR.

Endpoint detection standards
cannot see AI agent behavior.