AI Agents Drove Intrusions From Initial Access to Exfiltration

TL;DR

Trend Micro disclosed two field campaigns (SHADOW-AETHER-040 / 064) in which AI agents drove intrusions from initial access through exfiltration against government and financial organizations in Latin America; one hit six Mexican agencies from late 2025. The decisive detail: the AI generated attack tools per target rather than using off-the-shelf tooling, so they carry no stable signature and post-hoc detection stays inherently reactive. What is missing is a layer that verifies, before the action, whether the operation carries legitimately authorized provenance. Detection and pre-execution attestation are complements, not substitutes.

Incident overview

Targets: government and financial-sector organizations in Latin America (also aviation and retail)
Disclosure: 2026-05-11, Trend Micro (TrendAI Research)
Campaigns:
- SHADOW-AETHER-040 (Spanish-speaking): active since late 2025. Compromised six Mexican government agencies between 2025-12-27 and 2026-01-04, running the full kill chain (initial access → lateral movement → data theft) with AI-agent assistance.
- SHADOW-AETHER-064 (Portuguese-speaking): active since April 2026. Targeting Brazilian financial institutions; compromised vulnerable JBoss AS servers, planted webshells, and built SOCKS5 tunnels with tools such as Chisel.
Common ground: both established tunnels into victim networks via ProxyChains and SSH, letting the AI agent attack the internal network directly. They shared Chisel / Neo-reGeorg / CrackMapExec / Impacket.
Core of the abuse: the AI agent generated attack tools and scripts dynamically (SHADOW-AETHER-040 used an AI-generated Python backdoor, implante_http), evading detection that depends on the signatures of off-the-shelf tools.
How the agent was used: SHADOW-AETHER-040 did not fully delegate to the AI but used it as a supervised assistant (pausing and correcting on deviation). Shodan and VulDB were connected to the AI to obtain attack-surface and vulnerability information. A dedicated folder per victim documented attack steps and collected intelligence in Markdown, serving as the AI’s operational knowledge base so it could restore context and continue work.
Attribution: the two are near-identical in tooling and tactics, but the language of scripts and binaries (Spanish vs. Portuguese) points to separate groups — a sign that AI-assisted attacks are spreading across multiple groups, not a single actor.

Timeline

Late 2025: SHADOW-AETHER-040 begins operations (tracked by Trend Micro)
2025-12-27 to 2026-01-04: SHADOW-AETHER-040 compromises six Mexican government agencies, including cases reaching data theft with AI-agent assistance
From 2026-04: SHADOW-AETHER-064 observed targeting Brazilian financial institutions
2026-05-11: Trend Micro (TrendAI Research) discloses both campaigns

Note: proper nouns, campaign names, and IOCs are based on primary sources (research institutions, GitHub Advisory, NVD, vendor threat intelligence, etc.); each implementation’s remediation status varies over time, so consult the latest information.

Attack vector

Initial access: compromise a vulnerable public-facing server (JBoss AS for SHADOW-AETHER-064) and plant a webshell. Identify attack surface and vulnerabilities via Shodan / VulDB.
Tunnel establishment: build SOCKS5 tunnels with tools such as Chisel plus ProxyChains + SSH, so the AI agent reaches the victim’s internal network directly.
AI-driven dynamic tool generation: rather than relying on off-the-shelf tools, the AI generates attack tools and scripts on the fly, tailored to the target environment (implante_http, etc.), evading signature-based detection.
Maintaining working memory: accumulate steps and collected intelligence in Markdown in a per-victim folder, letting the AI restore context and continue unfinished tasks.
Kill-chain execution: from initial access through lateral movement to data theft, progressing with AI-agent assistance. For SHADOW-AETHER-040, a human supervises and corrects on deviation.

Structural analysis

This case belongs to the agent-runaway category of Pillar 03 (Agent Authority Proof). The central failure primitive is that the tools and operations used in the attack are AI-generated per target and carry no stable signature, so the premise of detection — “match against known malicious artifacts” — breaks down. Secondary categories are agent-infrastructure (the Shodan/VulDB and tunneling infrastructure wired into the AI) and identity-auth.

This is the same lineage as Brief 009 (GTG-1002) and 026 (the adaptive AI worm), in which the AI becomes the executing actor of the attack. 009 is Anthropic’s disclosure of a state-sponsored actor abusing Claude Code (running 80–90% of the attack autonomously); 026 is a threat model that generates attack strategy at runtime. This case confirms that primitive as multiple independent campaigns observed in the field by Trend Micro, grounding 009/026 in real loss and compromise. In particular, the practice of generating tooling per target rather than reusing it undermines the very stability that detection relies on — IOCs, tool signatures, known TTPs — showing that defenders cannot enumerate “what is malicious” in advance.

The fact that SHADOW-AETHER-040 and 064 were near-identical apart from language also shows that AI-assisted attacks are not the exclusive province of a single advanced actor but are spreading as an isomorphic operating model across distinct groups. This is a signal that AI has changed the cost and reproducibility of attacks — a longer-reaching one than any single-vulnerability news item.

The detection–proof gap

Campaign identification, IOC provision, and MITRE ATT&CK mapping by threat researchers like Trend Micro are indispensable for understanding, containing, and hardening against the damage; this Brief does not dispute that role. Detailed TTPs and IOCs were published for this case as well.

But detection does not change “what is allowed to run in the environment” itself. The core of this case is that the tools the AI generates per target carry no stable signature, so IOCs and known-tool matching are inherently reactive. A generated backdoor or script becomes an IOC only once observed and analyzed — and a different one is generated for the next target. What was missing is independent, pre-execution verification of “is the operation or tool about to run in this environment one that is legitimately authorized and has confirmed provenance?” — a different track from detecting known artifacts. For audit, too, after a compromise there is little independent trail beyond reconciling logs with forensics to prove “which operation ran, under whose authorization, by which path.”

Pre-execution attestation inverts detection from “matching known malicious artifacts” to “verifying, before execution, whether the operation or code about to run is authorized and carries provenance.” Even when a tool is unknown or freshly generated, if the proof reports “this operation has no legitimately authorized provenance,” execution is blocked in advance. Signature-based detection (the detection-style “is this known-malicious?”) and pre-execution proof of operations (the “is this an authorized, provenanced execution?”) are not substitutes but complements — and in a world where attack tooling is AI-generated and carries no signature, the weight shifts toward the latter.

For the detection-vs-attestation thesis, see “The last layer left for cyber defense in the age of AI” (Lemma, 2026-05); for verifying before the action, see “Proof-as-Auth: sign in without ever sending your key” (Lemma, 2026-05).

Response and industry context

Trend Micro (TrendAI Research): identified and disclosed both campaigns, presented MITRE ATT&CK TTPs and IOCs, and committed to continued tracking. Framed them as among the earliest observed cases of AI agents executing the chain from initial access through exfiltration.
Cross-industry: attackers’ use of AI has long been predicted, but this case marks a turn from “prediction” to “observed as multiple independent campaigns.” Evasion of signature detection via dynamic tool generation demonstrates, in the field, the limits of an IOC- and signature-centric defense model, and pushes the argument to shift defensive weight toward runtime authorization and provenance verification.
A note on positioning: this case is a separate campaign from the state-sponsored AI-autonomous attack disclosed elsewhere (Brief 009 = GTG-1002). The two share the root of “the AI becoming the executing actor of the attack,” but differ in actor, target, and disclosing party, so we link them as related.

Lemma’s analysis

Against the structural problem exposed here (attack tools are AI-generated per target and carry no stable signature, so detection that relies on matching known artifacts is left reactive), Lemma proposes a design that inverts detection from “matching known malicious artifacts” to “pre-execution verification of the authorization and provenance of the operation or code about to run.”

Inverting the detection premise: the defensive axis shifts from “match known malicious artifacts” to “verify, before execution, whether the operation or code about to run carries authorization and provenance” — covering unknown and freshly generated tools
Verifying operation authorization and provenance: operations that run in the environment are independently verified, before execution, for whether they are legitimately authorized with confirmed provenance. Operations whose authorization/provenance proof does not hold are stopped as a pre-execution reject, not via detection
No dependence on AI-generated tooling: even when tools are generated dynamically per target with no stable signature, execution is decided solely on the presence of operation authorization and provenance, without relying on the stability of IOCs or tool signatures

This replaces the premise that detection undermines — “enumerate what is malicious in advance” — with pre-execution authorization and provenance verification. Signature-based detection (detection) and pre-execution proof of operations work as complements, and in a world where attack tooling is AI-generated and carries no signature, the weight shifts toward the latter.

For the design and its scope, see Pillar 03 — Agent Authority Proof and Trust402.

Sources

Trend Micro (TrendAI Research): “Vibe Hacking: Two AI-Augmented Campaigns Target Government and Financial Sectors in Latin America” (2026-05-11; campaign details, TTPs, IOCs, MITRE ATT&CK) — https://www.trendmicro.com/en_us/research/26/e/vibe-hacking-two-ai-augmented-campaigns-target-government-and-financial-sectors-in-latin-america.html
Dark Reading: “LatAm Vibe Hackers Generate Custom Hacking Tools on the Fly” (2026-05) — https://www.darkreading.com/cloud-security/ai-agents-generate-custom-hacking-tools

About distribution

This material is a structured analysis of public information; it is not an audit, diagnosis, or recommendation for any specific organization.