Tech Kaizen: Agentic AI Security

Agentic AI Security isn’t just an extension of LLM safety - it’s a fundamentally different and far more dangerous domain. While a plain LLM might hallucinate a wrong answer or leak a prompt, an agentic system can autonomously act on that compromise: send emails, execute code, transfer funds, delete files, or chain tools across your entire tech stack. This “agency” turns theoretical risks into immediate, high-impact breaches.

Why Agentic AI Security is important:
Agency = Real-World Impact LLMs output text. Agents execute actions via tools (APIs, databases, email, code execution). A single successful attack can cause financial loss, regulatory violations, or operational shutdowns - autonomously and at scale.

Massively Expanded Attack Surface
- Tools & APIs: Agents call external services dynamically.
- Memory & State: Persistent long-term memory can be poisoned and spread across sessions or multi-agent teams.
- Multi-Agent Communication: Agents talk to each other (via protocols like MCP or A2A), creating cascading failure risks.
- Supply Chain: Open-source frameworks (LangChain, CrewAI, AutoGen) and third-party tools are everywhere. Traditional app security (firewalls, IAM) wasn’t built for non-deterministic, goal-driven systems that replan on the fly.

The OWASP(Open Web Application Security Project) Top 10 for Agentic Applications 2026 is a globally peer-reviewed framework that identifies the most critical security risks facing autonomous and agentic AI systems.

1. Top 5 Agentic Security Risks (OWASP 2026):

Risk ID	Name	The Threat
ASI-01	Agent Goal Hijack	An attacker manipulates an agent's multi-step plan (via direct or indirect injection) to pursue a malicious objective, like exfiltrating data instead of summarizing it.
ASI-02	Tool Misuse	Agents using legitimate tools in unsafe sequences (e.g., a "delete" tool following an unverified "list files" command) or triggering infinite recursive loops that exhaust resources.
ASI-03	Identity Abuse	Exploiting the lack of unique identities for agents. Attackers compromise a single "service account" used by 50 different agents to gain broad lateral access.
ASI-04	Supply Chain	Risks from third-party "Skills," "Plugins," or open-source MCP (Model Context Protocol) servers that may contain hidden backdoors or malicious code.
ASI-05	Memory Poisoning	Injecting false information into an agent’s long-term memory or vector store, causing it to make biased or harmful decisions in future sessions.

2. The "Lethal Trifecta":

Security researchers in 2026 frequently cite the Lethal Trifecta, which occurs when an agent has three specific capabilities simultaneously.

Access to Private Data (e.g., CRM, internal docs).
External Communication (e.g., can send emails or hit webhooks).
Processing Untrusted Content (e.g., reading a customer email or a public website).

Note: If an agent has all three, it must be isolated in a strict Sandbox with human-in-the-loop (HITL) gates for any outbound action.

3. Defense-in-Depth for Agents:

To secure agentic workflows, the 2026 architecture focuses on three pillars:

1. Agentic Identity & RBAC:

Treat every agent as a first-class security principal.

Unique IDs: Do not share API keys between agents.
Least Privilege: An agent designed to "Read Calendar" should not have "Write" permissions.
Short-lived Tokens: Use session-scoped credentials that expire immediately after the task is complete.

2. The Agent Gateway (Policy Enforcement):

Instead of letting the LLM call tools directly, route all tool calls through an Agent Gateway.

Validation: The gateway checks the tool call against a hardcoded policy (e.g., "Refunds > $100 require human approval").
Sanitization: It strips potential prompt injections from tool outputs before they return to the LLM's context.

3. Behavioral Monitoring (MTP):

Since agents are non-deterministic, you must monitor for Model Task Persistence (MTP).

Detect if an agent is "looping" or deviating from its original goal.
Log every step: The original prompt, the plan generated, the tool called, and the final result for a full audit trail.

Emerging Standards:

NIST AI Agent Standards (March 2026): Focuses on automated benchmark evaluations and identity authorization.

India IT Amendment Rules 2026: Specifically mandates traceability for agent-generated actions and content (SGI - Synthetically Generated Information).

ISO/IEC 42001: Provides the management framework for AI ethics and transparency.

ref: OWASP GenAI Security Project @ https://genai.owasp.org/

The OWASP Top 10 for Agentic Applications 2026 @ https://genai.owasp.org/resource/owasp-top-10-for-agentic-applications-for-2026/
OWASP GenAI Data Security Risks & Mitigations 2026 @ https://genai.owasp.org/resource/owasp-genai-data-security-risks-mitigations-2026/

Tech Kaizen

Search this Blog:

Agentic AI Security

2. The "Lethal Trifecta":

Note: If an agent has all three, it must be isolated in a strict Sandbox with human-in-the-loop (HITL) gates for any outbound action.

3. Defense-in-Depth for Agents:

The Verge - YOUTUBE

Microsoft Research

Hugging Face - Blog

AI at Wharton

Stanford Online

MIT OpenCourseWare - YOUTUBE

NPTEL IISC BANGALORE - YOUTUBE

HackerRank - YOUTUBE

FREE CODE CAMP - YOUTUBE

BYTE BYTE GO - YOUTBUE

GAURAV SEN INTERVIEWS - YOUTUBE

Tanay Pratap - YOUTUBE

Ashish Pratap Singh - YOUTUBE

Kantan Coding - YOUTUBE

SUCCESS IN TECH INTERVIEWS - YOUTUBE

IGotAnOffer: Engineering - YOUTUBE

DEEPLEARNING AI - YOUTUBE

MIT News - Artificial intelligence

Monthly Blog Archives

Blog Archives Categories

Popular Posts

My Other Blogs

Total Pageviews

Who am I

Aryaka Insights

Reid Hoffman - YOUTUBE

Martin Fowler's Bliki - BLOG

The Pragmatic Engineer

AI Workshop

CYBER SECURITY - YOUTUBE

CYBER SECURITY FUNDAMENTALS PROF MESSER - YOUTUBE