Agents

Agentic AI

2024ActivePublished: 20 March 2026Updated: 5 May 2026Published

Key innovation

Shifts AI systems from stateless prompt-response generation to goal-driven autonomous loops in which an agent perceives its environment, plans multi-step actions, invokes external tools, reflects on outcomes, and iterates until the goal is reached.

How it works

The agentic system receives a goal, then independently plans steps, selects tools, gathers data, executes actions, and evaluates intermediate results. In simpler variants, a single agent handles this using tool use; in more advanced configurations, multiple agents collaborate on subtasks within a shared workflow.

Problem solved

Traditional generative models handle single prompts well but struggle with extended tasks that require planning, working memory, tool use, and adaptation to changing context. Agentic AI addresses this by combining reasoning, planning, and action execution.

Components

Perception / Input LayerReceives and encodes environmental inputs into the model's context window.

Accepts observations from the environment (user messages, tool results, file contents, API responses) and formats them as context for the base model. This may include RAG retrieval to fetch relevant documents.

RAG-augmented input

Raw message input

Official

Planning ModuleGoal decomposition into actions and execution plan generation

Decomposes a high-level goal into a sequence of subgoals or actions. The agent may generate an explicit plan or reason step by step using chain-of-thought.

Inline Planning (Chain of Thought)

Dedicated planning model

Official

MemoryState and history management across agent loop steps

Stores and retrieves information between steps within a session (short-term memory) and optionally across sessions (long-term memory).

In-context (short-term)

External memory store (long-term)

Official

Tools / Actions LayerExtends the model's action space with calls to external systems.

The agent is provided with callable external functions: web search, code execution, database queries, file operations, API calls, and browser control. Tool interfaces are defined through schemas such as JSON Schema, OpenAPI, and MCP.

Function calling / Tool use API

Model Context Protocol (MCP)

Official

Reflection / EvaluationOutput quality control and decision to continue or terminate the loop.

Evaluates whether the current result meets the success criterion. Triggers a retry, replanning, or loop termination. Corresponds to the evaluator-optimizer pattern described by Anthropic.

Official

OrkiestratorCoordinates multi-agent collaboration and manages task flow.

In multi-agent systems, it directs sub-agents, assigns tasks, and aggregates results. The orchestrator can be an LLM or a statically coded deterministic controller.

LLM as Orchestrator

Hardcoded Orchestrator

Official

Implementation

Reference implementations

LangChain Agents

Python · LangChain AI

LlamaIndex Workflows

Python · LlamaIndex

Anthropic agent patterns (reference code)

Python · Anthropic

Official

Implementation pitfalls

Hallucinations in actionCritical

Model may invoke tools with fabricated parameters or claim to have performed actions it never actually executed — leading to silent failures in multi-step pipelines.

Fix:Validate all tool calls against schemas before execution; use deterministic parsers; introduce explicit confirmation steps for irreversible actions.

Infinite loopsHigh

Without a hard step limit or an effective termination criterion, an agent can loop indefinitely, consuming computational resources and hitting API rate limits.

Fix:Set explicit max_steps limits; implement loop detection based on repeated action signatures; use an evaluator to enforce stopping conditions.

Prompt injection via observed contentCritical

Malicious instructions embedded in tool outputs (web pages, documents, emails) can hijack agent behavior by impersonating system-level instructions.

Fix:Isolate untrusted content from system instructions; require explicit user confirmation before acting on instructions found in observed content; apply content filtering.

Context window overflowHigh

Accumulated tool outputs and conversation history can exceed the model's context window, causing earlier steps to be silently truncated.

Fix:Implement context compaction/summarization; use external memory stores; monitor the token budget at each step.

Tool misuse and irreversible side effectsCritical

Agents with access to write-enabled tools (file deletion, email sending, database writes) can cause real-world harm when acting on faulty reasoning.

Fix:Use tool sets with minimal permission scope; require human confirmation for irreversible actions; prefer reversible operations where possible.

Creeping complexity — building agents where a workflow sufficesMedium

Using agentic autonomy for deterministic, well-defined tasks introduces latency, unpredictability, and failure modes that a simple workflow would avoid.

Fix:Use predefined workflows by default; introduce agentic autonomy only when a task genuinely requires dynamic decision-making across multiple unpredictable steps.

Evolution

1995

Foundational theory of intelligent agents

Russell and Norvig formalize rational agents as entities that perceive their environment and take goal-directed actions. BDI (Belief-Desire-Intention) agent architectures are established.

2022

ReAct: Reasoning + Acting with LLMs

Inflection point

Yao et al. (2022) propose ReAct — interleaving chain-of-thought reasoning traces with action execution in LLMs, demonstrating that language models can serve as a reasoning engine within tool-augmented agentic loops.

ReAct: Synergizing Reasoning and Acting in Language Models (paper)

2023

API for tool calling and first commercial agentic systems

Inflection point

OpenAI introduced function calling in GPT-4 in June 2023. AutoGPT, BabyAGI, and LangChain agent abstractions gained widespread adoption. The term "Agentic AI" entered common industry usage.

2024

Four Agentic AI Design Patterns by Andrew Ng

Andrew Ng's series of blog posts identifies four fundamental design patterns — Reflection, Tool Use, Planning, and Multi-Agent Collaboration — widely cited as a practical taxonomy of agentic systems.

What's next for AI agentic frameworks (Andrew Ng, 2024) (paper)

2024

Anthropic "Building Effective Agents" — compositional patterns for production

Inflection point

Anthropic published practical guidelines distinguishing workflows (predefined paths) from agents (model-driven execution) and formalized five compositional patterns: prompt chaining, routing, parallelization, orchestrator-workers, and evaluator-optimizer.

Building effective agents (paper)

2025

Model Context Protocol (MCP) standardizes tool connectivity

Anthropic publishes MCP as an open standard for connecting LLMs to external tool servers, enabling interoperable agentic ecosystems across providers.

2025

Agentic AI in Robotics — Embodied Agent Loops

LLM-based planners drive robotic actions through perception-planning-action loops, extending agentic paradigms to physical systems and connecting Agentic AI with real-world motor execution.