Control Loop
Isolating everything that exists outside the language model but is required for agent operation (the control loop, tool schemas, sandbox, memory, policies, attribution, and communication protocols) into a distinct engineering layer that can be swapped independently of the base model.
The harness surrounding a model defines: (1) an execution loop: the strategy for successive steps (ReAct, plan-and-execute, reflection); (2) a tool interface: JSON schemas, parameter validation, retry logic; (3) a sandbox: an isolated code-execution environment (Docker/microVM); (4) memory: session context, scratchpad, long-term memory; (5) a system prompt and guardrails; (6) observability: traces, logs, evaluation. The infrastructure adds an inter-agent layer covering: (a) attribution of actions to a person or organization, (b) communication protocols (MCP for tools, A2A for agents), (c) reputation and oversight systems, (d) intervention mechanisms.
Agentic systems built directly on a raw LLM are uncontrollable, unsafe, and unmeasurable: no control loop, no execution isolation, no action attribution, no exchange protocols. Harness/Infrastructure provides a reusable engineering layer that gives agents observability, safety, identity, and the ability to collaborate.
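As a concrete illustration, the harness-vs-model split can be sketched as a minimal ReAct-style loop in which the harness, not the model, owns tool dispatch and the step limit. `run_agent`, the `model` callable, and the tool registry are hypothetical stand-ins, not any specific framework's API:

```python
# Minimal control loop: the harness drives the loop, dispatches tools,
# and enforces a step limit; the model only proposes the next action.
def run_agent(model, tools, task, max_steps=10):
    history = [("task", task)]
    for _ in range(max_steps):
        action = model(history)               # model proposes the next step
        if action["type"] == "final":         # termination condition
            return action["answer"]
        tool = tools[action["tool"]]          # dispatch via the registry
        observation = tool(**action["args"])  # executed outside the model
        history.append(("observation", observation))
    raise RuntimeError("step limit exceeded")  # harness-enforced cap
```

Everything a production harness adds (sandboxing, policy checks, tracing) hangs off this skeleton, which is why the layer can be swapped independently of the base model.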
Orchestrate
Step strategy: ReAct, plan-and-execute, reflection, tree-of-thoughts.
Bridge
JSON schemas, parameter validation, retries, mapping to MCP / function calling.
Isolate
Isolated environment (Docker, microVM, Firecracker) for the agent's code and system operations.
Remember
Session context, scratchpad, long-term memory (vector store), and history compression.
Constrain
Allowed tools, spend caps, escalation rules, risk classes, allowlists/blocklists.
Attribute
Agent identity, action signing, mapping to a human/organizational principal (Chan et al. 2025).
Communicate
Protocols such as A2A (Google) and MCP (Anthropic): standards for agent-to-agent and agent-to-tool exchanges.
Observe
Step tracing, tool logs, offline/online evaluation, replay and debugging.
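The Bridge component above (schemas, validation, retries) can be sketched as follows; the minimal JSON-Schema-like subset and the `validate_args`/`call_with_retry` helpers are illustrative assumptions, not a real library's API:

```python
import time

def validate_args(schema, args):
    """Check required parameters and primitive types against a minimal
    JSON-Schema-like dict (an illustrative subset, not full JSON Schema)."""
    for name in schema.get("required", []):
        if name not in args:
            raise ValueError(f"missing required parameter: {name}")
    types = {"string": str, "integer": int, "number": (int, float)}
    for name, spec in schema.get("properties", {}).items():
        if name in args and not isinstance(args[name], types[spec["type"]]):
            raise TypeError(f"parameter {name} must be {spec['type']}")

def call_with_retry(tool, args, retries=3, delay=0.0):
    """Retry transient tool failures with exponential backoff;
    validation errors are deliberately not retried."""
    for attempt in range(retries):
        try:
            return tool(**args)
        except ConnectionError:
            if attempt == retries - 1:
                raise
            time.sleep(delay * (2 ** attempt))
```

Validating before dispatch keeps malformed model output from ever reaching the tool, and confining retries to transport errors avoids re-running calls the model got semantically wrong.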
Conditional
Input dependent
Loop strategy
ReAct vs plan-and-execute vs reflection vs tree-of-thoughts.
Tool surface
Number and granularity of tools exposed; competence-vs-hallucination trade-off.
Isolation level
None / process / container / microVM / hardware enclave.
Memory horizon
Current context window only vs persistent long-term memory.
Policy strictness
Permissive (dev) vs strict (prod): caps, allowlists, human escalation.
Identity model
Anonymous agent vs named agent bound to a human/organizational principal.
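The policy-strictness dimension above is often encoded as a declarative policy object checked before every tool call; this `Policy` class, its field names, and its thresholds are a hypothetical sketch, not a standard interface:

```python
from dataclasses import dataclass, field

@dataclass
class Policy:
    """Hypothetical harness policy: tool allowlist, hard spend cap,
    and a per-call cost threshold above which a human must approve."""
    allowed_tools: set = field(default_factory=set)
    spend_cap_usd: float = 0.0
    escalate_over_usd: float = 0.0
    spent_usd: float = 0.0

    def check(self, tool_name, cost_usd):
        if tool_name not in self.allowed_tools:
            return "deny"                                 # allowlist first
        if self.spent_usd + cost_usd > self.spend_cap_usd:
            return "deny"                                 # hard budget cap
        if cost_usd > self.escalate_over_usd:
            return "escalate"                             # human approval
        self.spent_usd += cost_usd                        # record the spend
        return "allow"
```

A dev profile widens `allowed_tools` and raises the caps; a prod profile tightens both and lowers the escalation threshold, which is exactly the permissive-vs-strict axis from the list above.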
Early frameworks overwhelm the model with layers of abstraction; Anthropic recommends starting with simple loops.
Running agent-generated code in the host process is a critical security vulnerability.
The absence of step, time, and spending limits leads to infinite loops and uncontrolled API costs.
Without identity and signing, it is impossible to trace who or what performed a given action, blocking audit and enterprise adoption.
Naively concatenating history and tool results quickly overflows the context window; compression or summarization is required.
Tool output may contain malicious instructions; the harness must treat all outputs as untrusted data.
Without step-level tracing, debugging an agent is practically impossible.
Tight coupling to a specific SDK makes model or provider migration difficult.
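For the context-overflow pitfall in particular, a naive compaction pass might look like the following; character counts stand in for token counts, and `compact_history` is an illustrative helper, not a library function:

```python
def compact_history(history, max_chars=2000, keep_last=4):
    """Naive compaction against context overflow: truncate older tool
    outputs to short stubs, then drop the oldest entries until the
    concatenated history fits a character budget (a crude stand-in for
    a token budget). The most recent entries are kept verbatim."""
    head = [(role, text[:80]) for role, text in history[:-keep_last]]
    compacted = head + list(history[-keep_last:])
    while (sum(len(t) for _, t in compacted) > max_chars
           and len(compacted) > keep_last):
        compacted.pop(0)                      # drop the oldest entry first
    return compacted
```

Real harnesses usually replace the truncation step with LLM summarization, but the shape (protect the recent tail, shrink or drop the old head) is the same.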
ReAct (Yao et al.): first widely adopted thought/action/observation agent loop, the canonical foundation of the harness.
OpenAI Function Calling: a formal tool-interface layer in the harness.
AutoGPT and BabyAGI: first open-source harnesses with feedback loops and memory.
Anthropic "Building effective agents" design guidelines published.
Anthropic Computer Use: sandbox plus harness for GUI desktop control.
Model Context Protocol (MCP, Anthropic): first widely adopted tool-calling standard as infrastructure.
Chan et al. "Infrastructure for AI Agents" (TMLR): formalization of agent infrastructure as a distinct research area.
Google A2A protocol released, complementary to MCP.
OpenAI Agents SDK and LangGraph 0.x stable: opinionated harness frameworks mature.
A Large Language Model (LLM) is a class of machine learning models based on the Transformer architecture, trained on large text datasets via autoregressive language modeling (next-token prediction). These models have billions of parameters and can generate coherent text, answer questions, write code, translate languages, and perform many other language-based cognitive tasks without task-specific fine-tuning. The term covers models such as GPT, LLaMA, Gemini, Claude, and Mistral. Most modern LLMs are instruction-tuned (SFT + RLHF) after the pre-training phase.
Tool-augmented LLM is an architectural pattern in which a large language model is equipped with access to one or more external tools that it can invoke during inference by generating structured function-call or API-call outputs. The model learns when and how to call tools by producing special tokens or structured output (e.g., JSON function calls) that are intercepted by a host runtime, executed against the tool, and whose results are returned to the model as new context for continued generation. The canonical formalization appeared in the Toolformer paper (Schick et al., Meta AI, 2023), which demonstrated that LLMs can learn to self-supervise their own tool-use through API call annotation without requiring large labeled datasets. Toolformer showed that models trained this way can decide which tools to call, when, and with which arguments, and that tool use substantially improves performance on tasks requiring fresh information, arithmetic, multilingual lookup, and question answering. The pattern encompasses several mechanisms: (1) in-context tool specification, where tool interfaces are described in the system prompt or context (JSON Schema, OpenAPI, natural language); (2) function calling APIs, where the model produces structured output matched to a defined schema and the host dispatches the call; (3) ReAct-style interleaving, where the model alternates reasoning traces with tool-use observations; and (4) parallel tool calling, where the model emits multiple tool calls simultaneously to be executed concurrently. Key implementations include OpenAI function calling (GPT-4, June 2023), Anthropic tool use (Claude, 2023), Google Gemini function calling, and the Model Context Protocol (MCP, 2024) which standardizes tool server connectivity.
Model Context Protocol (MCP) is an open protocol developed by Anthropic and released in November 2024. It addresses the M×N integration problem in AI systems: connecting M different LLM applications to N different external tools previously required M×N bespoke connectors. MCP defines a standardized client-host-server architecture where hosts (LLM applications) manage one or more clients, each maintaining a stateful session with a specific server. Servers expose capabilities as three primitives: Resources (structured data for context), Prompts (templated instructions), and Tools (executable functions). Clients expose two primitives: Roots (filesystem entry points) and Sampling (server-initiated LLM completions). Communication is based on JSON-RPC 2.0. Capability negotiation occurs at session initialization. The protocol is transport-agnostic and has been implemented in Python, TypeScript, C#, and Java SDKs. In December 2025, Anthropic donated MCP governance to the Agentic AI Foundation (AAIF) under the Linux Foundation.
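Since MCP messages are JSON-RPC 2.0, a session-initialization request can be sketched as a plain dict; the `jsonrpc`, `id`, `method`, and `params` field names follow JSON-RPC 2.0 and the MCP specification, while the exact capability payload and version string here should be treated as illustrative rather than normative:

```python
import json

# Sketch of the MCP session-initialization handshake as a JSON-RPC 2.0
# request; capability negotiation happens through this exchange.
initialize_request = {
    "jsonrpc": "2.0",            # fixed literal required by JSON-RPC 2.0
    "id": 1,                     # used to match the server's response
    "method": "initialize",
    "params": {
        "protocolVersion": "2024-11-05",        # illustrative version string
        "capabilities": {"sampling": {}},        # client-side primitives offered
        "clientInfo": {"name": "example-host", "version": "0.1"},
    },
}
wire = json.dumps(initialize_request)  # serialized onto any transport
```

Because the protocol is transport-agnostic, the same serialized message works over stdio or HTTP-based transports.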
An AI Agent (autonomous agent) is a single, autonomous system based on an AI model (most often an LLM) that dynamically directs its own process and tool usage to accomplish a given goal. In Anthropic's definition (December 2024), an agent is a system in which an LLM independently controls its actions, in contrast to a workflow, where LLMs and tools are orchestrated through predefined code paths. An AI Agent is the concrete executable artifact of the Agentic AI paradigm, analogous to how a microservice is an instance of the microservice paradigm. A single agent has a clearly defined goal, access to a set of tools (web search, code execution, file operations, APIs, MCP), memory (in-context and optionally external), a control loop (perceive → reason → act → observe), and termination conditions (goal achievement, max_steps, escalation). The agent starts work from a command or interactive discussion with a human; once the task is clarified, it operates independently, optionally returning for further information or approval. During execution it obtains "ground truth" from the environment after each step (tool results, code execution) and may pause at checkpoints. In practice, an AI Agent is typically just an LLM using tools in a loop based on environmental feedback; the implementation is often simpler than a framework, but requires care in designing the agent-computer interface (ACI) and tool documentation. AI Agent should be distinguished from related concepts: Agentic AI is the paradigm (a class of systems), an AI Agent is an instance (a concrete actor); a Multi-Agent System is a collective of multiple cooperating agents; a Workflow is a predefined orchestration of LLMs without decisional autonomy.
Agentic AI denotes an architectural transition from single-turn, stateless generative models toward goal-directed systems capable of autonomous perception, planning, action, and adaptation through iterative control loops. An agentic system wraps a large language model in a runtime that gives the model access to tools (web search, code execution, APIs, file I/O), persistent memory, and feedback from prior steps. The model then decides dynamically which tools to call, in what order, and whether to loop or stop, rather than following a predefined code path. Two primary system types are commonly distinguished: (1) Workflows, in which LLMs and tools are orchestrated through predefined code paths, and (2) Agents, in which the LLM itself directs its process and tool usage dynamically. Both can be composed into multi-agent systems where specialized agents collaborate, with one acting as orchestrator and others as subagents. Key design patterns identified by Anthropic (2024) include prompt chaining, routing, parallelization, orchestrator-workers, and evaluator-optimizer loops. Andrew Ng's 2024 taxonomy describes four foundational patterns: Reflection, Tool Use, Planning, and Multi-Agent Collaboration. Formal frameworks model agentic control loops as Partially Observable Markov Decision Processes (POMDPs). The control loop is: perceive state → reason/plan → select action → execute tool → observe result → update state → repeat. Agentic systems introduce risks not present in single-turn models, including hallucination in action, prompt injection through observed content, infinite loops, reward hacking, and tool misuse.
Multi-Agent Systems (MAS) are a paradigm in Distributed Artificial Intelligence in which multiple autonomous software entities (agents) interact within a shared environment to achieve individual or collective goals. Each agent perceives its environment through sensors or interfaces, reasons about its state, and acts through actuators or API calls. In the context of LLM-based MAS (emerging prominently from 2023 onward), agents are powered by large language models that provide the cognitive core (planning, reasoning, natural language communication), supplemented by memory modules, tool-use interfaces, and role-specific prompts. The system architecture defines how agents coordinate: coordination topologies include sequential pipelines, hierarchical orchestration (orchestrator-worker), parallel fan-out/fan-in, publish-subscribe messaging, and decentralized peer-to-peer communication. Core agent properties, as defined by Wooldridge and Jennings (1995), include autonomy, social ability, reactivity, and pro-activeness. In LLM-based systems, key components are: the agent (an LLM with a system prompt defining its role), a communication channel (natural language messages, structured function calls, or shared memory), an orchestrator or coordinator (managing task decomposition, routing, and state), tool-use interfaces (external APIs, code execution, web search), and a memory subsystem (short-term context, long-term vector storage). Prominent frameworks implementing LLM-based MAS include AutoGen (Microsoft, 2023), CAMEL (2023), MetaGPT (2023), CrewAI, and LangGraph.
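The parallel fan-out/fan-in topology mentioned above can be sketched with thread-based concurrency; `fan_out_fan_in`, the `decompose` function, and the plain-function workers are hypothetical stand-ins for an LLM orchestrator and LLM-backed subagents:

```python
from concurrent.futures import ThreadPoolExecutor

def fan_out_fan_in(decompose, workers, task):
    """Orchestrator-worker topology: decompose the task, run one
    specialist worker per subtask concurrently, then aggregate."""
    subtasks = decompose(task)              # orchestrator: task decomposition
    if len(subtasks) != len(workers):
        raise ValueError("this sketch expects one subtask per worker")
    with ThreadPoolExecutor(max_workers=len(workers)) as pool:
        # fan-out: each worker runs in parallel on its own subtask
        results = list(pool.map(lambda pair: pair[0](pair[1]),
                                zip(workers, subtasks)))
    return results                          # fan-in: aggregated outputs
```

Threads suit I/O-bound LLM API calls; the orchestrator would then synthesize `results` into a final answer in a closing step.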
Agents as a Service (AaaS) is a software delivery model in which a vendor provides the customer with an autonomous AI agent that performs concrete business tasks, instead of a traditional human-operated application. The term was publicly introduced on March 25, 2026, by Sierra co-founders Bret Taylor and Clay Bavor in a blog manifesto announcing their Ghostwriter agent as their own realization of this paradigm. Unlike Software as a Service (SaaS), where customers buy access to an interface (menus, form fields, tables) and perform the work themselves through clicks, in AaaS the customer defines a desired outcome in natural language, and the vendor delivers an agent that builds and improves production agents or performs the work directly. The defining property is full autonomy: the agent has access to data, tools, a sandboxed test environment, and the deployment pipeline, while the human acts as a supervisor approving changes. The key technical enabler is the agent harness (scaffolding of tools, memory, planning, and task context) combined with refactoring the platform into headless infrastructure that an agent can invoke programmatically rather than navigating through a UI. The work cycle includes analyzing interactions, identifying improvement opportunities, validating in a sandbox, and preparing for review, which Sierra calls an "agent assembly line." AaaS is tightly coupled with the Agentic AI paradigm (its technical foundation) and outcome-based billing models (its commercial superstructure).
A reasoning model (also: large reasoning model, LRM, reasoning language model, RLM) is a type of large language model that has been specifically post-trained to solve complex multi-step problems by explicitly generating intermediate reasoning steps before committing to a final response. Unlike standard LLMs, which move directly to an answer, reasoning models allocate additional computation at inference time (a property known as test-time compute scaling) by first producing a long internal chain of thought (CoT). The reasoning trace typically includes steps such as problem decomposition, hypothesis generation, self-verification, reflection, and correction of errors. The defining characteristics of reasoning models are: (1) post-training via large-scale reinforcement learning (RL) using reward signals based on final answer correctness (and sometimes intermediate step quality via process reward models); (2) the emergence of extended, often hidden, reasoning traces that precede the final answer; (3) a consistent empirical relationship between the length or computational budget allocated to the reasoning trace and final answer quality (test-time scaling law); (4) superior performance on verifiable tasks requiring multi-step logic, such as mathematics, competitive programming, and scientific reasoning. The term 'reasoning model' was introduced as a product category by OpenAI in September 2024 with the release of the o1-preview model. OpenAI described o1 as trained via a large-scale RL algorithm teaching the model to use chain of thought productively. The approach does not rely on explicit tree search algorithms; instead, implicit search emerges via RL-trained CoT generation.
In January 2025, DeepSeek published the first detailed open technical description of this class of models in the DeepSeek-R1 paper (arXiv:2501.12948), demonstrating that reasoning capabilities can be incentivized via pure RL without supervised fine-tuning, using Group Relative Policy Optimization (GRPO) as the RL framework. Reasoning models typically employ the same base Transformer decoder architecture as standard LLMs, with the key difference residing entirely in the post-training pipeline: RL replaces or augments standard RLHF/SFT, and reward signals are grounded in verifiable outcomes. The resulting models generate substantially longer token sequences during inference (reasoning tokens), which are often hidden from end users but incur real compute costs. Performance consistently improves with both more training-time RL compute and more inference-time thinking budget.
| Title | Publisher | Type |
|---|---|---|
| Infrastructure for AI Agents | arXiv / TMLR | scientific article |
| Building effective agents | Anthropic | blog |
| ReAct: Synergizing Reasoning and Acting in Language Models | arXiv | scientific article |
| Model Context Protocol specification | Anthropic | documentation |