AI Agent Security — Attacks, Jailbreaking, and Defense · Red Teaming, Monitoring, and Secure Design of Agentic Systems

LLM Red Teaming Methodology: Planning, Scope, Threat Model — Test Plan

Red Teaming, Monitoring, and Secure Design of Agentic Systems

Introduction

LLM red teaming is a structured process of attacking a language model or agentic system by an internal security team in order to discover vulnerabilities before an adversary does. This lesson covers defining objectives and scope, building a threat model (STRIDE, OWASP LLM Top 10), designing a test plan, and measuring the success of a red teaming session.