Yawning Titan Autonomous Cyber Defence

Overview#

Training a blue-team agent to defend a network by running it against live attacks is slow, expensive, and carries operational risk. Abstract simulation changes the calculus: define a network topology as a graph, script red-agent behaviours, and let the blue agent learn by running thousands of episodes overnight. The outcomes are measurable, the environments are configurable, and the risk is zero. That is the premise behind Yawning Titan, the open-source autonomous cyber-defence simulation framework developed at NCSC UK.

The Yawning Titan Autonomous Cyber Defence module within Argus provides a simulation and evaluation environment for autonomous cyber-defence experiments built around network scenarios, repeated simulations, and blue-team win-rate measurement. Teams track available network environments, monitor simulation volume, measure completion rates, and assess blue-agent performance over time.

Open Standards#

OAuth 2.0 and JWT Bearer Token: Token-based authentication protects typed, auditable read and write workflows across the platform.
OAuth 2.0 Bearer Token (RFC 6750): every outbound call to the Yawning Titan API carries an HTTP the Bearer authorisation header header, and every inbound authenticated workflow is gated behind the same bearer-token authentication mechanism.
MITRE ATT&CK: the Yawning Titan simulation framework, developed at NCSC/Dstl UK, models red-agent attack behaviours against network graphs using adversarial techniques drawn from the MITRE ATT&CK knowledge base, making ATT&CK the conceptual grounding for the threat scenarios evaluated through this module.
Farama Gymnasium API (formerly OpenAI Gym): the reinforcement-learning agents supported by the module (PPO, DQN, A2C) conform to the Gymnasium environment interface, the open standard contract for RL environment observation spaces, action spaces, and episode stepping.
JSON (RFC 8259): all payloads exchanged between Argus and the Yawning Titan API are serialised as JSON, with Content-Type: application/json set explicitly on launch requests and Accept: application/json on all retrieval calls.
UUID (RFC 4122): network topology identifiers and simulation run identifiers are stored and exchanged as RFC 4122 UUIDs, ensuring globally unique, format-stable references across tenant boundaries.

Last Reviewed: 2026-03-25 Last Updated: 2026-04-14

Key Features#

Network Topology Portfolio: Tracks the number of available network environments used for cyber-defence experimentation, from small office graphs to complex enterprise topologies.
Simulation Volume Monitoring: Shows total simulation activity and the number of completed runs, giving research leads a clear picture of experimentation throughput.
Blue-Team Performance Measurement: Provides average blue win-rate visibility over time, supporting outcome tracking across cohorts, policy versions, and network environments.
Active Simulation Awareness: Surfaces currently running simulations for operators supervising experimentation workloads or coordinating compute resources.
Autonomous Defence Evaluation: Supports structured comparison of automated cyber-defence behaviour across scenarios, enabling evidence-based selection of agent policies for operational deployment candidates.

Use Cases#

Autonomous Defence Research: Teams evaluate how automated blue-team approaches perform across multiple network environments, comparing policy architectures and training strategies.
Cyber Range Experimentation: Operators supervise active simulation campaigns during testing or training events, monitoring throughput and flagging anomalous results.
Defensive Strategy Benchmarking: Security researchers compare blue win rates across scenario sets to understand which approaches are improving and which have reached a performance ceiling.
Exercise Outcome Review: Leadership and analysts review simulation completion and performance trends as part of post-exercise analysis or capability assessment.

Integration#

Autonomous cyber-defence simulation services
Cyber-range and exercise workflows
Assurance and performance measurement dashboards
Cyber operations and training workbenches