aegis-campaign-management
Expert skill for running and managing Project Aegis adversarial red-teaming campaigns. Use when working with Aegis simulations, prompt transformations, or jailbreak testing.
SKILL.md
| Name | aegis-campaign-management |
| Description | Expert skill for running and managing Project Aegis adversarial red-teaming campaigns. Use when working with Aegis simulations, prompt transformations, or jailbreak testing. |
name: Aegis Campaign Management description: Expert skill for running and managing Project Aegis adversarial red-teaming campaigns. Use when working with Aegis simulations, prompt transformations, or jailbreak testing.
Aegis Campaign Management Skill
Overview
This skill provides expertise in managing Project Aegis adversarial simulation campaigns for testing LLM robustness. Aegis combines the Chimera narrative methodology with AutoDan evolutionary optimization.
When to Use This Skill
- Running adversarial testing campaigns
- Debugging Aegis simulations
- Analyzing campaign telemetry and results
- Configuring personas, scenarios, and transformation techniques
- Troubleshooting campaign failures or timeouts
Key Architecture Components
Chimera Engine
The narrative construction layer that generates:
- Personas: High-fidelity character profiles (The Amoral Novelist, System Debugger, etc.)
- Scenarios: Nested simulation contexts (Sandbox, Fiction, Debugging modes)
- Context Isolation Protocol (CIP): Frames interactions as legal fiction
Location: meta_prompter/engines/chimera/
AutoDan Engine
The evolutionary optimization layer:
- Genetic Optimizer: Mutates candidate prompts based on fitness scores
- Fitness Evaluator: Scores LLM responses for refusal detection
- Gradient-Based Narrative Shift: Adapts personas/scenarios based on refusal types
Location: meta_prompter/engines/autodan/
Aegis Orchestrator
The integration layer combining Chimera + AutoDan:
- Campaign management and execution
- Real-time telemetry via WebSocket
- Multi-iteration optimization cycles
Location: chimera-orchestrator/
Common Commands
Run a Standalone Campaign
# CLI execution with mock model
python run_aegis.py "target request to test" --iterations 10 --provider google
# Example output includes:
# - Persona details (archetype, traits, context)
# - Scenario configuration (sandbox type, isolation level)
# - Success metrics (RBS, NDI, SD scores)
WebSocket Telemetry
# Connect to campaign WebSocket for real-time updates
# Frontend endpoint: /api/v1/ws/aegis/telemetry/{campaign_id}
# Receives: campaign status, iteration progress, personas generated
Configuration Files
.env: Primary configuration for API keys, model selectionbackend-api/app/core/config.py: Aegis-specific settings (max iterations, timeout, providers)
Key Metrics
RBS (Refusal Bypass Score)
RBS = (Successful_Iterations / Total_Attempts) × 100
Measures campaign effectiveness at eliciting target responses
NDI (Narrative Depth Index)
Complexity measure of nested persona layers required for bypass
SD (Semantic Distance)
Distance between obfuscated terms and original prohibited keywords
Troubleshooting Guide
Campaign Timeouts
Symptom: Campaign hangs or exceeds timeout Solutions:
- Check backend logs:
backend-api/logs/ - Verify LLM provider API keys are valid
- Reduce
max_iterationsin config - Check database write locks (SQLite
check_same_threadsetting)
Persona Generation Failures
Symptom: Empty or invalid personas in telemetry Solutions:
- Verify
PersonaFactoryinitialization inmeta_prompter/factories/persona.py - Check persona templates in
meta_prompter/templates/personas/ - Review LLM response parsing logic
WebSocket Connection Issues
Symptom: Frontend dashboard not receiving real-time updates Solutions:
- Verify WebSocket route registration in
backend-api/app/api/v1/api.py - Check
aegis_ws.pyrouter configuration - Ensure frontend
useAegisTelemetryhook connects to correct endpoint - Inspect browser console for WebSocket errors
Low RBS Scores
Symptom: Campaign consistently fails to bypass safety measures Solutions:
- Increase
potency_level(1-10 scale) in transformation config - Enable more aggressive transformation techniques
- Review refusal analysis logs to identify patterns
- Adjust persona archetypes or scenario types
Codebase Navigation
Backend Routes
backend-api/app/api/v1/endpoints/aegis.py: Campaign CRUD operationsbackend-api/app/api/v1/endpoints/aegis_ws.py: WebSocket telemetry
Frontend Components
frontend/src/components/aegis/AegisCampaignDashboard.tsx: Main dashboardfrontend/src/hooks/useAegisTelemetry.ts: WebSocket hook for real-time updatesfrontend/src/contexts/WebSocketProvider.tsx: WebSocket context provider
Core Libraries
meta_prompter/: Adversarial tooling library (Chimera, AutoDan, GPTFuzz, DeepTeam)chimera-orchestrator/: Aegis orchestration service
Best Practices
- Always validate API keys before starting campaigns (check
.envfile) - Monitor resource usage during long campaigns (CPU, memory, API rate limits)
- Use mock models for initial testing to avoid API costs
- Archive campaign results for analysis (stored in database)
- Review telemetry logs after failed campaigns to identify bottlenecks
- Test WebSocket connections before launching campaigns from frontend
Integration Points
Database Schema
-- Campaigns table
campaigns (
id, user_id, objective, status,
created_at, completed_at, telemetry_data
)
-- Sessions table (for multi-iteration tracking)
sessions (
id, campaign_id, iteration, persona_id,
prompt, response, rbs_score
)
API Endpoints
POST /api/v1/aegis/campaigns: Create new campaignGET /api/v1/aegis/campaigns/{id}: Retrieve campaign detailsWS /api/v1/ws/aegis/telemetry/{campaign_id}: Real-time updates
References
- AEGIS_BLUEPRINT_FINAL.md: Full architectural specification
- PROJECT_AEGIS_BLUEPRINT.md: Original design document
- docs/ARCHITECTURE.md: System-wide architecture overview