name: system-spec-kit description: "Unified documentation and context preservation: spec folder workflow (levels 1-3+), CORE + ADDENDUM template architecture (v2.2), validation, and Spec Kit Memory for context preservation. Mandatory for all file modifications." allowed-tools: [Bash, Edit, Glob, Grep, Read, Task, Write] version: 2.2.26.0

Spec Kit - Mandatory Conversation Documentation

Orchestrates mandatory spec folder creation for all conversations involving file modifications. Ensures proper documentation level selection (1-3+), template usage, and context preservation through AGENTS.md-enforced workflows.

1. WHEN TO USE

What is a Spec Folder?

A spec folder is a numbered directory (e.g., specs/007-auth-feature/) that contains all documentation for a single feature or task:

Purpose: Track specifications, plans, tasks, and decisions for one unit of work
Location: Always under specs/ directory with format ###-short-name/
Contents: Markdown files (spec.md, plan.md, tasks.md) plus optional memory/ and scratch/ subdirectories

Think of it as a "project folder" for AI-assisted development - it keeps context organized and enables session continuity.

Activation Triggers

MANDATORY for ALL file modifications:

Code files: JS, TS, Python, CSS, HTML
Documentation: Markdown, README, guides
Configuration: JSON, YAML, TOML, env templates
Templates, knowledge base, build/tooling files

Request patterns that trigger activation:

"Add/implement/create [feature]"
"Fix/update/refactor [code]"
"Modify/change [configuration]"
Any keyword: add, implement, fix, update, create, modify, rename, delete, configure, analyze

Example triggers:

"Add email validation to the signup form" → Level 1-2
"Refactor the authentication module" → Level 2-3
"Fix the button alignment bug" → Level 1
"Implement user dashboard with analytics" → Level 3

When NOT to Use

Pure exploration/reading (no file modifications)
Single typo fixes (<5 characters in one file)
Whitespace-only changes
Auto-generated file updates (package-lock.json)
User explicitly selects Option D (skip documentation)

Rule of thumb: If modifying ANY file content → Activate this skill. Status: ✅ This requirement applies immediately once file edits are requested.

Agent Exclusivity

⛔ CRITICAL: @speckit is the ONLY agent permitted to create or substantively write spec folder documentation (*.md files).

Requires @speckit: spec.md, plan.md, tasks.md, checklist.md, decision-record.md, implementation-summary.md, and any other *.md in spec folders
Exceptions:
- memory/ → uses generate-context.js script
- scratch/ → temporary workspace, any agent
- handover.md → @handover agent only
- research.md → @research agent only
- debug-delegation.md → @debug agent only

Routing to @general, @write, or other agents for spec documentation is a hard violation. See constitutional memory: speckit-exclusivity.md

Utility Template Triggers

Template	Trigger Keywords	Action
`handover.md`	"handover", "next session", "continue later", "pass context", "ending session", "save state", "multi-session", "for next AI"	Suggest creating handover
`debug-delegation.md`	"stuck", "can't fix", "tried everything", "same error", "fresh eyes", "hours on this", "still failing", "need help debugging"	Suggest `/spec_kit:debug`

Rule: When detected, proactively suggest the appropriate action.

2. SMART ROUTING

Resource Domains

The router discovers markdown resources recursively from references/ and assets/ and then applies intent scoring from RESOURCE_MAP. Keep this section domain-focused rather than static file inventories.

references/memory/ for context retrieval, save workflows, trigger behavior, and indexing.
references/templates/ for level selection, template composition, and structure guides.
references/validation/ for checklist policy, verification rules, and decision formats.
references/structure/ for folder organization and sub-folder versioning.
references/workflows/ for command workflows and worked examples.
references/debugging/ for troubleshooting and root-cause methodology.
references/config/ for runtime environment configuration.

Template and Script Sources of Truth

Level definitions and template size guidance: level_specifications.md
Template usage and composition rules: template_guide.md
Use templates/level_N/ for operational templates; core/ and addendum/ remain composition inputs.
Script architecture, build outputs, and runtime entrypoints: scripts/README.md
Memory save JSON schema and workflow contracts: save_workflow.md

Primary operational scripts:

spec/validate.sh
spec/create.sh
spec/archive.sh
spec/check-completion.sh
spec/recommend-level.sh
templates/compose.sh

Resource Loading Levels

Level	When to Load	Resources
ALWAYS	Every skill invocation	Shared patterns + SKILL.md
CONDITIONAL	If intent signals match	Intent-mapped references
ON_DEMAND	Only on explicit request	Deep-dive quality standards

Smart Router Pseudocode

The authoritative routing logic for scoped loading, weighted intent scoring, and ambiguity handling.

from pathlib import Path

SKILL_ROOT = Path(__file__).resolve().parent
RESOURCE_BASES = (SKILL_ROOT / "references", SKILL_ROOT / "assets")
DEFAULT_RESOURCE = "references/workflows/quick_reference.md"

INTENT_SIGNALS = {
    "PLAN": {"weight": 3, "keywords": ["plan", "design", "new spec", "level selection", "option b"]},
    "RESEARCH": {"weight": 3, "keywords": ["investigate", "explore", "analyze", "prior work", "evidence"]},
    "IMPLEMENT": {"weight": 3, "keywords": ["implement", "build", "execute", "workflow"]},
    "DEBUG": {"weight": 4, "keywords": ["stuck", "error", "not working", "failed", "debug"]},
    "COMPLETE": {"weight": 4, "keywords": ["done", "complete", "finish", "verify", "checklist"]},
    "MEMORY": {"weight": 4, "keywords": ["memory", "save context", "resume", "checkpoint", "context"]},
    "HANDOVER": {"weight": 4, "keywords": ["handover", "continue later", "next session", "pause"]},
    "PHASE": {"weight": 4, "keywords": ["phase", "decompose", "split", "workstream", "multi-phase", "phased approach", "phased", "multi-session"]},
}

RESOURCE_MAP = {
    "PLAN": [
        "references/templates/level_specifications.md",
        "references/templates/template_guide.md",
    ],
    "RESEARCH": [
        "references/workflows/quick_reference.md",
        "references/workflows/worked_examples.md",
    ],
    "IMPLEMENT": [
        "references/validation/validation_rules.md",
        "references/templates/template_guide.md",
    ],
    "DEBUG": [
        "references/debugging/troubleshooting.md",
        "references/workflows/quick_reference.md",
    ],
    "COMPLETE": [
        "references/validation/validation_rules.md",
    ],
    "MEMORY": [
        "references/memory/memory_system.md",
        "references/memory/save_workflow.md",
    ],
    "HANDOVER": [
        "references/workflows/quick_reference.md",
    ],
    "PHASE": [
        "references/structure/phase_definitions.md",
        "references/structure/sub_folder_versioning.md",
        "references/validation/phase_checklists.md",
    ],
}

COMMAND_BOOSTS = {
    "/spec_kit:plan": "PLAN",
    "/spec_kit:research": "RESEARCH",
    "/spec_kit:implement": "IMPLEMENT",
    "/spec_kit:debug": "DEBUG",
    "/spec_kit:complete": "COMPLETE",
    "/spec_kit:handover": "HANDOVER",
    "/spec_kit:phase": "PHASE",
}

LOADING_LEVELS = {
    "ALWAYS": [DEFAULT_RESOURCE],
    "ON_DEMAND_KEYWORDS": ["deep dive", "full validation", "full checklist", "full template"],
    "ON_DEMAND": [
        "references/validation/phase_checklists.md",
        "references/templates/template_guide.md",
    ],
}

def _task_text(task) -> str:
    parts = [
        str(getattr(task, "query", "")),
        str(getattr(task, "text", "")),
        " ".join(getattr(task, "keywords", []) or []),
        str(getattr(task, "command", "")),
    ]
    return " ".join(parts).lower()

def _guard_in_skill(relative_path: str) -> str:
    """Allow markdown loads only within this skill folder."""
    resolved = (SKILL_ROOT / relative_path).resolve()
    resolved.relative_to(SKILL_ROOT)
    if resolved.suffix.lower() != ".md":
        raise ValueError(f"Only markdown resources are routable: {relative_path}")
    return resolved.relative_to(SKILL_ROOT).as_posix()

def discover_markdown_resources() -> set[str]:
    """Recursively discover routable markdown docs for this skill only."""
    docs = []
    for base in RESOURCE_BASES:
        if base.exists():
            docs.extend(p for p in base.rglob("*.md") if p.is_file())
    return {doc.relative_to(SKILL_ROOT).as_posix() for doc in docs}

def score_intents(task) -> dict[str, float]:
    """Weighted scoring from request text, keywords, and explicit command boosts."""
    text = _task_text(task)
    scores = {intent: 0.0 for intent in INTENT_SIGNALS}

    for intent, cfg in INTENT_SIGNALS.items():
        for keyword in cfg["keywords"]:
            if keyword in text:
                scores[intent] += cfg["weight"]

    command = str(getattr(task, "command", "")).lower()
    for prefix, intent in COMMAND_BOOSTS.items():
        if command.startswith(prefix):
            scores[intent] += 6

    return scores

def select_intents(scores: dict[str, float], ambiguity_delta: float = 1.0, max_intents: int = 2) -> list[str]:
    """Return primary intent and secondary intent when scores are close."""
    ranked = sorted(scores.items(), key=lambda item: item[1], reverse=True)
    if not ranked or ranked[0][1] <= 0:
        return ["IMPLEMENT"]

    selected = [ranked[0][0]]
    if len(ranked) > 1:
        primary_score = ranked[0][1]
        secondary_intent, secondary_score = ranked[1]
        if secondary_score > 0 and (primary_score - secondary_score) <= ambiguity_delta:
            selected.append(secondary_intent)

    return selected[:max_intents]

def route_speckit_resources(task):
    """Scoped, recursive, weighted, ambiguity-aware routing."""
    inventory = discover_markdown_resources()
    intents = select_intents(score_intents(task), ambiguity_delta=1.0)
    loaded = []
    seen = set()

    def load_if_available(relative_path: str) -> None:
        guarded = _guard_in_skill(relative_path)
        if guarded in inventory and guarded not in seen:
            load(guarded)
            loaded.append(guarded)
            seen.add(guarded)

    # ALWAYS: base references for every invocation
    for relative_path in LOADING_LEVELS["ALWAYS"]:
        load_if_available(relative_path)

    # CONDITIONAL: intent-scored resources
    for intent in intents:
        for relative_path in RESOURCE_MAP.get(intent, []):
            load_if_available(relative_path)

    # ON_DEMAND: explicit deep-dive requests
    text = _task_text(task)
    if any(keyword in text for keyword in LOADING_LEVELS["ON_DEMAND_KEYWORDS"]):
        for relative_path in LOADING_LEVELS["ON_DEMAND"]:
            load_if_available(relative_path)

    if not loaded:
        load_if_available(DEFAULT_RESOURCE)

    return {"intents": intents, "resources": loaded}

3. HOW IT WORKS

Gate 3 Integration

See AGENTS.md Section 2 for the complete Gate 3 flow. This skill implements that gate.

When file modification detected, AI MUST ask:

**Spec Folder** (required): A) Existing | B) New | C) Update related | D) Skip

Option	Description	Best For
A) Existing	Continue in related spec folder	Iterative work, related changes
B) New	Create `specs/###-name/`	New features, unrelated work
C) Update	Add to existing documentation	Extending existing docs
D) Skip	No spec folder (creates tech debt)	Trivial changes only

Enforcement: Constitutional-tier memory surfaces automatically via memory_match_triggers().

Complexity Detection (Option B Flow)

When user selects B) New, AI estimates complexity and recommends a level:

Estimate LOC, files affected, risk factors
Recommend level (1, 2, 3, or 3+) with rationale
User accepts or overrides
Run ./scripts/spec/create.sh --level N

Level Guidelines:

LOC	Level	Template Folder
<100	1	`templates/level_1/`
100-499	2	`templates/level_2/`
≥500	3	`templates/level_3/`
Complex	3+	`templates/level_3+/`

See: quick_reference.md for detailed examples.

CLI Tool:

# Create spec folder with level 2 templates
./scripts/spec/create.sh "Add OAuth2 with MFA" --level 2

# Create spec folder with level 3+ (extended) templates
./scripts/spec/create.sh "Major platform migration" --level 3+

3-Level Progressive Enhancement (CORE + ADDENDUM v2.2)

Higher levels ADD VALUE, not just length. Each level builds on the previous:

Level 1 (Core):         Essential what/why/how (~455 LOC)
         ↓ +Verify
Level 2 (Verification): +Quality gates, NFRs, edge cases (~875 LOC)
         ↓ +Arch
Level 3 (Full):         +Architecture decisions, ADRs, risk matrix (~1090 LOC)
         ↓ +Govern
Level 3+ (Extended):    +Enterprise governance, AI protocols (~1075 LOC)

Level	LOC Guidance	Required Files	What It ADDS
1	<100	spec.md, plan.md, tasks.md, implementation-summary.md	Essential what/why/how
2	100-499	Level 1 + checklist.md	Quality gates, verification, NFRs
3	≥500	Level 2 + decision-record.md	Architecture decisions, ADRs
3+	Complex	Level 3 + extended content	Governance, approval workflow, AI protocols

Level Selection Examples:

Task	LOC Est.	Level	Rationale
Fix CSS alignment	10	1	Simple, low risk
Add form validation	80	1-2	Borderline, low complexity
Modal component	200	2	Multiple files, needs QA
Auth system refactor	600	3	Architecture change, high risk
Database migration	150	3	High risk overrides LOC

Override Factors (can push to higher level):

High complexity or architectural changes
Risk (security, config cascades, authentication)
Multiple systems affected (>5 files)
Integration vs unit test requirements

Decision rule: When in doubt → choose higher level. Better to over-document than under-document.

Checklist as Verification Tool (Level 2+)

The checklist.md is an ACTIVE VERIFICATION TOOL, not passive documentation:

Priority	Meaning	Deferral Rules
P0	HARD BLOCKER	MUST complete, cannot defer
P1	Required	MUST complete OR user-approved deferral
P2	Optional	Can defer without approval

AI Workflow:

Load checklist.md at completion phase
Verify items in order: P0 → P1 → P2
Mark [x] with evidence for each verified item
Cannot claim "done" until all P0/P1 items verified

Evidence formats:

[Test: npm test - all passing]
[File: src/auth.ts:45-67]
[Commit: abc1234]
[Screenshot: evidence/login-works.png]
(verified by manual testing)
(confirmed in browser console)

Example checklist entry:

## P0 - Blockers
- [x] Auth flow working [Test: npm run test:auth - 12/12 passing]
- [x] No console errors [Screenshot: evidence/console-clean.png]

## P1 - Required  
- [x] Unit tests added [File: tests/auth.test.ts - 8 new tests]
- [ ] Documentation updated [DEFERRED: Will complete in follow-up PR]

Folder Naming Convention

Format: specs/###-short-name/

Rules:

2-3 words (shorter is better)
Lowercase, hyphen-separated
Action-noun structure
3-digit padding: 001, 042, 099 (no padding past 999)

Good examples: fix-typo, add-auth, mcp-code-mode, cli-codex Bad examples: new-feature-implementation, UpdateUserAuthSystem, fix_bug

Find next number:

ls -d specs/[0-9]*/ | sed 's/.*\/\([0-9]*\)-.*/\1/' | sort -n | tail -1

Sub-Folder Versioning

When reusing spec folders with existing content:

Trigger: Option A selected + root-level content exists
Pattern: 001-original/, 002-new-work/, 003-another/
Memory: Each sub-folder has independent memory/ directory
Tracking: Spec folder path passed via CLI argument (stateless)

Example structure:

specs/007-auth-system/
├── 001-initial-implementation/
│   ├── spec.md
│   ├── plan.md
│   └── memory/
├── 002-oauth-addition/
│   ├── spec.md
│   ├── plan.md
│   └── memory/
└── 003-security-audit/
    ├── spec.md
    └── memory/

Full documentation: See sub_folder_versioning.md

Context Preservation

Manual context save (MANDATORY workflow):

Trigger: /memory:save, "save context", or "save memory"
MUST use: node .opencode/skill/system-spec-kit/scripts/dist/memory/generate-context.js [spec-folder-path]
NEVER: Create memory files manually via Write/Edit (AGENTS.md Memory Save Rule)
Location: specs/###-folder/memory/
Filename: DD-MM-YY_HH-MM__topic.md (auto-generated by script)
Content includes: PROJECT STATE SNAPSHOT with Phase, Last Action, Next Action, Blockers

Subfolder Support:

The generate-context script supports nested spec folder paths (parent/child format):

# Full nested path (parent/child)
node .opencode/skill/system-spec-kit/scripts/dist/memory/generate-context.js 02--system-spec-kit/121-script-audit

# Bare child name (auto-searches all parents for unique match)
node .opencode/skill/system-spec-kit/scripts/dist/memory/generate-context.js 121-script-audit

# With specs/ prefix
node .opencode/skill/system-spec-kit/scripts/dist/memory/generate-context.js specs/02--system-spec-kit/121-script-audit

# Flat folder (existing behavior, unchanged)
node .opencode/skill/system-spec-kit/scripts/dist/memory/generate-context.js 02--system-spec-kit

Memory files are always saved to the child folder's memory/ directory (e.g., specs/02--system-spec-kit/121-script-audit/memory/). If a bare child name matches multiple parents, the script reports an error and requires the full parent/child path.

Memory File Structure:

## Project Context
[Auto-generated summary of conversation and decisions]

## Project State Snapshot
- Phase: Implementation
- Last Action: Completed auth middleware
- Next Action: Add unit tests for login flow
- Blockers: None

## Key Artifacts
- Modified: src/middleware/auth.ts
- Created: src/utils/jwt.ts

Spec Kit Memory System (Integrated)

Context preservation across sessions via hybrid search (vector similarity + BM25 + FTS with Reciprocal Rank Fusion).

Server: @spec-kit/mcp-server v1.7.2 — context-server.ts (~682 lines) with 12 handler files, 20 lib subdirectories, and 25 MCP tools across 7 layers.

MCP Tools (8 most-used of 25 total — see memory_system.md for full reference):

Tool	Layer	Purpose
`memory_context()`	L1	Unified entry point — modes: auto, quick, deep, focused, resume
`memory_search()`	L2	Hybrid search (vector + FTS + BM25 with RRF fusion). With optional adaptive fusion (SPECKIT_ADAPTIVE_FUSION) and artifact-class routing
`memory_match_triggers()`	L2	Trigger matching + cognitive (decay, tiers, co-activation)
`memory_save()`	L2	Index a memory file with pre-flight validation
`memory_list()`	L3	Browse stored memories with pagination (parent rows by default)
`memory_delete()`	L4	Delete memories by ID or spec folder
`checkpoint_create()`	L5	Create gzip-compressed checkpoint snapshot
`checkpoint_restore()`	L5	Transaction-wrapped restore with rollback

Search architecture: The search pipeline uses a 4-stage architecture (candidate generation → fusion → reranking → filtering). See search/README.md for pipeline details, scoring algorithms, and graph signal features.

memory_context() — Mode Routing:

Mode	Token Budget	When `mode=auto`: Intent Routing
`quick`	800	—
`deep`	2000	`add_feature`, `refactor`, `security_audit`
`focused`	1500	`fix_bug`, `understand`
`resume`	1200	—

memory_search() — Key Rules:

REQUIRED: query (string) OR concepts (2-5 strings). specFolder alone causes E040 error.
Use anchors with includeContent: true for token-efficient section retrieval (~90% savings).
Intent weights auto-adjust scoring: fix_bug boosts recency, security_audit boosts importance, refactor/understand boost similarity.
Full parameter reference: See memory_system.md

memory_save() — Save-Time Processing:

Runs a pre-storage quality gate (threshold 0.4 signal density). Low-quality saves receive warnings or rejection when strict. See SPECKIT_SAVE_QUALITY_GATE flag.
Similar existing memories are auto-merged via reconsolidation (≥0.88 similarity). The save may update an existing memory instead of creating a new one. See SPECKIT_RECONSOLIDATION flag.
A verify-fix-verify loop auto-corrects trigger phrases, anchors, and token budget (up to 3 retries).
Entities are extracted and linked cross-document at save time. See SPECKIT_AUTO_ENTITIES and SPECKIT_ENTITY_LINKING flags.

Epistemic Learning: Use task_preflight() before and task_postflight() after implementation to measure knowledge gains. Learning Index: LI = (KnowledgeDelta × 0.4) + (UncertaintyReduction × 0.35) + (ContextImprovement × 0.25). Review trends via memory_get_learning_history(). See epistemic_vectors.md.

Key Concepts:

Constitutional tier — 3.0x search boost + 2.0x importance multiplier; merged into normal scoring pipeline
Document-type scoring — 10 indexed document types with multipliers: spec (1.4x), plan (1.3x), constitutional (2.0x), decision_record (1.4x), tasks (1.1x), implementation_summary (1.1x), research (1.1x), checklist (1.0x), handover (1.0x), memory (1.0x). README files and skill-doc trees (sk-*, including references/ and assets/) are excluded from memory indexing.
Decay scoring — FSRS v4 power-law model; recent memories rank higher
Import-path hardening - Spec 126 fixed MCP import-path regressions in memory runtime modules (including context server + attention decay wiring)
Metadata preservation pipeline - memory_save update/reinforce paths preserve document_type and spec_level, and vector-index metadata updates stay in sync
Descriptive memory titles - context generation writes MEMORY_TITLE into frontmatter and heading; parser falls back to feature/overview content when the top heading is generic (for example, "SESSION SUMMARY")
Causal edge stability - conflict-update semantics keep causal edge IDs stable during re-link and graph maintenance operations
Real-time sync — Use memory_save or memory_index_scan after creating files
Checkpoints — Gzip-compressed JSON snapshots of memory_index + working_memory; max 10 stored; transaction-wrapped restore
Indexing persistence — After generate-context.js, call memory_index_scan() or memory_save() for immediate MCP visibility
Artifact routing — 9 artifact classes (spec, plan, tasks, checklist, decision-record, implementation-summary, memory, research, unknown) with per-type retrieval strategies applied at query time
Adaptive fusion — Intent-aware weighted RRF with 7 task-type profiles (fix_bug, add_feature, understand, refactor, security_audit, find_spec, find_decision). Enabled by default via feature flag SPECKIT_ADAPTIVE_FUSION (set false to disable)
Retrieval trace — Typed ContextEnvelope wraps every retrieval response with pipeline stages and a DegradedModeContract describing fallback behavior
Mutation ledger — Append-only audit trail for all memory mutations (create, update, delete, reinforce); implemented via SQLite triggers; queryable for compliance and rollback
Retrieval telemetry — 4-dimension metrics (latency, retrieval mode, fallback activation, quality score). Enabled via feature flag SPECKIT_EXTENDED_TELEMETRY (default: on)
Validation scoring — wasUseful=false applies a demotion penalty to memory scores; 5+ positive validations may promote a memory's importance tier

Feature Flags:

Flag	Default	Effect
`SPECKIT_ADAPTIVE_FUSION`	on	Enables intent-aware weighted RRF with 7 task-type profiles in `memory_search()` (set `false` to disable)
`SPECKIT_EXTENDED_TELEMETRY`	on	Emits 4-dimension retrieval metrics (latency, mode, fallback, quality) per search operation
`SPECKIT_INDEX_SPEC_DOCS`	on	Gates spec document indexing in `memory_index_scan()`. When enabled, discovers and indexes spec folder documents (specs, plans, tasks, etc.) with document-type scoring multipliers. Set `SPECKIT_INDEX_SPEC_DOCS=false` to disable.
`SPECKIT_SAVE_QUALITY_GATE`	on	Pre-storage quality gate rejects content below 0.4 signal density (14-day warn-only period after activation)
`SPECKIT_RECONSOLIDATION`	on	Auto-merges similar memories on save when similarity ≥0.88; supersedes at 0.75-0.88
`SPECKIT_NEGATIVE_FEEDBACK`	on	`wasUseful=false` applies score demotion with 30-day recovery window
`SPECKIT_LEARN_FROM_SELECTION`	on	Tracks which search results are used and boosts them in future searches
`SPECKIT_EMBEDDING_EXPANSION`	on	Expands queries with semantic neighbors before vector search
`SPECKIT_AUTO_ENTITIES`	on	Extracts entities at save time for cross-document linking
`SPECKIT_ENTITY_LINKING`	on	Links memories sharing extracted entities during search

Set via environment variable before starting the MCP server (e.g., SPECKIT_ADAPTIVE_FUSION=1).

Token budgets per layer: L1:2000, L2:1500, L3:800, L4:500, L5:600, L6:1200, L7:1000 (enforced via chars/3.5 approximation).

Full documentation: See memory_system.md for tool behavior, importance tiers, and configuration.

Validation Workflow

Automated validation of spec folder contents via validate.sh.

Usage: .opencode/skill/system-spec-kit/scripts/spec/validate.sh <spec-folder>

Exit Codes:

Code	Meaning	Action
0	Passed (no errors, no warnings)	Proceed with completion
1	Passed with warnings	Address or document warnings
2	Failed (errors found)	MUST fix before completion

Completion Verification:

Run validation: ./scripts/spec/validate.sh <spec-folder>
Exit 2 → FIX errors
Exit 1 → ADDRESS warnings or document reason
For code changes, run alignment verifier: python3 .opencode/skill/sk-code--opencode/scripts/verify_alignment_drift.py --root .opencode/skill/system-spec-kit
Exit 0 from both checks → Proceed with completion claim

Full documentation: See validation_rules.md for all rules, configuration, and troubleshooting.

4. RULES

✅ ALWAYS

Determine level (1/2/3/3+) before ANY file changes - Count LOC, assess complexity/risk
Copy templates from templates/level_N/ - Use level folders, NEVER create from scratch
Fill ALL placeholders - Remove placeholder markers and sample content
Ask A/B/C/D/E when file modification detected - Present options, wait for selection
Check for related specs before creating new folders - Search keywords, review status
Get explicit user approval before changes - Show level, path, templates, approach
Use consistent folder naming - specs/###-short-name/ format
Use checklist.md to verify (Level 2+) - Load before claiming done
Mark items [x] with evidence - Include links, test outputs, screenshots
Complete P0/P1 before claiming done - No exceptions
Suggest handover.md on session-end keywords - "continue later", "next session"
Run validate.sh before completion - Completion Verification requirement
Create implementation-summary.md at end of implementation phase (Level 1+) - Document what was built
Suggest /spec_kit:handover when session-end keywords detected OR after extended work (15+ tool calls) - Proactive context preservation
Suggest /spec_kit:debug after 3+ failed fix attempts on same error - Do not continue without offering debug delegation
Suggest /spec_kit:phase when task requires multi-phase decomposition - Complex specs spanning multiple sessions or workstreams
Route all code creation/updates through sk-code--opencode - Full alignment is mandatory before claiming completion
Route all documentation creation/updates through sk-doc - Full alignment is mandatory before claiming completion
Enforce ToC policy from validation rules - Only research.md may include a Table of Contents section; remove ToC headings from standard spec artifacts

❌ NEVER

Create documentation from scratch - Use templates only
Skip spec folder creation - Unless user explicitly selects D
Make changes before spec + approval - Spec folder is prerequisite
Leave placeholders in final docs - All must be replaced
Decide autonomously update vs create - Always ask user
Claim done without checklist verification - Level 2+ requirement
Proceed without spec folder confirmation - Wait for A/B/C/D/E
Skip validation before completion - Completion Verification hard block
Add ToC sections to standard spec artifacts - spec.md, plan.md, tasks.md, checklist.md, decision-record.md, implementation-summary.md, handover.md, and debug-delegation.md must not contain ToC headings

⚠️ ESCALATE IF

Scope grows during implementation - Run upgrade-level.sh to add higher-level templates (recommended), then auto-populate all placeholder content:
- Read all existing spec files (spec.md, plan.md, tasks.md, implementation-summary.md) for context
- Replace every placeholder marker pattern in newly injected sections with content derived from that context
- For sections without sufficient source context, write "N/A — insufficient source context" instead of fabricating content
- Run check-placeholders.sh <spec-folder> to verify zero placeholders remain (see level_specifications.md for the full procedure)
- Document the level change in changelog
Uncertainty about level <80% - Present level options to user, default to higher
Template doesn't fit requirements - Adapt closest template, document modifications
User requests skip (Option D) - Warn about tech debt, explain debugging challenges, confirm consent
Validation fails with errors - Report specific failures, provide fix guidance, re-run after fixes

5. SUCCESS CRITERIA

Documentation Created

Spec folder exists at specs/###-short-name/
Folder name follows convention (2-3 words, lowercase, hyphen-separated)
Number is sequential (no gaps or duplicates)
Correct level templates copied (not created from scratch)
All placeholders replaced with actual content
Sample content and instructional comments removed
Cross-references to sibling documents work (spec.md ↔ plan.md ↔ tasks.md)
No ToC heading in non-research spec artifacts (ToC allowed only in research.md)

User Approval

Asked user for A/B/C/D choice when file modification detected
Documentation level presented with rationale
Spec folder path shown before creation
Templates to be used listed
Explicit approval ("yes", "go ahead", "proceed") received before file changes

Context Preservation

Context saved via generate-context.js script (NEVER manual Write/Edit)
Memory files contain PROJECT STATE SNAPSHOT section
Manual saves triggered via /memory:save or keywords
Anchor pairs properly formatted and closed

Checklist Verification (Level 2+)

Loaded checklist.md before claiming completion
Verified items in priority order (P0 → P1 → P2)
All P0 items marked [x] with evidence
All P1 items marked [x] with evidence
P2 items either verified or deferred with documented reason
No unchecked P0/P1 items remain

Validation Passed

Ran validate.sh on spec folder
Exit code is 0 (pass) or 1 (warnings only)
All ERROR-level issues resolved
WARNING-level issues addressed or documented

6. INTEGRATION POINTS

Priority System

Priority	Level	Deferral
P0	Blocker	Cannot proceed without resolution
P1	Warning	Must address or defer with user approval
P2	Optional	Can defer without approval

Validation Triggers

AGENTS.md Gate 3 → Validates spec folder existence and template completeness
AGENTS.md Completion Verification → Runs validate.sh before completion claims
Manual /memory:save → Context preservation on demand
Template validation → Checks placeholder removal and required field completion

Cross-Skill Workflows

Workflow	Flow
Spec → Implementation	system-spec-kit → sk-code--opencode (mandatory for code changes) → sk-git → Spec Kit Memory
Documentation Quality	system-spec-kit → sk-doc (mandatory for documentation changes; validate, score) → Iterate if <90
Validation	Implementation complete → validate.sh → Fix errors → Address warnings → Claim completion

Quick Reference Commands

Command	Usage
Create spec folder	`./scripts/spec/create.sh "Description" --short-name name --level 2`
Validate	`.opencode/skill/system-spec-kit/scripts/spec/validate.sh specs/007-feature/`
Verify code alignment drift	`python3 .opencode/skill/sk-code--opencode/scripts/verify_alignment_drift.py --root .opencode/skill/system-spec-kit`
Save context	`node .opencode/skill/system-spec-kit/scripts/dist/memory/generate-context.js specs/007-feature/`
Next spec number	`ls -d specs/[0-9]/ \| sed 's/.\/\([0-9]\)-./\1/' \| sort -n \| tail -1`
Upgrade level	`bash .opencode/skill/system-spec-kit/scripts/spec/upgrade-level.sh specs/007-feature/ --to 2`
Completeness	`.opencode/skill/system-spec-kit/scripts/spec/calculate-completeness.sh specs/007-feature/`

7. RELATED RESOURCES

Related Skills

Direction	Skill	Integration
Upstream	None	This is the foundational workflow
Downstream	sk-code--opencode	Mandatory alignment for all code changes
Downstream	sk-git	References spec folders in commit messages and PRs
Downstream	sk-doc	Mandatory alignment for all documentation changes
Integrated	Spec Kit Memory	Context preservation via MCP (merged into this skill)

External Dependencies

Resource	Location	Purpose
Templates	`templates/level_1/` through `level_3+/` (see Resource Inventory above)	Pre-merged level templates
Validation	`scripts/spec/validate.sh`	Automated validation
Gates	`AGENTS.md` Section 2	Gate definitions
Memory gen	`scripts/memory/generate-context.ts` → `scripts/dist/`	Memory file creation
MCP Server	`mcp_server/context-server.ts`	Spec Kit Memory MCP (~682 lines)
Database	`mcp_server/dist/database/context-index.sqlite`	Vector search index (canonical runtime path)
Constitutional	`constitutional/`	Always-surface rules

Remember: This skill is the foundational documentation orchestrator. It enforces structure, template usage, context preservation, and validation for all file modifications. Every conversation that modifies files MUST have a spec folder.

Name	system-spec-kit
Description	Unified documentation and context preservation: spec folder workflow (levels 1-3+), CORE + ADDENDUM template architecture (v2.2), validation, and Spec Kit Memory for context preservation. Mandatory for all file modifications.

system-spec-kit

SKILL.md

Spec Kit - Mandatory Conversation Documentation

1. WHEN TO USE

What is a Spec Folder?

Activation Triggers

When NOT to Use

Agent Exclusivity

Utility Template Triggers

2. SMART ROUTING

Resource Domains

Template and Script Sources of Truth

Resource Loading Levels

Smart Router Pseudocode

3. HOW IT WORKS

Gate 3 Integration

Complexity Detection (Option B Flow)

3-Level Progressive Enhancement (CORE + ADDENDUM v2.2)

Checklist as Verification Tool (Level 2+)

Folder Naming Convention

Sub-Folder Versioning

Context Preservation

Spec Kit Memory System (Integrated)

Validation Workflow

4. RULES

✅ ALWAYS

❌ NEVER

⚠️ ESCALATE IF

5. SUCCESS CRITERIA

Documentation Created

User Approval

Context Preservation

Checklist Verification (Level 2+)

Validation Passed

6. INTEGRATION POINTS

Priority System

Validation Triggers

Cross-Skill Workflows

Quick Reference Commands

7. RELATED RESOURCES

Related Skills

External Dependencies