Agent Skill
2/7/2026

skill-validator

Validate skills against production-level criteria. Use when reviewing, auditing, or improving skills to ensure they meet quality standards. Evaluates structure, content quality, user interaction patterns, documentation completeness, domain standards compliance, and technical robustness. Returns actionable validation report with scores and improvement recommendations.

A
alijilani
0GitHub Stars
1Views
npx skills add alijilani-dev/Claude

SKILL.md

Nameskill-validator
DescriptionValidate skills against production-level criteria. Use when reviewing, auditing, or improving skills to ensure they meet quality standards. Evaluates structure, content quality, user interaction patterns, documentation completeness, domain standards compliance, and technical robustness. Returns actionable validation report with scores and improvement recommendations.

name: skill-validator description: Validate skills against production-level criteria. Use when reviewing, auditing, or improving skills to ensure they meet quality standards. Evaluates structure, content quality, user interaction patterns, documentation completeness, domain standards compliance, and technical robustness. Returns actionable validation report with scores and improvement recommendations.

Skill Validator

Validate any skill against production-level quality criteria.

Validation Workflow

Phase 1: Gather Context

  1. Read the skill's SKILL.md completely
  2. Identify skill type from frontmatter description:
    • Builder skill (creates artifacts)
    • Guide skill (provides instructions)
    • Automation skill (executes workflows)
    • Hybrid skill (combination)
  3. Read all reference files in references/ directory
  4. Check for assets/scripts directories

Phase 2: Apply Criteria

Evaluate against 7 criteria categories. Each criterion scores 0-3:

  • 0: Missing/Absent
  • 1: Present but inadequate
  • 2: Adequate implementation
  • 3: Excellent implementation

Criteria Categories

1. Structure & Anatomy (Weight: 15%)

CriterionWhat to Check
SKILL.md existsRoot file present
Line count<500 lines (context is precious)
Frontmattername and description present in YAML
Description qualityTriggers: when to use, what it does, key features
No extraneous filesNo README.md, CHANGELOG.md, LICENSE in skill dir
Progressive disclosureDetails in references/, not bloated SKILL.md
Asset organizationTemplates in assets/, scripts in scripts/

Fail condition: Missing SKILL.md or >800 lines = automatic fail

2. Content Quality (Weight: 20%)

CriterionWhat to Check
ConcisenessNo verbose explanations, context is public good
Imperative formInstructions use "Do X" not "You should do X"
Appropriate freedomConstraints where needed, flexibility where safe
Scope clarityClear what skill does AND does not do
No hallucination riskNo instructions that encourage making up info
Output specificationClear expected outputs defined

3. User Interaction (Weight: 15%)

CriterionWhat to Check
Clarification triggersAsks questions before acting on ambiguity
Required vs optionalDistinguishes must-know from nice-to-know
Graceful handlingWhat to do when user doesn't answer
No over-askingDoesn't ask obvious or inferrable questions
Context awarenessUses available context before asking

Key pattern to look for:

## Required Clarifications
1. Question about X
2. Question about Y

## Optional Clarifications
3. Question about Z (if relevant)

4. Documentation & References (Weight: 15%)

CriterionWhat to Check
Source URLsOfficial documentation links provided
Reference filesComplex details in references/ not main file
Fetch guidanceInstructions to fetch docs for unlisted patterns
Version awarenessNotes about checking for latest patterns
Example coverageGood/bad examples for key patterns

Key pattern to look for:

| Resource | URL | Use For |
|----------|-----|---------|
| Official Docs | https://... | Complex cases |

5. Domain Standards (Weight: 15%)

CriterionWhat to Check
Best practicesFollows domain conventions (e.g., WCAG, OWASP)
Enforcement mechanismChecklists, validation steps, must-verify items
Anti-patternsLists what NOT to do
Quality gatesOutput checklist before delivery

Key pattern to look for:

### Must Follow
- [ ] Requirement 1
- [ ] Requirement 2

### Must Avoid
- Antipattern 1
- Antipattern 2

6. Technical Robustness (Weight: 10%)

CriterionWhat to Check
Error handlingGuidance for failure scenarios
Security considerationsInput validation, secrets handling if relevant
DependenciesExternal tools/APIs documented
Edge casesCommon edge cases addressed
TestabilityCan outputs be verified?

7. Maintainability (Weight: 10%)

CriterionWhat to Check
ModularityReferences are self-contained topics
Update pathEasy to update when standards change
No hardcoded valuesUses placeholders/variables where appropriate
Clear organizationLogical section ordering

Scoring Guide

Category Scores

Calculate each category score:

Category Score = (Sum of criterion scores) / (Max possible) * 100

Overall Score

Overall = Σ(Category Score × Weight)

Rating Thresholds

ScoreRatingMeaning
90-100ProductionReady for wide use
75-89GoodMinor improvements needed
60-74AdequateFunctional but needs work
40-59DevelopingSignificant gaps
0-39IncompleteMajor rework required

Output Format

Generate validation report:

# Skill Validation Report: [skill-name]

**Rating**: [Production/Good/Adequate/Developing/Incomplete]
**Overall Score**: [X]/100

## Summary
[2-3 sentence assessment]

## Category Scores

| Category | Score | Weight | Weighted |
|----------|-------|--------|----------|
| Structure & Anatomy | X/100 | 15% | X |
| Content Quality | X/100 | 20% | X |
| User Interaction | X/100 | 15% | X |
| Documentation | X/100 | 15% | X |
| Domain Standards | X/100 | 15% | X |
| Technical Robustness | X/100 | 10% | X |
| Maintainability | X/100 | 10% | X |

## Critical Issues (if any)
- [Issue requiring immediate fix]

## Improvement Recommendations
1. **High Priority**: [Specific action]
2. **Medium Priority**: [Specific action]
3. **Low Priority**: [Specific action]

## Strengths
- [What skill does well]

Quick Validation Checklist

For rapid assessment, check these critical items:

  • SKILL.md <500 lines
  • Frontmatter has name + description
  • No README.md/CHANGELOG.md in skill directory
  • Has clarification questions for builder skills
  • Has official documentation links
  • Has enforcement checklist (if domain standards exist)
  • Has output specification
  • References exist for complex details

If 6+ checked: Likely Good or better If 4-5 checked: Likely Adequate If <4 checked: Needs significant work


Reference Files

FileWhen to Read
references/detailed-criteria.mdDeep evaluation of specific criterion
references/scoring-examples.mdExample validations for calibration
references/improvement-patterns.mdCommon fixes for common issues

Usage Examples

Validate a skill

Validate the chatgpt-widget-creator skill against production criteria

Quick audit

Quick validation check on mcp-builder skill

Focused review

Check if skill-creator skill has proper user interaction patterns
Skills Info
Original Name:skill-validatorAuthor:alijilani