Agent Skill
2/7/2026

slack-memory-cleanup

Memory cleanup and organization skill for AI employees. Provides guidelines for detecting duplicates, fixing misclassified files, and removing stale information from memory storage.

K
krafton
587GitHub Stars
1Views
npx skills add krafton-ai/KIRA

SKILL.md

Nameslack-memory-cleanup
DescriptionMemory cleanup and organization skill for AI employees. Provides guidelines for detecting duplicates, fixing misclassified files, and removing stale information from memory storage.
<table align="center"><tr><td>
                          ●      ●

                        ▄████◣◢████▄
                     ◥▄████▀▀  ▀▀████▄◤
                        ▀▀        ▀▀

 _____ ___ ___ __  __ ___ _  _ _   _ ___     _  _____ ___    _
|_   _| __| _ \  \/  |_ _| \| | | | / __|___| |/ /_ _| _ \  /_\
  | | | _||   / |\/| || || .` | |_| \__ \___| ' < | ||   / / _ \
  |_| |___|_|_\_|  |_|___|_|\_|\___/|___/   |_|\_\___|_|_\/_/ \_\
</td></tr></table> <p align="center"> <p align="center"> A smarter agent harness for <a href="https://github.com/laude-institute/terminal-bench">Terminal-Bench</a>, built on <a href="https://github.com/laude-institute/terminal-bench">Terminus 2</a> with native tool calling <br/> <em>Simple fixes, significant gains.</em> </p> <p align="center"> <img src="https://img.shields.io/badge/Codex_5.3-75.5%25-blue?style=for-the-badge" alt="Codex 5.3: 75.5%"> <img src="https://img.shields.io/badge/Opus_4.6-75.7%25-blueviolet?style=for-the-badge" alt="Opus 4.6: 75.7%"> <img src="https://img.shields.io/badge/Gemini_3.1_Pro-74.8%25-orange?style=for-the-badge" alt="Gemini 3.1 Pro: 74.8%"> </p> </p>

Changelog

VersionDescription
v1.1Migrated from In-Context Learning (ICL) to native tool calling via LLM tools parameter. Removed verbose JSON/XML response format instructions from system prompt — the model now receives structured tool definitions directly, resulting in a significantly shorter prompt and more reliable outputs.
v1.0Initial release. Fork of Terminus 2 with ICL-based JSON response parsing and full response format instructions in the system prompt.

Key Features

  • Native Tool Calling — Replaces ICL JSON/XML parsing with the LLM tools parameter for structured, reliable outputs
  • Image Analysis (Multimodal)image_read tool for base64-encoded image analysis directly from the terminal
  • Marker-based Polling — Early command completion detection using echo markers, cutting unnecessary wait time
  • Block Timeout Protection — 10-minute timeout on infrastructure API calls to prevent indefinite hangs
  • Smart Completion Verification — Double-confirmation checklist covering requirements, robustness, and multi-perspective QA (test engineer, QA engineer, user)
  • Prompt Caching — Anthropic ephemeral caching on recent messages to reduce latency and cost

Architecture

Terminus-KIRA extends Terminus 2 by replacing its ICL (In-Context Learning) response parsing with native LLM tool calling.

Tool definitions passed via the tools parameter:

ToolPurpose
execute_commandsRun shell commands with analysis and plan
task_completeSignal task completion (triggers double-confirmation)
image_readAnalyze image files via base64 multimodal input

How it works:

  1. Calls litellm.acompletion directly with tools=TOOLS, bypassing the base Chat class to access native tool calling
  2. The model returns structured tool calls instead of free-form text — no regex/JSON parsing needed
  3. On context window overflow, automatically summarizes conversation history and retries
  4. Marker-based polling appends echo '__CMDEND__<seq>__' after each command; if the marker appears before the requested duration, execution moves on immediately

Evolution

Key milestones from development history:

#MilestoneDescription
1GenesisCopy of Terminus 2 as starting point
2Native Tool UseReplaced ICL JSON/XML parsing with LLM tools parameter
3Output Limiting30 KB cap on terminal output to prevent context bloat
4Autonomy & ConstraintsPrompt engineering for agent autonomy and environment constraints
5Completion ConfirmationInclude original instruction in completion check
6Multimodalimage_read tool for visual analysis of terminal screenshots
7Completion ChecklistMulti-perspective QA checklist (test engineer, QA, user)
8Execution OptimizationMarker-based polling and block timeout protection
9Temperature FixSet temperature to 1 when using reasoning effort

Usage

uv run harbor run \
    --dataset terminal-bench-sample@2.0 \
    --n-tasks 1 \
    --agent-import-path "terminus_kira.terminus_kira:TerminusKira" \
    --model anthropic/claude-opus-4-6 \
    --env docker \
    -n 1

For more details, visit our blog post.


Project Structure

├── terminus_kira/
│   ├── __init__.py
│   └── terminus_kira.py        # Main agent (native tool calling)
├── prompt-templates/
│   └── terminus-kira.txt        # System prompt
├── run-scripts/
│   ├── run_docker.sh            # Local Docker execution
│   ├── run_daytona.sh           # Daytona cloud execution
│   └── run_runloop.sh           # Runloop cloud execution
├── anthropic_caching.py         # Prompt caching utility
└── pyproject.toml

Citing Us

If you found Terminus-KIRA useful, please cite us as:

@misc{terminuskira2026,
      title={Terminus-KIRA: Boosting Frontier Model Performance on Terminal-Bench with Minimal Harness },
      author={{KRAFTON AI} and {Ludo Robotics}},
      year={2026},
      url={https://github.com/krafton-ai/kira},
}

KRAFTON AI & Ludo Robotics

Skills Info
Original Name:slack-memory-cleanupAuthor:krafton