Agent Skill
2/7/2026

switchailocal

Unified LLM proxy for AI agents. Route all model requests through http://localhost:18080/v1. Provides FREE access to Gemini CLI, Claude CLI, Codex, and Vibe via your existing subscriptions. Use when: (1) making LLM calls using provider prefixes, (2) switching between CLI/Local/Cloud providers, (3) needing to attach local files/folders to prompts via CLI, (4) requiring intelligent routing between models, or (5) needing to monitor provider health and analytics.

traylinx
npx skills add traylinx/switchAILocal

SKILL.md


switchAILocal Proxy

Unified LLM proxy for AI agents. Always use http://localhost:18080/v1 as your base URL.

The killer feature: use the CLI subscriptions you already pay for (Gemini Pro, Claude Pro, etc.) through the API. These requests are FREE in the sense that they add no per-token charge on top of the subscription.


Quick Start

1. Make a request (FREE with CLI)

curl http://localhost:18080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
    "model": "geminicli:",
    "messages": [{"role": "user", "content": "Hello!"}]
  }'

2. Configure Python Client

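from openai import OpenAI

# "geminicli:" with no model name selects the provider's default model.
client = OpenAI(base_url="http://localhost:18080/v1", api_key="sk-test-123")
response = client.chat.completions.create(model="geminicli:", messages=[{"role": "user", "content": "Hi!"}])
print(response.choices[0].message.content)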

πŸ—ΊοΈ Skill Files

FileDescription
SKILL.md (this file)Core workflow and endpoint reference
references/routing.mdIntelligent routing and matrix setup
references/multimodal.mdVision and image processing
references/examples.mdReal-world agentic use cases
references/management-api.mdFull Monitoring & Operations API
references/steering.mdConditional routing rules
references/hooks.mdAutomation and event hooks
references/memory.mdAnalytics and history

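⚠️ Critical: Model Format

NEVER use bare model names. The format is ALWAYS provider: or provider:model. The only unprefixed name in this document is auto, the intelligent-routing entry.

| ❌ Wrong | ✅ Correct | Why |
|----------|-----------|-----|
| gemini-2.5-pro | geminicli:gemini-2.5-pro | Needs provider prefix |
| claude-3-5-sonnet | claudecli: | claudecli: uses the default model |
| llama3 | ollama:llama3 | Needs provider prefix |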
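
The format rule can be checked on the client side before a request is sent. The following is a minimal sketch, assuming the prefixes listed in this document are the complete set; `KNOWN_PREFIXES` and `is_valid_model` are illustrative names, not part of switchAILocal.

```python
# Hypothetical helper: the prefix set is copied from this document and may be incomplete.
KNOWN_PREFIXES = {"geminicli", "claudecli", "codex", "vibe", "ollama", "switchai", "groq"}


def is_valid_model(model: str) -> bool:
    """Return True for 'provider:' or 'provider:model' with a known provider, or for 'auto'."""
    if model == "auto":  # the router entry is the only name listed without a prefix
        return True
    provider, sep, _name = model.partition(":")
    # A bare name such as "llama3" has no colon, so sep is empty and the check fails.
    return sep == ":" and provider in KNOWN_PREFIXES
```

With this sketch, `is_valid_model("geminicli:")` and `is_valid_model("ollama:llama3")` pass, while `is_valid_model("gemini-2.5-pro")` is rejected.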

πŸ—οΈ Provider Reference

1. CLI Providers (FREE!)

Uses your human's CLI subscriptions. Best for agents.

| Prefix | CLI | Subscription Required |
|--------|-----|-----------------------|
| geminicli: | gemini | Google AI Premium/Pro |
| claudecli: | claude | Claude Pro/Max |
| codex: | codex | OpenAI Plus |
| vibe: | vibe | Mistral Le Chat |

2. Local & Cloud

| Prefix | Source | Cost |
|--------|--------|------|
| ollama: | Local Ollama | FREE |
| auto | Local Cortex | FREE (requires plugin) |
| switchai: | Traylinx Cloud | Per-token |
| groq: | Groq Cloud | Per-token |

🚀 Core Features

CLI Attachments & Flags

Pass local context and control autonomy via CLI extensions.

{
  "model": "geminicli:",
  "messages": [{"role": "user", "content": "Fix this code"}],
  "extra_body": {
    "cli": {
      "attachments": [{"type": "folder", "path": "./src"}],
      "flags": {"auto_approve": true, "yolo": true}
    }
  }
}
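
The same request body can be assembled in Python. This is a sketch that mirrors the JSON above field for field; `build_cli_request` is a hypothetical helper, and defaulting `auto_approve` to off is a choice made here, not documented behavior.

```python
def build_cli_request(prompt, folder, model="geminicli:", auto_approve=False):
    """Build a chat-completions body with one folder attachment, mirroring the JSON above."""
    return {
        "model": model,
        "messages": [{"role": "user", "content": prompt}],
        "extra_body": {
            "cli": {
                "attachments": [{"type": "folder", "path": folder}],
                # Flag name taken from the example above; off by default in this sketch.
                "flags": {"auto_approve": auto_approve},
            }
        },
    }
```

Note that in the OpenAI Python client, `extra_body` is a keyword argument of `create()` whose keys are merged into the request JSON, so `extra_body={"cli": {...}}` would send `cli` at the top level. Whether the proxy expects `cli` there or nested under a literal `extra_body` key, as the JSON above shows, is not stated in this file, so verify against the server before relying on either form.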

Streaming

Add "stream": true to any request for SSE token streaming.
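
A client has to reassemble the streamed tokens. The sketch below assumes the proxy emits OpenAI-style SSE chunks (`data: {...}` lines with `choices[].delta.content`, terminated by `data: [DONE]`); this file only says "SSE token streaming", so treat the chunk shape as an assumption. `iter_stream_text` is an illustrative name.

```python
import json


def iter_stream_text(lines):
    """Yield text fragments from an iterable of SSE lines (str)."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank separators and non-data fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":  # assumed OpenAI-style end-of-stream marker
            break
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = choice.get("delta", {}).get("content")
            if text:
                yield text
```

Feeding the function the decoded lines of a streaming HTTP response and joining the yielded fragments gives the full reply text.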


🌲 Decision Tree

What do you need?
├─ FREE + Powerful + Files
│   └─ CLI Providers (geminicli:, claudecli:)
├─ FREE + Private + Fast
│   └─ Local Ollama (ollama:llama3.2)
├─ Ultra-Fast Production
│   └─ Groq Cloud (groq:llama-3.3-70b)
└─ I don't know, you pick
    └─ Intelligent Routing (auto)
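
The tree can be written as a small selection function. This is a sketch only: `pick_model` and its parameter names are hypothetical, and the top-to-bottom priority when several needs apply at once is a choice made here, not something the tree specifies.

```python
def pick_model(needs_files=False, private=False, low_latency=False):
    """Map the decision tree above to a model string; checks run top to bottom."""
    if needs_files:
        return "geminicli:"          # CLI provider with its default model
    if private:
        return "ollama:llama3.2"     # local Ollama
    if low_latency:
        return "groq:llama-3.3-70b"  # Groq Cloud
    return "auto"                    # no stated preference: let the router pick
```

For example, `pick_model(private=True)` returns `"ollama:llama3.2"`, and calling it with no arguments falls through to `"auto"`.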

πŸ› οΈ Troubleshooting & Best Practices

ProblemFix
Connection errorCheck if server is running on port 18080
Model not foundEnsure you used the provider: prefix
401 UnauthorizedCheck API key in config.yaml
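
For the connection-error case, a quick TCP probe tells you whether anything is listening on the port. This is a minimal sketch using only the standard library; `is_port_open` is an illustrative helper, and a successful connection shows that some process holds the port, not that it is switchAILocal.

```python
import socket


def is_port_open(host="127.0.0.1", port=18080, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds within the timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:  # covers connection refused, timeouts, and unreachable hosts
        return False
```

If `is_port_open()` returns False, start the server before retrying the request.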

Best Practices

  1. Prefer CLI Providers: They are free and support file attachments.
  2. Check Status: Use GET /v1/providers to see what is active.
  3. Use auto: For simple tasks, let the router pick the best model.
  4. Local for Privacy: Use ollama: for confidential data.
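
The status check in item 2 could be scripted as follows. This is a sketch under stated assumptions: bearer-token authentication is assumed because the Quick Start passes an `api_key` to the OpenAI client, and the response shape of `/v1/providers` is not described in this file, so the body is returned undecoded beyond JSON parsing. `providers_request` and `fetch_providers` are hypothetical names.

```python
import json
import urllib.request

BASE_URL = "http://localhost:18080/v1"


def providers_request(api_key, base_url=BASE_URL):
    """Build (but do not send) a GET request for the provider status endpoint."""
    return urllib.request.Request(
        f"{base_url}/providers",
        headers={"Authorization": f"Bearer {api_key}"},  # bearer auth is an assumption
        method="GET",
    )


def fetch_providers(api_key, timeout=5.0):
    """Send the request and return the parsed JSON body, whatever its shape."""
    with urllib.request.urlopen(providers_request(api_key), timeout=timeout) as resp:
        return json.load(resp)
```

Calling `fetch_providers("sk-test-123")` requires a running server; see references/management-api.md for the actual fields.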

Route wisely. Save tokens. Use CLI. 🚀

Skills Info
Original Name: switchailocal
Author: traylinx