Agent Skill
clawdsense
2/7/2026
Real-time image analysis from ClawdSense ESP32 dongle. Monitors media receiver, captures photos from device, analyzes instantly with Groq Vision. Use when ClawdSense sends photos via /photo command or button controls. Provides instant room analysis, occupancy detection, and environmental awareness.
ypyt1
npx skills add YPYT1/All-skills
SKILL.md
ClawdSense Skill
Real-time image capture and analysis from ClawdSense ESP32 dongle.
Quick Start
Start Services
# Terminal 1: Media receiver (accepts photo uploads from ESP32)
node ~/clawd/clawdsense-skill/scripts/media-receiver.js
# Terminal 2: Analyzer (monitors inbound folder, analyzes with Groq)
node ~/clawd/clawdsense-skill/scripts/analyzer.js
# Terminal 3: Health monitor (keeps both services alive)
node ~/clawd/clawdsense-skill/scripts/health-monitor.js
Usage
- Send the /photo command to ClawdSense via Telegram
- Device captures the photo and POSTs it to the media receiver (port 5555)
- Analyzer detects the new photo and analyzes it with Groq Vision
- Results are printed to the console
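To exercise this flow without the device, the upload step can be emulated from a workstation. The following is a minimal sketch, assuming Node 18+ (for the global fetch, FormData, and Blob) and assuming the receiver reads the file from a form field named "photo"; the field name and both function names are hypothetical and not taken from the firmware or media-receiver.js.

```javascript
// Sketch: emulate the device's photo upload from a workstation (Node 18+).
// ASSUMPTION: the receiver reads the file from a form field named "photo".

const RECEIVER_URL = 'http://localhost:5555'; // port given in this document

// Builds the multipart request without sending it, so it can be inspected.
function buildPhotoUpload(jpegBytes, filename = 'test.jpg') {
  const form = new FormData();
  form.append('photo', new Blob([jpegBytes], { type: 'image/jpeg' }), filename);
  return {
    url: `${RECEIVER_URL}/inbound/photo`,
    // No Content-Type header here: fetch adds the multipart boundary itself.
    options: { method: 'POST', body: form },
  };
}

// Sends the upload and resolves to the HTTP status code of the response.
async function sendTestPhoto(jpegBytes) {
  const { url, options } = buildPhotoUpload(jpegBytes);
  const res = await fetch(url, options);
  return res.status;
}
```

With the media receiver running, `sendTestPhoto(require('node:fs').readFileSync('sample.jpg')).then(console.log)` should print the response status, and the analyzer should then pick the file up on its next poll.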
Architecture
Three Components
Media Receiver (port 5555)
- Accepts multipart/form-data uploads from ESP32
- Stores photos in ~/.clawdbot/media/inbound/
- Endpoints:
  - POST /inbound/photo - JPEG photos
  - POST /inbound/audio - WAV audio
  - POST /inbound/video - AVI video
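The endpoint list above amounts to a small routing table. A minimal sketch of how a request could be mapped to a destination file follows; it is not the code in media-receiver.js, and the `<kind>-<timestamp><ext>` naming scheme and the `resolveUpload` helper are assumptions made for illustration.

```javascript
const os = require('node:os');
const path = require('node:path');

// Inbound folder and endpoints as listed in this document.
const INBOUND_DIR = path.join(os.homedir(), '.clawdbot', 'media', 'inbound');
const ROUTES = new Map([
  ['/inbound/photo', { kind: 'photo', ext: '.jpg' }],
  ['/inbound/audio', { kind: 'audio', ext: '.wav' }],
  ['/inbound/video', { kind: 'video', ext: '.avi' }],
]);

// Returns the destination path for an upload, or null when the request
// does not match one of the three POST endpoints.
// ASSUMPTION: the "<kind>-<timestamp><ext>" file name is hypothetical.
function resolveUpload(method, urlPath, timestampMs = Date.now()) {
  const route = ROUTES.get(urlPath);
  if (method !== 'POST' || !route) return null;
  return path.join(INBOUND_DIR, `${route.kind}-${timestampMs}${route.ext}`);
}
```

Keeping the table in a Map means an unknown path simply returns null, which a server can translate into a 404 response.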
Analyzer (real-time polling)
- Polls inbound folder every 500ms
- Detects new photos automatically
- Sends to Groq Vision API for analysis
- Uses the pixtral-12b model for fast results
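The polling step can be sketched as a directory scan that remembers which files it has already reported. This is an illustration under assumptions, not the contents of analyzer.js: the function names are hypothetical, only .jpg and .jpeg files are treated as photos, and files already present at startup are ignored.

```javascript
const fs = require('node:fs');
const path = require('node:path');

const PHOTO_EXTENSIONS = new Set(['.jpg', '.jpeg']);

// Returns the paths of photos in `dir` that were not reported by an
// earlier call, and records their names in the `seen` set.
function findNewPhotos(dir, seen) {
  const fresh = [];
  for (const name of fs.readdirSync(dir).sort()) {
    const isPhoto = PHOTO_EXTENSIONS.has(path.extname(name).toLowerCase());
    if (isPhoto && !seen.has(name)) {
      seen.add(name);
      fresh.push(path.join(dir, name));
    }
  }
  return fresh;
}

// Calls `onPhoto` for each new file every `intervalMs` (500 ms matches the
// interval in this document). Returns the timer so the caller can stop it.
// ASSUMPTION: files that exist before startup are skipped.
function startPolling(dir, onPhoto, intervalMs = 500) {
  const seen = new Set(fs.readdirSync(dir));
  return setInterval(() => findNewPhotos(dir, seen).forEach(onPhoto), intervalMs);
}
```

A scan like this can observe a file while the receiver is still writing it, so a real analyzer may want the receiver to write to a temporary name and rename the file once the upload completes.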
Health Monitor
- Checks both services every 30s
- Restarts if either dies
- Logs status to console
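One monitoring pass can be sketched as "probe each service, restart the ones that are down, log the result". The sketch below assumes Node 18+ and injects the probe and restart steps as functions; `checkOnce`, `httpProbe`, and the `healthUrl` property are hypothetical names, and how health-monitor.js actually probes the analyzer (which exposes no port in this document) is not specified.

```javascript
// One monitoring pass: probe every service and restart those that are down.
// `isAlive` and `restart` are injected hooks (hypothetical, for illustration).
async function checkOnce(services, isAlive, restart, log = console.log) {
  const restarted = [];
  for (const service of services) {
    let alive = false;
    try {
      alive = await isAlive(service);
    } catch {
      alive = false; // a failed probe counts as DOWN
    }
    log(`${service.name} is ${alive ? 'UP' : 'DOWN'}`);
    if (!alive) {
      await restart(service);
      restarted.push(service.name);
    }
  }
  return restarted;
}

// HTTP probe for a service with a health endpoint, e.g. the media
// receiver's http://localhost:5555/health. Rejects on timeout or refusal.
async function httpProbe(service, timeoutMs = 2000) {
  const res = await fetch(service.healthUrl, {
    signal: AbortSignal.timeout(timeoutMs),
  });
  return res.ok;
}
```

Scheduling the pass with `setInterval(() => checkOnce(services, httpProbe, restart), 30000)` would match the 30s interval described above, where `restart` might spawn the corresponding script with `child_process.spawn`.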
Performance
- Detection latency: ~500ms (polling interval)
- Analysis time: 1-3s (Groq API)
- Total end-to-end: ~2-5s from capture to results
Configuration
ESP32 Firmware Settings
Device must be configured with:
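MEDIA_RECEIVER_URL = "http://localhost:5555"
or for public: "https://your-ngrok-url"
Note: localhost only reaches the receiver when requests originate on the machine running it. For a device on the network, use that machine's LAN address or the public URL.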
Groq API Key
Stored in environment:
export GROQ_API_KEY="<GROQ_API_KEY_REDACTED>"
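As an illustration of how the key is consumed, the sketch below builds a vision request for Groq's OpenAI-compatible chat completions endpoint and fails early when GROQ_API_KEY is missing. The prompt text and the `buildVisionRequest` helper are assumptions, not the analyzer's actual code; the model name is the one given in this document and should be checked against the models available to your account.

```javascript
const GROQ_CHAT_URL = 'https://api.groq.com/openai/v1/chat/completions';

// Builds (but does not send) a vision request for one JPEG image.
// ASSUMPTION: the default prompt is hypothetical.
function buildVisionRequest(
  jpegBytes,
  apiKey = process.env.GROQ_API_KEY,
  prompt = 'Describe this room and estimate how many people are in it.'
) {
  if (!apiKey) throw new Error('GROQ_API_KEY is not set');
  // Images are passed inline as a base64 data URL.
  const base64 = Buffer.from(jpegBytes).toString('base64');
  return {
    url: GROQ_CHAT_URL,
    options: {
      method: 'POST',
      headers: {
        Authorization: `Bearer ${apiKey}`,
        'Content-Type': 'application/json',
      },
      body: JSON.stringify({
        model: 'pixtral-12b', // model name as stated in this document
        messages: [{
          role: 'user',
          content: [
            { type: 'text', text: prompt },
            { type: 'image_url', image_url: { url: `data:image/jpeg;base64,${base64}` } },
          ],
        }],
      }),
    },
  };
}
```

The request can then be sent with `const { url, options } = buildVisionRequest(bytes); const res = await fetch(url, options);`, and a non-2xx status at that point usually indicates a key, quota, or model problem.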
Troubleshooting
"Media receiver is DOWN"
- Check if port 5555 is in use
- Restart:
node ~/clawd/clawdsense-skill/scripts/media-receiver.js
"No new photos detected"
- Is device sending to media receiver? Check device logs
- Is the media receiver running? Run curl http://localhost:5555/health
- Check inbound folder permissions
"Groq API errors"
- Verify API key is set
- Check account quota/billing
References
- See references/groq-vision-api.md for Groq setup
- See references/esp32-setup.md for device configuration
Skills Info
Original Name: clawdsense
Author: ypyt1