Agent Skill
2/7/2026

qwen3-audio

High-performance audio library for Apple Silicon with text-to-speech (TTS) and speech-to-text (STT).

npx skills add skills-router/skills-store

SKILL.md

Name: qwen3-audio
Description: High-performance audio library for Apple Silicon with text-to-speech (TTS) and speech-to-text (STT).

Skills Store

A collection of reusable skills for AI assistants.

Available Skills

free-resource

Search and download royalty-free images, videos, sound effects, and music from Pixabay, Freesound, and Jamendo.

Features:

  • Pixabay: Images and videos search & download
  • Freesound: Audio effects search & download
  • Jamendo: Music/BGM search & download

Quick Start:

cd skills/free-resource
cp config.example.json config.json
# Edit config.json with your API keys
bun ./scripts/pixabay.ts search-images --query "nature"
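
The setup steps above could be scripted roughly as follows. This is a minimal sketch, not part of the skill: the helper names `ensure_config` and `build_search_images_command` are hypothetical, and only the file names and the `search-images --query` invocation come from the Quick Start. API keys still have to be added to config.json by hand.

```python
import shutil
from pathlib import Path


def ensure_config(skill_dir):
    """Copy config.example.json to config.json unless config.json already exists.

    Hypothetical helper; an existing config.json (with real API keys)
    is never overwritten. Returns the path to config.json.
    """
    skill_dir = Path(skill_dir)
    config = skill_dir / "config.json"
    if not config.exists():
        shutil.copyfile(skill_dir / "config.example.json", config)
    return config


def build_search_images_command(query):
    """Return the argv list for the documented Pixabay image search."""
    return ["bun", "./scripts/pixabay.ts", "search-images", "--query", query]
```

The returned list can be passed to `subprocess.run(..., cwd=skill_dir)`, since the script path is relative to the skill directory.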

qwen3-audio

High-performance audio processing library optimized for Apple Silicon.

Features:

  • Text-to-Speech (TTS) with voice cloning
  • Speech-to-Text (STT) / Automatic Speech Recognition

Prerequisites:

  • Python 3.10+

Quick Start:

uv run --project "/<qwen-audio-skill-path>" python "./scripts/qwen-audio.py" tts --text "hello world" --output "output.wav"
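
When calling the command above from another program, it can help to build the argument list explicitly so that text containing spaces or quotes is passed through safely. The sketch below assumes only the `tts`, `--text`, and `--output` arguments shown in the Quick Start; `build_tts_command` and its parameters are hypothetical, and `skill_path` stands in for the `<qwen-audio-skill-path>` placeholder.

```python
import shlex


def build_tts_command(skill_path, text, output="output.wav"):
    """Return the argv list for the documented `tts` invocation.

    Hypothetical helper: `skill_path` replaces the
    <qwen-audio-skill-path> placeholder from the Quick Start.
    """
    return [
        "uv", "run", "--project", str(skill_path),
        "python", "./scripts/qwen-audio.py",
        "tts", "--text", text, "--output", output,
    ]


if __name__ == "__main__":
    # Example path only; substitute the real skill location.
    cmd = build_tts_command("/path/to/qwen-audio", "hello world")
    # Print a shell-quoted version of the command for inspection.
    print(shlex.join(cmd))
```

To execute it, the list could be handed to `subprocess.run(cmd, check=True, cwd=skill_path)`; the working directory matters because the script path is relative.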

Structure

skills-store/
├── skills/
│   ├── free-resource/     # Royalty-free media search
│   │   ├── SKILL.md       # Skill documentation
│   │   ├── scripts/       # CLI tools
│   │   └── config.json    # API keys configuration
│   └── qwen-audio/        # Audio processing (TTS/STT)
│       ├── SKILL.md       # Skill documentation
│       ├── scripts/       # Python scripts
│       └── .venv/         # Python virtual environment
└── LICENSE

License

MIT

Skills Info
Original Name: qwen3-audio
Author: skills