siliconflow-tts-gen
Text-to-Speech using SiliconFlow API (CosyVoice2). Supports multiple voices, languages, and dialects.
安装 / 下载方式
TotalClaw CLI推荐
totalclaw install clawskills:clawskills~lilei0311-siliconflow-tts-gencURL直接下载,无需登录
curl -fsSL https://skills.taituai.com/api/skills/clawskills%3Aclawskills~lilei0311-siliconflow-tts-gen/file -o lilei0311-siliconflow-tts-gen.md# SiliconFlow TTS Generation Skill Text-to-Speech using SiliconFlow API with CosyVoice2 model. Supports 8 preset voices, multiple languages, and Chinese dialects. ## Features - 🎙️ **8 Preset Voices**: 4 male + 4 female voices - 🌍 **Multilingual**: Chinese, English, Japanese, Korean - 🗣️ **Chinese Dialects**: Cantonese, Sichuan, Shanghai, Tianjin, Wuhan - ⚡ **Ultra Low Latency**: 150ms first packet delay - 🎵 **Voice Cloning**: 3-second rapid voice cloning - 💾 **Auto Download**: Saves audio files locally ## Requirements - **Environment Variable**: `SILICONFLOW_API_KEY` - **Optional Config File**: `~/.openclaw/openclaw.json` (for auto-detect) ## Installation ```bash npx clawhub install siliconflow-tts-gen ``` ## Configuration Set your SiliconFlow API key: ```bash export SILICONFLOW_API_KEY="your-api-key" ``` ## Usage ### List Available Voices ```bash python3 scripts/generate.py --list-voices ``` ### Generate Speech ```bash # Basic usage (default voice: alex) python3 scripts/generate.py "你好,世界" # Specify voice python3 scripts/generate.py "Hello World" --voice bella # Adjust speed python3 scripts/generate.py "你好" --voice claire --speed 0.9 # Save to file python3 scripts/generate.py "欢迎收听" --output welcome.mp3 # Change format python3 scripts/generate.py "Hello" --format wav ``` ## Available Voices ### Male Voices | ID | Name | Characteristic | |----|------|----------------| | alex | 沉稳男声 | Mature and steady | | benjamin | 低沉男声 | Deep and low | | charles | 磁性男声 | Magnetic | | david | 欢快男声 | Cheerful | ### Female Voices | ID | Name | Characteristic | |----|------|----------------| | anna | 沉稳女声 | Mature and elegant | | bella | 激情女声 | Passionate | | claire | 温柔女声 | Gentle and kind | | diana | 欢快女声 | Sweet and happy | ## Parameters | Parameter | Type | Default | Range | Description | |-----------|------|---------|-------|-------------| | `--voice` | string | alex | - | Voice ID | | `--speed` | float | 1.0 | 0.25-4.0 | Speech speed | | `--format` | string | mp3 | mp3/opus/wav/pcm | Output format | | `--output` | string | output.mp3 | - | Output file path | ## Security Notes - This skill requires an API key to call SiliconFlow services - The script reads `~/.openclaw/openclaw.json` only to auto-detect API keys - No sensitive data is transmitted except to `api.siliconflow.cn` - Review the code at `scripts/generate.py` before providing credentials ## Author MaxStorm Team ## License MIT