mlx-stt

ClawSkills 作者 guoqiao v1.0.7

Speech-To-Text with MLX (Apple Silicon) and opensource models (default GLM-ASR-Nano-2512) locally.

安装 / 下载方式

TotalClaw CLI推荐
totalclaw install clawskills:clawskills~guoqiao-mlx-stt
cURL直接下载,无需登录
curl -fsSL https://skills.taituai.com/api/skills/clawskills%3Aclawskills~guoqiao-mlx-stt/file -o guoqiao-mlx-stt.md
# MLX STT

Speech-To-Text/ASR/Transcribe with MLX (Apple Silicon) and opensource models (default GLM-ASR-Nano-2512) locally.

Free and Accurate. No api key required. No server required.

## Requirements

- `mlx`: macOS with Apple Silicon
- `brew`: used to install deps if not available

## Installation

```bash
bash ${baseDir}/install.sh
```
This script will use `brew` to install these cli tools if not available:
- `ffmpeg`: convert audio format when needed
- `uv`: install python package and run python script
- `mlx_audio`: do the real job

## Usage

To transcribe an audio file, run this script:

```bash
bash  ${baseDir}/mlx-stt.sh <audio_file_path>
```

- First run could be a little slow, since it will need to download model.
- The transcript result will be printed to stdout.