azure-ai-transcription-py

TotalClaw, author: totalclaw

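Azure AI Transcription SDK for Python. Use it for real-time and batch speech-to-text transcription with timestamps and diarization. Triggers: "transcription", "speech to text", "Azure AI Transcription", "TranscriptionClient".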

Installation / download options

TotalClaw CLI (recommended):
totalclaw install totalclaw:totalclaw~thegovind-azure-ai-transcription-py
cURL (direct download, no login required):
curl -fsSL https://skills.taituai.com/api/skills/totalclaw%3Atotalclaw~thegovind-azure-ai-transcription-py/file -o thegovind-azure-ai-transcription-py.md
## Original text

# Azure AI Transcription SDK for Python

Client library for Azure AI Transcription (speech-to-text) with real-time and batch transcription.

## Installation

```bash
pip install azure-ai-transcription
```

## Environment Variables

```bash
TRANSCRIPTION_ENDPOINT=https://<resource>.cognitiveservices.azure.com
TRANSCRIPTION_KEY=<your-key>
```
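
A minimal sketch of reading these two settings in Python and failing early when one is missing. The helper name `load_transcription_settings` is hypothetical and not part of the SDK; only the two variable names come from the block above.

```python
import os


def load_transcription_settings(env=None):
    """Return (endpoint, key), raising if either variable is unset or empty.

    Hypothetical helper for illustration; it is not part of the SDK.
    """
    env = os.environ if env is None else env
    missing = [
        name
        for name in ("TRANSCRIPTION_ENDPOINT", "TRANSCRIPTION_KEY")
        if not env.get(name)
    ]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    # Strip a trailing slash so the endpoint has one canonical form.
    return env["TRANSCRIPTION_ENDPOINT"].rstrip("/"), env["TRANSCRIPTION_KEY"]
```

Passing a plain dictionary as `env` makes the helper easy to exercise in tests without touching the real process environment.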

## Authentication

Use subscription key authentication (DefaultAzureCredential is not supported for this client):

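```python
import os

from azure.ai.transcription import TranscriptionClient
from azure.core.credentials import AzureKeyCredential

# Azure SDK clients expect a credential object, so wrap the raw key
# in AzureKeyCredential instead of passing the string directly.
client = TranscriptionClient(
    endpoint=os.environ["TRANSCRIPTION_ENDPOINT"],
    credential=AzureKeyCredential(os.environ["TRANSCRIPTION_KEY"]),
)
```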

## Transcription (Batch)

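```python
# Assumes `client` is the TranscriptionClient created in the Authentication section.
job = client.begin_transcription(
    name="meeting-transcription",
    locale="en-US",
    content_urls=["https://<storage>/audio.wav"],
    diarization_enabled=True,  # label each phrase with its speaker
)
result = job.result()  # wait for the batch job to finish
print(result.status)
```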

## Transcription (Real-time)

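```python
# Assumes `client` is the TranscriptionClient created in the Authentication section.
stream = client.begin_stream_transcription(locale="en-US")
stream.send_audio_file("audio.wav")
# Print each recognition event as it arrives.
for event in stream:
    print(event.text)
```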
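
When audio is produced faster than it can be sent, a bounded queue is one common way to apply backpressure. The sketch below uses only the standard library; `stream_with_backpressure` and its `handle_chunk` callback are hypothetical stand-ins for whatever actually forwards audio to the service, not SDK calls.

```python
import queue
import threading

_DONE = object()  # sentinel marking the end of the audio stream


def stream_with_backpressure(chunks, handle_chunk, max_pending=8):
    """Feed `chunks` to `handle_chunk` through a bounded queue.

    `handle_chunk` is a hypothetical callable standing in for the code that
    sends audio onward; it is not an SDK method.
    """
    pending = queue.Queue(maxsize=max_pending)
    errors = []

    def consume():
        while True:
            chunk = pending.get()
            if chunk is _DONE:
                return
            if errors:
                continue  # keep draining so the producer never blocks forever
            try:
                handle_chunk(chunk)
            except Exception as exc:
                errors.append(exc)

    worker = threading.Thread(target=consume, daemon=True)
    worker.start()
    for chunk in chunks:
        # put() blocks while the queue is full, which slows the producer down.
        pending.put(chunk)
    pending.put(_DONE)
    worker.join()
    if errors:
        raise errors[0]
```

A small `max_pending` keeps memory bounded at the cost of stalling the producer sooner; the right value depends on chunk size and network latency.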

## Best Practices

1. **Enable diarization** when multiple speakers are present
2. **Use batch transcription** for long files stored in blob storage
3. **Capture timestamps** for subtitle generation
4. **Specify language** to improve recognition accuracy
5. **Handle streaming backpressure** for real-time transcription
6. **Close transcription sessions** when complete
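
As an illustration of the timestamp practice above, here is a minimal sketch that turns timestamped phrases into SubRip (SRT) subtitle text. The `(start_ms, end_ms, text)` tuple shape and both function names are assumptions made for this example; the fields of the real transcription result would need to be mapped onto them.

```python
def format_srt_time(ms):
    """Format milliseconds as an SRT timestamp, HH:MM:SS,mmm."""
    hours, rest = divmod(int(ms), 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, millis = divmod(rest, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d},{millis:03d}"


def phrases_to_srt(phrases):
    """Build SRT text from (start_ms, end_ms, text) tuples.

    The tuple shape is an assumption for illustration, not the SDK's
    result type.
    """
    blocks = []
    for index, (start_ms, end_ms, text) in enumerate(phrases, start=1):
        timing = f"{format_srt_time(start_ms)} --> {format_srt_time(end_ms)}"
        blocks.append(f"{index}\n{timing}\n{text.strip()}\n")
    # Cues are separated by a blank line.
    return "\n".join(blocks)
```

For example, `phrases_to_srt([(0, 1500, "Hello")])` yields a single cue numbered 1 that runs from `00:00:00,000` to `00:00:01,500`.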