Skill library: 37 skills
optimizing-attention-flash
Optimizes transformer attention with Flash Attention for 2-4x speedup and 10-20x memory reduction. Use when training/run
faiss
Facebook's library for efficient similarity search and clustering of dense vectors. Supports billions of vectors, GPU ac
clip
OpenAI's model connecting vision and language. Enables zero-shot image classification, image-text matching, and cross-mo
chroma
Open-source embedding database for AI applications. Store embeddings and metadata, perform vector and full-text search,
huggingface-accelerate
Simplest distributed training API. 4 lines to add distributed support to any PyTorch script. Unified API for DeepSpeed/F
segment-anything-model
SAM: zero-shot image segmentation via points, boxes, masks.
audiocraft-audio-generation
AudioCraft: MusicGen text-to-music, AudioGen text-to-sound.
serving-llms-vllm
vLLM: high-throughput LLM serving, OpenAI API, quantization.
obliteratus
OBLITERATUS: abliterate LLM refusals (diff-in-means).
llama-cpp
llama.cpp local GGUF inference + HF Hub model discovery.
huggingface-hub
HuggingFace hf CLI: search/download/upload models, datasets.
weights-and-biases
W&B: log ML experiments, sweeps, model registry, dashboards.
evaluating-llms-harness
lm-eval-harness: benchmark LLMs (MMLU, GSM8K, etc.).