技能库 177 个
memento-flashcards
Spaced-repetition flashcard system. Create cards from facts or text, chat with flashcards using free-text answers graded
here.now
Publish static sites to {slug}.here.now and store private files in cloud Drives for agent-to-agent handoff.
canvas
Canvas LMS integration — fetch enrolled courses and assignments using API token authentication.
whisper
OpenAI's general-purpose speech recognition model. Supports 99 languages, transcription, translation to English, and lan
unsloth
Unsloth: 2-5x faster LoRA/QLoRA fine-tuning, less VRAM.
fine-tuning-with-trl
TRL: SFT, DPO, PPO, GRPO, reward modeling for LLM RLHF.
axolotl
Axolotl: YAML LLM fine-tuning (LoRA, DPO, GRPO).
distributed-llm-pretraining-torchtitan
Provides PyTorch-native distributed LLM pretraining using torchtitan with 4D parallelism (FSDP2, TP, PP, CP). Use when p
tensorrt-llm
Optimizes LLM inference with NVIDIA TensorRT for maximum throughput and lowest latency. Use for production deployment on
stable-diffusion-image-generation
State-of-the-art text-to-image generation with Stable Diffusion models via HuggingFace Diffusers. Use when generating im
slime-rl-training
Provides guidance for LLM post-training with RL using slime, a Megatron+SGLang framework. Use when training GLM models,
simpo-training
Simple Preference Optimization for LLM alignment. Reference-free alternative to DPO with better performance (+6.4 points
sparse-autoencoder-training
Provides guidance for training and analyzing Sparse Autoencoders (SAEs) using SAELens to decompose neural network activa
qdrant-vector-search
High-performance vector similarity search engine for RAG and semantic search. Use when building production RAG systems r
pytorch-lightning
High-level PyTorch framework with Trainer class, automatic distributed training (DDP/FSDP/DeepSpeed), callbacks system,
dspy
DSPy: declarative LM programs, auto-optimize prompts, RAG.
pytorch-fsdp
Expert guidance for Fully Sharded Data Parallel training with PyTorch FSDP - parameter sharding, mixed precision, CPU of
pinecone
Managed vector database for production AI applications. Fully managed, auto-scaling, with hybrid search (dense + sparse)
peft-fine-tuning
Parameter-efficient fine-tuning for LLMs using LoRA, QLoRA, and 25+ methods. Use when fine-tuning large models (7B-70B)
nemo-curator
GPU-accelerated data curation for LLM training. Supports text/image/video/audio. Features fuzzy deduplication (16× faste
modal-serverless-gpu
Serverless GPU cloud platform for running ML workloads. Use when you need on-demand GPU access without infrastructure ma
llava
Large Language and Vision Assistant. Enables visual instruction tuning and image-based conversations. Combines CLIP visi
lambda-labs-gpu-cloud
Reserved and on-demand GPU cloud instances for ML training and inference. Use when you need dedicated GPU instances with
instructor
Extract structured data from LLM responses with Pydantic validation, retry failed extractions automatically, parse compl