Train models with Hugging Face TRL: supervised fine-tuning (SFT), direct preference optimization (DPO), group relative policy optimization (GRPO), and GGUF conversion.

When to use this skill

Fine-tuning a model with SFT
Running DPO training
Converting a model to GGUF

Core concepts

This skill provides guidance on hf model training best practices, patterns, and common pitfalls. It is designed to be loaded on demand when a relevant task is detected.

Installation

curl -LO https://opencode-skills.example/downloads/ai-ml/hf-training.zip
unzip hf-training.zip -d ~/.config/opencode/skills/

Restart OpenCode — the skill loads automatically.

When it triggers

fine-tuning a model with SFT
running DPO training
converting a model to GGUF

HF Model Training

When to use this skill

Core concepts

Installation

When it triggers