{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreigwh4mlydxwi6lagn72rveymkevx4ug2x4s3cs5cr4xxinymyurha",
    "uri": "at://did:plc:5sgu76a53rz3n6unbykmovqy/app.bsky.feed.post/3mlodh6xmv762"
  },
  "description": "Fine-tuning is the process of taking a pretrained model and continuing to train it on a target dataset, so its weights adapt to a specific task, domain, style, or output format. It is the standard way to customise a model when prompting alone is not enough.\n\n\nVariants\n\n * Full fine-tuning. Updates every parameter. Maximum capacity, maximum cost.\n * LoRA (Low-Rank Adaptation). Trains a small low-rank update on top of frozen weights. Fast, cheap, and the result is a small adapter file.\n * QLoRA. L",
  "path": "/engineering-glossary/fine-tuning-language-models/",
  "publishedAt": "2026-05-12T17:39:36.000Z",
  "site": "https://sahilkapoor.com",
  "tags": [
    "RAG",
    "Embeddings"
  ],
  "textContent": "**Fine-tuning** is the process of taking a pretrained model and continuing to train it on a target dataset, so its weights adapt to a specific task, domain, style, or output format. It is the standard way to customise a model when prompting alone is not enough.\n\n## Variants\n\n  * **Full fine-tuning.** Updates every parameter. Maximum capacity, maximum cost.\n  * **LoRA (Low-Rank Adaptation).** Trains a small low-rank update on top of frozen weights. Fast, cheap, and the result is a small adapter file.\n  * **QLoRA.** LoRA on a quantised base model, enabling fine-tuning of large models on a single GPU.\n  * **Instruction tuning.** Fine-tuning on instruction-response pairs to make a base model follow instructions.\n  * **RLHF and DPO.** Aligning model outputs with human preferences, used in modern chat models.\n\n\n\nšŸ”—\n\n**Related Terms**\nRAG, Embeddings.",
  "title": "Fine-tuning",
  "updatedAt": "2026-05-13T19:15:20.259Z"
}