{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreia5fcgbk4bv3k3key6a3rz3lhgja4ih4jazknrtwc2xvg37kfvkae",
"uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mhh5p35xwox2"
},
"path": "/t/train-and-evaluation-loss-drop-between-epochs/174409#post_1",
"publishedAt": "2026-03-19T20:13:57.000Z",
"site": "https://discuss.huggingface.co",
"textContent": "Hi everyone.\n\nI’m training a LoRA adapter for a model using the SFT trainer and also set an evaluation set. In my case, **the train and evaluation sets are completely unrelated**.\n\nWhat I observe is a drop in loss between the epochs. It makes perfect sense for the train loss. the model already saw the examples, thus the loss is expected to drop. But I can’t reason about why is there a drop in the evaluation loss, as the examples are not related and there is no sign of contamination.\n\nAny ideas and advice would be highly appreciated, thank you in advance.\n\nAdding the train/eval loss from W&B, I’ve ran 2 training epochs in this session on 4 GPUs on the same node.",
"title": "Train and Evaluation loss drop between epochs"
}