{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreictxd2krpmm2uqhdvvyraznlchz5gyl5jiruqo73jalaphqg3urna",
"uri": "at://did:plc:lk3jfj3zq4k4wxnk474axylu/app.bsky.feed.post/3mlune3k2hkf2"
},
"path": "/t/reinforcement-fine-tuning-using-gpt-4-1-mini/1380920#post_3",
"publishedAt": "2026-05-15T05:36:25.000Z",
"site": "https://community.openai.com",
"textContent": "Thank you so much for the detailed explanation, _j! This really cleared things up.\n\nJust to add a bit of context, this is part of a research project Im working on, and I was exploring RFT as an alternative approach to see if it could yield different results for my use case. Your note about RFT being specifically designed around the inaccessibility of reasoning traces is really insightful and helps me understand why it may not be the right fit here.\n\nI’ll continue exploring the supervised fine-tuning path for gpt-4.1. Really appreciate you taking the time.",
"title": "Reinforcement Fine Tuning using gpt-4.1-mini"
}