{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreiaotmzlwwemob2r4qdnatmgovbzcevvb7pwh2rniec3s4ywhpll24",
"uri": "at://did:plc:wwyqal4cnqhuwyacdj7rqq3n/app.bsky.feed.post/3mi75j35dr7x2"
},
"path": "/t/best-model-reduction-approach-for-fine-gray-prediction-model/28678#post_6",
"publishedAt": "2026-03-27T11:27:38.000Z",
"site": "https://discourse.datamethods.org",
"tags": [
"here"
],
"textContent": "Yes, as described here data reduction (unsupervised learning) involves reducing the number of variables to model in a way that is completely masked to Y. That way you don’t create model uncertainty / overfit, and interpretation can be enhanced by not trying to separate collinear variable. My favorite default method is sparse PCA for which there are two examples in those RMS notes. The one that is near the end of the ebook involves nonlinear sparse PCA.\n\nVariable selection methods (other than ones based solely on subject matter knowledge), even when smartly using shrinkage as in _lasso_ are unlikely to be stable and will miss important variables.",
"title": "Best model reduction approach for Fine-Gray prediction model"
}