{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreielb7uz3whv7n3z2hscyqnpt2i4giolgkujg7qifbq3lbo4hck7le",
"uri": "at://did:plc:wwyqal4cnqhuwyacdj7rqq3n/app.bsky.feed.post/3mecctjmf36m2"
},
"path": "/t/bayesian-predictive-projection-for-variable-selection/28620#post_13",
"publishedAt": "2026-02-04T08:16:57.000Z",
"site": "https://discourse.datamethods.org",
"tags": [
"a github issue"
],
"textContent": "Projection is based on minimizing the KL-divergence from the reference model predictive distribution to the constrained model predictive distribution for each reference posterior draw separately. For many data model distributions, this is equivalent or can be approximated with optimization of the constrained model parameters given mean of the reference model prediction for each reference model posterior draw. In case of joint missing data imputation, each reference model posterior draw includes draw from the missing data distribution, too. The optimization approach to minimize KL does not work well for these latent data parameters. We could approximate by keeping the latent data parameters as fixed, and optimize only other parameters, but then this would be the same as using multiple imputation approach, which is a big task to add as discussed in a github issue.\n\nIn theory that KL-projection could be done for any model so that instead of doing the projection draw-by-draw, we would directly optimize to find the full projected distribution, but that task so complicated that we don’t know how to do it in reasonable computation time.",
"title": "Bayesian predictive projection for variable selection"
}