{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreibgvw3vuc53huvhsztilmrjcnd5glxga7qmlh7ffqtrmdcsx6k4yy",
"uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mejhi7z6os32"
},
"path": "/t/how-are-you-deploying-models-without-inference-providers/172965#post_6",
"publishedAt": "2026-02-10T15:16:20.000Z",
"site": "https://discuss.huggingface.co",
"textContent": "We’ve seen a similar split in practice. A lot of models without attached providers end up being used either locally (llama.cpp / Ollama / LM Studio) or served by teams on their own infrastructure once they move past experimentation.\n\nIn many cases the lack of a default provider is intentional. It gives teams flexibility to deploy based on their own cost, latency, and control requirements rather than a one-size-fits-all endpoint.",
"title": "How are you deploying models without inference providers?"
}