Raw Record Source

{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreibgvw3vuc53huvhsztilmrjcnd5glxga7qmlh7ffqtrmdcsx6k4yy",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mejhi7z6os32"
  },
  "path": "/t/how-are-you-deploying-models-without-inference-providers/172965#post_6",
  "publishedAt": "2026-02-10T15:16:20.000Z",
  "site": "https://discuss.huggingface.co",
  "textContent": "We’ve seen a similar split in practice. A lot of models without attached providers end up being used either locally (llama.cpp / Ollama / LM Studio) or served by teams on their own infrastructure once they move past experimentation.\n\nIn many cases the lack of a default provider is intentional. It gives teams flexibility to deploy based on their own cost, latency, and control requirements rather than a one-size-fits-all endpoint.",
  "title": "How are you deploying models without inference providers?"
}