{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreiapzkex4ovxwxhv3wp5hs6zeiy45zglm26rv464i4aijzbdukdi74",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mo7owcpmes52"
  },
  "path": "/t/slopsome-com-a-free-vram-fit-calculator-real-tokens-sec-database-for-local-llms/176771#post_1",
  "publishedAt": "2026-06-14T00:38:46.000Z",
  "site": "https://discuss.huggingface.co",
  "tags": [
    "slopsome.com",
    "slopsome — Will It Fit? - a Hugging Face Space by NexAIGuy"
  ],
  "textContent": "Hey all,\n\nI built slopsome.com to answer the question I kept re-deriving by hand: will model X run on GPU Y at quant Q with a Z-token context, and how fast?\n\nIt’s a search engine for LLM + GPU stats: a VRAM fit-calculator (fits in VRAM / with offload / multi-GPU / won’t fit + estimated tok/s), real measured throughput, and side-by-side compares of open-weight and API models (params, quant sizes, min VRAM, benchmarks, cost). Built for the GGUF / llama.cpp / Ollama / vLLM crowd. Free, no signup, sourced data (no invented numbers).\n\nThere’s also an open read-only API and a small HF Space demo. Feedback very welcome - wrong numbers, missing models/GPUs, features you’d want.\n\nTry the demo Space: slopsome — Will It Fit? - a Hugging Face Space by NexAIGuy",
  "title": "Slopsome.com - a free VRAM fit-calculator + real tokens/sec database for local LLMs"
}