{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreiesqqjael6dtifq6pfmcouq2r566lwhnvwepm3fm6jqg4lyugv3ha",
"uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mifsdvmz7zr2"
},
"path": "/t/getting-you-have-exceeded-your-pro-gpu-quota-error-even-with-credits-loaded/174850#post_2",
"publishedAt": "2026-04-01T02:48:06.000Z",
"site": "https://discuss.huggingface.co",
"tags": [
"Hugging Face",
"Gradio",
"Hugging Face Forums",
"@spaces.GPU"
],
"textContent": "We need a little more information to determine whether that is some kind of misunderstanding or an error on the HF side…\n\n* * *\n\n## Most likely cause\n\nYou were probably using a **ZeroGPU Space** , while the extra money you added was going into **Inference Providers** billing, which is a different product bucket. Hugging Face’s docs say purchased credits and pay-as-you-go apply to **Inference Providers** when requests are routed through Hugging Face, while **ZeroGPU** has its own daily quota system. Public HF docs do **not** describe bought credits as a way to extend ZeroGPU daily minutes. (Hugging Face)\n\nIn plain terms, “I have PRO and extra credits” does **not** automatically mean “this ZeroGPU Space can keep running after I hit the daily ZeroGPU limit.” For ZeroGPU, the current documented quota is **25 minutes per day for PRO** , and it resets **exactly 24 hours after your first GPU usage** , not at midnight. (Hugging Face)\n\n## Why this is confusing\n\nHF sells several things that look similar from the user side but are enforced differently underneath:\n\n * **PRO** increases your ZeroGPU quota and queue priority.\n * **Inference Providers credits** are monthly credits plus pay-as-you-go for routed inference.\n * **Spaces compute** is a separate compute service category in billing. (Hugging Face)\n\n\n\nSo if you saw wording that made it sound like extra credits would “kick in” after daily GPU usage, that wording is at least **misleading for ZeroGPU use** , because the public docs describe credits for Inference Providers and quotas for ZeroGPU as separate mechanisms. I cannot prove from the public docs whether that was a UI bug, stale copy, or context-specific message, but the documentation does not support “credits extend ZeroGPU daily quota.” (Hugging Face)\n\n## Second most likely cause\n\nYour request may not have been recognized as coming from your **logged-in PRO identity**. HF’s Spaces API docs say authenticated requests consume **your account’s** ZeroGPU quota, while unauthenticated requests use a stricter shared pool. The docs list the current ZeroGPU daily quotas as **2 minutes unauthenticated** , **3.5 minutes free** , and **25 minutes PRO**. (Hugging Face)\n\nThis matters because ZeroGPU rate limiting is tied to request identity. Gradio’s ZeroGPU client docs explain that requests missing the `X-IP-Token` header are treated as unauthenticated. HF’s Spaces API docs also show that when one app calls another ZeroGPU Space, you may need to forward the user’s auth context, including `x-ip-token`. (Gradio)\n\nThat is why these cases often fail even for PRO users:\n\n * opening the raw `.hf.space` link instead of the Space’s Hub page,\n * using incognito,\n * using strict cookie or privacy settings,\n * calling the Space through an API client or proxy without proper auth forwarding,\n * calling a ZeroGPU Space from another Space without forwarding the identity token. (Hugging Face Forums)\n\n\n\n## Third likely cause\n\nThe specific Space may simply be **expensive per run**. ZeroGPU Spaces can request longer runtimes with `@spaces.GPU(duration=...)`, and `xlarge` uses **2×** the daily quota of `large`. The default runtime window is **60 seconds** , but a Space can ask for more. That means a run can be rejected even when you still have some quota left, because the Space is asking for more seconds than you have remaining. (Hugging Face)\n\nSo one Space can fail fast while another still works, even on the same account. That is not necessarily a platform bug. It can just be the Space’s runtime settings and GPU size. (Hugging Face)\n\n## Real bugs and HF-side issues also exist\n\nThere have been genuine platform-side problems. HF staff stated in March 2025 that they had **changed how ZeroGPU quotas work** and that some PRO users were **negatively impacted**. There are also reports from 2025 and 2026 where users had active PRO status and visible quota on the billing page but still got `0s left` or wrong quota behavior across multiple Spaces. (Hugging Face Forums)\n\nThere was also a documented bug where **ZeroGPU + non-SSR apps** could fall back to **IP-based quotas** , which would ignore logged-in PRO quotas. HF staff said that was fixed by upgrading Gradio. So some cases are not user error and not billing confusion. They are app-side or platform-side auth bugs. (Hugging Face)\n\n## My best diagnosis for a case like yours\n\nIf the pattern is “I got PRO, added extra credits, and still immediately saw a Pro GPU quota error on a Space,” then my ranking is:\n\n 1. **Most likely:** credits were added to the wrong billing bucket for this use case. ZeroGPU quota was still the actual limiter. (Hugging Face)\n 2. **Next:** the Space request was treated as guest or IP-based instead of PRO-authenticated. (Hugging Face)\n 3. **Then:** the Space itself requests more time or more expensive GPU usage than your remaining quota allows. (Hugging Face)\n 4. **Also possible:** your account quota state was out of sync on HF’s side. (Hugging Face Forums)\n 5. **Less likely unless it is one Space only:** the Space is outdated or misconfigured. (Hugging Face)\n\n\n\n## What to do\n\n### If you are just using someone else’s Space\n\nOpen the Space from its **Hub page while logged in** , not from the raw `.hf.space` subdomain. Avoid incognito. Temporarily disable aggressive cookie or tracking blockers for that test. Community reports show `.hf.space` direct access and incognito can behave like unauthenticated use. (Hugging Face Forums)\n\nThen test **two or three unrelated ZeroGPU Spaces**. If only one fails, suspect that specific Space’s runtime settings or app bug. If **all** ZeroGPU Spaces fail while your billing page shows PRO status and remaining minutes, suspect an **HF-side quota sync issue** or auth attribution problem. (Hugging Face Forums)\n\nAlso, do **not** rely on purchased credits to extend ZeroGPU minutes. If you need more runtime than the daily ZeroGPU pool allows, the clean alternatives are:\n\n * use **Inference Providers** directly for pay-as-you-go inference,\n * or use a **paid GPU Space** instead of ZeroGPU. HF documents those as separate compute/billing paths. (Hugging Face)\n\n\n\n### If you own the Space or call it programmatically\n\nMake sure the actual inference call is authenticated, not just your account check elsewhere. HF’s Spaces API docs show using `Authorization: Bearer ...` for Space calls, and Gradio’s docs explain why missing request identity causes fallback to unauthenticated rate limits. (Hugging Face)\n\nIf your app calls another ZeroGPU Space, forward `x-ip-token`. If you use an older Gradio setup, upgrade it. HF staff explicitly linked a real ZeroGPU auth bug to older app behavior and said upgrading Gradio restores logged-in quotas instead of IP-based fallback. (Gradio)\n\nIf the Space is expensive, reduce `duration` where possible and avoid `size=\"xlarge\"` unless it is truly needed. Shorter durations improve queue behavior, and `xlarge` burns quota twice as fast. (Hugging Face)\n\n## The short conclusion\n\nMy strongest guess is **not** “the credits failed to activate.” It is:\n\n * **you hit a ZeroGPU quota that extra credits do not extend** , or\n * **the request was not being counted under your PRO identity**. (Hugging Face)\n\n\n\nA real HF-side bug is also possible, but I would move that higher **only** if you are logged in correctly, opening Spaces from the Hub page, and **multiple unrelated ZeroGPU Spaces** still instantly report `0s left` or similar nonsense. (Hugging Face Forums)",
"title": "Getting \"You have exceeded your Pro GPU quota\" error even with credits loaded"
}