500 Internal Server Error with Ollama
Hey guys!
I’m new here to HF and trying to utilize local LLMs in general. I have been playing around with Claude Code using Ollama. It’s been working fine but I just installed a second GPU so am wanting to play with more models. I’m new to using HF to download. I’m trying to run the following model in Ollama and ultimately Claude Code:
huggingface.co
Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF · Hugging Face
We’re on a journey to advance and democratize artificial intelligence through open source and open science.
I used the command provided by the ‘use this model’ drop down to run Ollama and pull the manifest.
ollama run ``hf.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF:Q8_0
It downloaded fine, and I can confirm the file is in the directory and named as the model is trying to call. However, I’m still getting this error when I try to launch Ollama with it.
Error loading model: 500 Internal Server Error: unable to load model: /usr/share/ollama/.ollama/models/blobs/sha256-9093fa002d4e64576b0724bd67010de8f87d5ec284fa71d8394fc47e2d9c65be
I’m on Ubuntu 24.04 and have two RTX 4090s so it shouldn’t be a size problem. Does anyone have any ideas on what to look for?
Thank you!
Discussion in the ATmosphere