External Publication

500 Internal Server Error with Ollama

Hugging Face Forums [Unofficial] March 28, 2026

Hey guys!

I’m new here to HF and trying to utilize local LLMs in general. I have been playing around with Claude Code using Ollama. It’s been working fine but I just installed a second GPU so am wanting to play with more models. I’m new to using HF to download. I’m trying to run the following model in Ollama and ultimately Claude Code:

huggingface.co

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF · Hugging Face

We’re on a journey to advance and democratize artificial intelligence through open source and open science.

I used the command provided by the ‘use this model’ drop down to run Ollama and pull the manifest.

ollama run ``hf.co/Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF:Q8_0

It downloaded fine, and I can confirm the file is in the directory and named as the model is trying to call. However, I’m still getting this error when I try to launch Ollama with it.

Error loading model: 500 Internal Server Error: unable to load model: /usr/share/ollama/.ollama/models/blobs/sha256-9093fa002d4e64576b0724bd67010de8f87d5ec284fa71d8394fc47e2d9c65be

I’m on Ubuntu 24.04 and have two RTX 4090s so it shouldn’t be a size problem. Does anyone have any ideas on what to look for?

Thank you!

Jackrong/Qwen3.5-27B-Claude-4.6-Opus-Reasoning-Distilled-v2-GGUF · Hugging Face

Discussion in the ATmosphere