{
  "$type": "site.standard.document",
  "bskyPostRef": {
    "cid": "bafyreig7hpkzemhkfz4hszvnrcipbvdno6gah6kznniv6pd4p24sknp6be",
    "uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mgvo7x6nlxt2"
  },
  "path": "/t/valueerror-loading-helsinki-nlp-tokenizers/174192#post_4",
  "publishedAt": "2026-03-12T13:02:44.000Z",
  "site": "https://discuss.huggingface.co",
  "tags": [
    "174223"
  ],
  "textContent": "Thank you. Yes, I did get the sentencepiece error when I initially switched to MarianTokenizer.from_pretrained(), but all hunky dory once I installed it.\n\nNow my issue is that these models don’t work with DataParallel anymore, but that is another issue (174223)(throws a StopTermination attempting to access self.model.device in its bowels). I may have to hand-jam my own threads for parallel inference (I had to do that for the ModernBERT models a while back, so code reuse is king!).",
  "title": "ValueError loading Helsinki-NLP tokenizers"
}