{
"$type": "site.standard.document",
"bskyPostRef": {
"cid": "bafyreiehysm6ayilkyxcrjbvsdrgshtpo2hbshdy5df2w7szgpvy6347iu",
"uri": "at://did:plc:pgryn3ephfd2xgft23qokfzt/app.bsky.feed.post/3mjmwpb2o3g22"
},
"path": "/t/dataset-viewer-broke-after-repo-rename/175327#post_1",
"publishedAt": "2026-04-16T17:25:52.000Z",
"site": "https://discuss.huggingface.co",
"textContent": "Hi,\n\nAfter renaming my dataset repository, the dataset viewer began failing with the following error, although it worked before the rename. Also, the internally generated `refs/convert/parquet` branch that had previously been created by the parquet-converter bot is now missing after the rename.\n\n`The full dataset viewer is not available (click to read why). Only showing a preview of the rows.`\n\n\n Error code: DatasetGenerationError\n Exception: IndexError\n Message: list index out of range\n Traceback: Traceback (most recent call last):\n File \"/usr/local/lib/python3.12/site-packages/datasets/builder.py\", line 1904, in _prepare_split_single\n original_shard_lengths[original_shard_id] += len(table)\n ~~~~~~~~~~~~~~~~~~~~~~^^^^^^^^^^^^^^^^^^^\n IndexError: list index out of range\n\n The above exception was the direct cause of the following exception:\n\n Traceback (most recent call last):\n File \"/src/services/worker/src/worker/job_runners/config/parquet_and_info.py\", line 1342, in compute_config_parquet_and_info_response\n parquet_operations, partial, estimated_dataset_info = stream_convert_to_parquet(\n ^^^^^^^^^^^^^^^^^^^^^^^^^^\n File \"/src/services/worker/src/worker/job_runners/config/parquet_and_info.py\", line 907, in stream_convert_to_parquet\n builder._prepare_split(split_generator=splits_generators[split], file_format=\"parquet\")\n File \"/usr/local/lib/python3.12/site-packages/datasets/builder.py\", line 1739, in _prepare_split\n for job_id, done, content in self._prepare_split_single(\n ^^^^^^^^^^^^^^^^^^^^^^^^^^^\n File \"/usr/local/lib/python3.12/site-packages/datasets/builder.py\", line 1925, in _prepare_split_single\n raise DatasetGenerationError(\"An error occurred while generating the dataset\") from e\n datasets.exceptions.DatasetGenerationError: An error occurred while generating the dataset\n",
"title": "Dataset viewer broke after repo rename"
}