datasets: Dataset wrong checksum in download

Short description Cannot load properly camelyon dataset.

Reproduction instructions

pcam, pcam_info = tfds.load("patch_camelyon", with_info=True)

Output Error

 raise NonMatchingChecksumError(resource.url, tmp_path)
tensorflow_datasets.core.download.download_manager.NonMatchingChecksumError: Artifact https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_train_x.h5.gz, downloaded to /home/luigi/tensorflow_datasets/downloads/zeno.org_reco_2546_file_came_leve_2_spli_trbOY5E-X_2wontwSZI65F3fEMTim90H_hpA4Swq0obJw.gz.tmp.5ca1a06c29894e8c8c543733097e6702/camelyonpatch_level_2_split_train_x.h5.gz, has wrong checksum.

Expected behavior It should load all datasets from Zenodo but i think some urls have changed.

Additional context Here are all of the URLs and checksums: Name

camelyonpatch_level_2_split_test_meta.csv

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_test_meta.csv?download=1

md5:3455fd69135b66734e1008f3af684566

camelyonpatch_level_2_split_test_x.h5.gz

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_test_x.h5.gz?download=1

md5:d8c2d60d490dbd479f8199bdfa0cf6ec

camelyonpatch_level_2_split_test_y.h5.gz

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_test_y.h5.gz?download=1

md5:60a7035772fbdb7f34eb86d4420cf66a

camelyonpatch_level_2_split_train_meta.csv

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_train_meta.csv?download=1

md5:5a3dd671e465cfd74b5b822125e65b0a

camelyonpatch_level_2_split_train_x.h5.gz

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_train_x.h5.gz?download=1

md5:1571f514728f59376b705fc836ff4b63

camelyonpatch_level_2_split_train_y.h5.gz

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_train_y.h5.gz?download=1

md5:35c2d7259d906cfc8143347bb8e05be7

camelyonpatch_level_2_split_valid_meta.csv

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_valid_meta.csv?download=1

md5:67589e00a4a37ec317f2d1932c7502ca

camelyonpatch_level_2_split_valid_x.h5.gz

https://zenodo.org/record/2546921/files/camelyonpatch_level_2_split_valid_x.h5.gz?download=1

md5:d5b63470df7cfa627aeec8b9dc0c066e

About this issue

  • Original URL
  • State: closed
  • Created 4 years ago
  • Comments: 15 (10 by maintainers)

Most upvoted comments

Could it be some kind of server error zenodo side sending the wrong checksum?

I don’t think it is server error.

Just guessing 😁