Skip to content

kiloword dataset download fails (hash mismatch) #13441

@hoechenberger

Description

@hoechenberger

Description of the problem

Reported at https://mne.discourse.group/t/kiloword-dataset-not-downloading-md5-hash-mismatch/11533

The kiloword dataset cannot be downloaded.

Steps to reproduce

import mme
mne.datasets.kiloword.data_path(force_update=True)

Link to data

No response

Expected results

Dataset is downloaded

Actual results

>>> mne.datasets.kiloword.data_path(force_update=True)
Using default location ~/mne_data for kiloword...
Downloading file 'MNE-kiloword-data.tar.gz' from 'https://osf.io/qkvf9/download?version=1' to '/Users/richardhochenberger/mne_data'.
0.00B [00:00, ?B/s]
Failed to download 'MNE-kiloword-data.tar.gz'. Will attempt the download again 2 more times.
0.00B [00:00, ?B/s]
Failed to download 'MNE-kiloword-data.tar.gz'. Will attempt the download again 1 more time.
0.00B [00:00, ?B/s]
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "<decorator-gen-327>", line 12, in data_path
  File "/private/tmp/mne-test/.venv/lib/python3.12/site-packages/mne/datasets/kiloword/kiloword.py", line 49, in data_path
    return _download_mne_dataset(
           ^^^^^^^^^^^^^^^^^^^^^^
  File "/private/tmp/mne-test/.venv/lib/python3.12/site-packages/mne/datasets/utils.py", line 260, in _download_mne_dataset
    fetch_dataset(
  File "/private/tmp/mne-test/.venv/lib/python3.12/site-packages/mne/datasets/_fetch.py", line 267, in fetch_dataset
    raise ValueError(
ValueError: MD5 hash of downloaded file (MNE-kiloword-data.tar.gz) does not match the known hash: expected md5:3a124170795abbd2e48aae8727e719a8 but got c5428b925680b6063e1b6724e4097852. Deleted download for safety. The downloaded file may have been corrupted or the known hash may be outdated. Consider using force_update=True to force the dataset to be downloaded again.
>>>

Additional information

>>> mne.sys_info()
Platform             macOS-26.0.1-arm64-arm-64bit
Python               3.12.10 (main, Apr  9 2025, 03:49:38) [Clang 20.1.0 ]
Executable           /private/tmp/mne-test/.venv/bin/python3
CPU                  Apple M2 Pro (10 cores)
Memory               32.0 GiB

Core
├☑ mne               1.10.1 (latest release)
├☑ numpy             2.3.3 (unknown linalg bindings (threadpoolctl module not found: No module named 'threadpoolctl'))
├☑ scipy             1.16.2
└☑ matplotlib        3.10.7 (backend=macosx)

Numerical (optional)
└☐ unavailable       sklearn, numba, nibabel, nilearn, dipy, openmeeg, cupy, pandas, h5io, h5py

Visualization (optional)
└☐ unavailable       pyvista, pyvistaqt, vtk, qtpy, ipympl, pyqtgraph, mne-qt-browser, ipywidgets, trame_client, trame_server, trame_vtk, trame_vuetify

Ecosystem (optional)
└☐ unavailable       mne-bids, mne-nirs, mne-features, mne-connectivity, mne-icalabel, mne-bids-pipeline, neo, eeglabio, edfio, mffpy, pybv

Metadata

Metadata

Assignees

No one assigned

    Labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions