Skip to content

Datapackage should have some tests with the whole content of datasets #63

@femtotrader

Description

@femtotrader

Hello,

I think that Datapackage should have some tests with the whole content of
https://github.com/datasets/

for example I noticed manually that:

datapackage.DataPackage("https://raw.githubusercontent.com/datasets/gold-prices/master/")

raises

ValueError: License type is missing

maybe we should expect to have set it to None
(see #64 )

This kind of tests could be disabled for Continuous Integration (because that's quite big to download, and quite long to process).

This kind of tests should only be run with a cache mechanism like what I requested in
#61 and what I implemented in #62

A list of Datapackages is available at https://github.com/datasets/registry
but it's not readable using trickvi/datapackage see

datapackage.DataPackage("https://raw.githubusercontent.com/datasets/registry/master/")

it raises

HTTPError: HTTP Error 404: Not Found

because there is no datapackage.json see datasets/commons#112

An other related issue is that datapackage is not able to accept GitHub repository.
#65 and (unfortunately) https://github.com/datasets/registry contains only GitHub repository, not URL to directory which contains datapackage.json or URL to datapackage.json (see datasets/commons#113 )

Kind regards

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions