
Autoeval config #4234

Merged May 5, 2022
lhoestq merged 13 commits into huggingface:master from nazneenrajani:autoeval
Conversation

@nazneenrajani

Added autoeval config to imdb as pilot

@nazneenrajani nazneenrajani requested review from lewtun and lhoestq April 27, 2022 05:32
@HuggingFaceDocBuilderDev

HuggingFaceDocBuilderDev commented Apr 27, 2022

The documentation is not available anymore as the PR was closed or merged.

@lewtun
Member

@lewtun lewtun left a comment Apr 27, 2022

Thanks for kicking off the metadata additions for evaluation @NRajani 🚀 ! I've left a few small nits, but otherwise this LGTM!

metrics:
  - type: accuracy
  - type: f1
    name: f1_macro

Perhaps it's better to use a "pretty name" here (and the other F1 metrics)?

Suggested change:
-    name: f1_macro
+    name: F1 macro
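For context, a full `train-eval-index` entry with a pretty metric name might look like the sketch below. The top-level field names follow the spec discussed later in this thread, but the values and the exact `splits` sub-keys are illustrative guesses, not necessarily the YAML that was merged for imdb:

```yaml
train-eval-index:
- config: plain_text
  task: text-classification
  task_id: binary_classification
  splits:
    train: train
    eval: test
  col_mapping:
    text: text
    label: target
  metrics:
  - type: accuracy
    name: Accuracy
  - type: f1
    name: F1 macro
```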

@lewtun
Member

lewtun commented Apr 27, 2022

The tests are failing due to the changed metadata:

got an unexpected keyword argument 'train-eval-index'

I think you can fix this by updating the DatasetMetadata class and implementing an appropriate validate_train_eval_index() function.

@lhoestq we are working with an arbitrary set of tags for the autoeval config. See https://github.com/huggingface/autonlp-backend/issues/414
I still need to add a validator function for the tests to pass, but our tag set is not well-defined like the ones under https://github.com/huggingface/datasets/tree/master/src/datasets/utils/resources. What's a workaround for this?

@lewtun
Member

lewtun commented May 3, 2022

On the question of validating the train-eval-index metadata, I think the simplest approach would be to validate that the required fields exist and not worry about their values (which are open-ended).

For me, the required fields include:

  • config
  • task
  • task_id
  • splits (train / validation / eval)
  • col_mapping
  • metrics (checking that each one has type, name)

Here I'm using the spec defined in https://github.com/huggingface/autonlp-backend/issues/414 as a guide.

WDYT @lhoestq ?
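The required-fields check proposed above could be sketched roughly as follows. This is a hypothetical illustration of the idea (validate presence, ignore open-ended values), not the validator that was actually merged in this PR:

```python
# Hypothetical sketch of the required-fields validation proposed above;
# not the actual code merged in this PR.
REQUIRED_FIELDS = {"config", "task", "task_id", "splits", "col_mapping", "metrics"}


def validate_train_eval_index(train_eval_index):
    """Check that each train-eval-index entry carries the required fields."""
    if not isinstance(train_eval_index, list):
        raise TypeError("train-eval-index must be a list")
    for entry in train_eval_index:
        missing = REQUIRED_FIELDS - entry.keys()
        if missing:
            raise ValueError(f"train-eval-index entry is missing fields: {sorted(missing)}")
        # Values are open-ended, but each metric needs at least `type` and `name`.
        for metric in entry["metrics"]:
            if not {"type", "name"} <= metric.keys():
                raise ValueError("each metric must define 'type' and 'name'")
```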

@lhoestq
Member

lhoestq commented May 4, 2022

Makes sense ! Currently the metadata type validator doesn't support subfields - let me open a PR to add it

lhoestq and others added 2 commits May 4, 2022 15:51
- support YAML keys with dashes
- add train-eval-index validation
@lhoestq
Member

lhoestq commented May 4, 2022

I ended up improving the metadata validation in this PR x)

In particular:

  • I added support for YAML keys with dashes instead of underscores for train-eval-index
  • I added train-eval-index validation via validate_train_eval_index. It does nothing fancy; it just checks that the field is a list if it exists in the YAML, but feel free to improve it.

Let me know if it sounds good to you ! I think we can improve validate_train_eval_index in another PR
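The minimal behavior described here (only check that the field, when present, is a list) amounts to something like the sketch below. This is a hypothetical paraphrase, not the exact implementation from the PR:

```python
# Minimal sketch of the behavior described above: only verify that
# `train-eval-index`, when present in the YAML, is a list.
# Hypothetical code, not the exact implementation from the PR.
def validate_train_eval_index(train_eval_index):
    if train_eval_index is not None and not isinstance(train_eval_index, list):
        raise TypeError(
            f"train-eval-index must be a list, got {type(train_eval_index).__name__}"
        )
```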

@lewtun lewtun left a comment

Thanks for adding a special validator for train-eval-index @lhoestq ! I think this approach looks great 🤗

@lhoestq
Member

lhoestq commented May 4, 2022

Come on windows... I didn't do anything advanced...

Anyway, will try to fix this when I get back home x)

@lewtun
Member

lewtun commented May 4, 2022

> Come on windows... I didn't do anything advanced...
>
> Anyway, will try to fix this when I get back home x)

Hehe, thanks!

@nazneenrajani
Author

Thanks, @lhoestq this is great!

@lhoestq
Member

lhoestq commented May 5, 2022

Did I just fix it for windows and now it fails on linux ? xD

@lewtun
Member

lewtun commented May 5, 2022

> Did I just fix it for windows and now it fails on linux ? xD

Looks like the Heisenberg uncertainty principle is at play here - you cannot simultaneously have unit tests passing in both Linux and Windows 😅

@lhoestq
Member

lhoestq commented May 5, 2022

The worst is that the tests pass locally both on my windows and my linux x)

@lhoestq
Member

lhoestq commented May 5, 2022

Ok fixed it, the issue came from Python 3.6, which doesn't return the expected `__origin__` for `Dict` and `List` types
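For background on that incompatibility: on Python 3.7+ the runtime `__origin__` of a typing generic is the plain builtin container, whereas on 3.6 it was the typing alias itself, which breaks naive identity checks. A version-tolerant lookup could look like this hypothetical helper (an illustration of the pitfall, not the PR's actual fix):

```python
from typing import Dict, List

# On Python 3.7+, Dict[str, int].__origin__ is the builtin `dict`;
# on 3.6 it was `typing.Dict`, so checks like `tp.__origin__ is dict`
# silently fail there. Hypothetical version-tolerant helper:
def origin_of(tp):
    origin = getattr(tp, "__origin__", None)
    # Map 3.6-style typing aliases back to the builtins they wrap.
    if origin is Dict:
        return dict
    if origin is List:
        return list
    return origin
```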

@lhoestq lhoestq left a comment

Alright thanks for adding the first Autoeval config ! :D

@lhoestq lhoestq merged commit 6af556b into huggingface:master May 5, 2022
@lewtun
Member

lewtun commented May 5, 2022

> Alright thanks for adding the first Autoeval config ! :D

Woohoo! Thank you so much 🤗

@fxmarty
Contributor

fxmarty commented May 6, 2022

This is cool!
