NVTabular issues

Can I install NVTabular from source? [QST]

3

**What is your question?** I've installed NVTabular three times today, and each time I encountered a different bug. It's quite frustrating. The latest error was the closest to success,...

zxgx

question

P1

Update the `Categorify` operator to set the domain max correctly

3

## Goal Reduce the resulting `int_domain.max` property by one on a `ColumnSchema` after transforming with `Categorify`, so that it matches the data correctly. ## Motivation / Context This PR was motivated by...
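The off-by-one in the goal above can be illustrated with a minimal sketch. It assumes the encoded ids are contiguous and start at 0, so the largest id is one less than the number of distinct ids. The function names are hypothetical and are not part of NVTabular.

```python
def domain_max_from_cardinality(cardinality):
    """Largest id when ids are 0, 1, ..., cardinality - 1 (assumed encoding)."""
    if cardinality < 1:
        raise ValueError("cardinality must be at least 1")
    return cardinality - 1


def observed_int_domain(encoded_ids):
    """Min and max of the ids actually present in the encoded data."""
    ids = list(encoded_ids)
    return {"min": min(ids), "max": max(ids)}


# Four distinct ids (0..3) appear in this illustrative column.
encoded = [0, 2, 1, 3, 2]
domain = observed_int_domain(encoded)
expected_max = domain_max_from_cardinality(len(set(encoded)))
```

Under that assumption, a schema reporting the cardinality itself as the maximum would be one larger than any value in the data.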

oliverholworthy

bug

Graphs were correctly visualized when the script was run again.

7

Graphs of the categorical features, the combination of categorical features (i.e. `('userId', 'movieId')`), and the numerical feature (i.e. `rating`) were visualized; the difference can be seen in the uploaded script.

kuwarkapur

[FEA] classification example

1

I tried changing the loss function to BCELoss. I got a message saying that many models use a sigmoid layer right before the binary cross entropy layer. How do I...
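The message in the question reflects that binary cross entropy is defined on probabilities in (0, 1), so raw model outputs (logits) need a sigmoid first. In PyTorch, `torch.nn.BCELoss` expects probabilities, while `torch.nn.BCEWithLogitsLoss` applies the sigmoid internally. The sketch below uses only the standard library to show that the two formulations agree; it is an illustration of the math, not the model code from this issue.

```python
import math


def sigmoid(x):
    """Map a raw score (logit) to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def bce(prob, target):
    """Binary cross entropy for one probability and a 0/1 target."""
    return -(target * math.log(prob) + (1.0 - target) * math.log(1.0 - prob))


def bce_with_logits(logit, target):
    """Same loss computed directly from the logit in a numerically stable form."""
    return max(logit, 0.0) - logit * target + math.log1p(math.exp(-abs(logit)))


# Sigmoid followed by BCE matches the fused logit-based loss.
loss_two_step = bce(sigmoid(0.3), 1.0)
loss_fused = bce_with_logits(0.3, 1.0)
```

Passing a logit outside (0, 1) straight to `bce` would fail or give a meaningless value, which is why a sigmoid layer usually sits before the loss.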

subbayya

question

PyTorch

Extract Python and Dask `Executor` classes from `Workflow`

13

We'd like to re-use some of the mechanics of graph execution (both local and distributed) in other parts of Merlin, so this is a step in the direction of disentangling...

karlhigley

chore

[BUG] Getting error when jointly encoding single-hot and multi-hot categ columns

1

**Describe the bug** I would like to jointly encode single and multi-hot categorical columns but I am getting the following error: ``` --------------------------------------------------------------------------- TypeError Traceback (most recent call last) Input...

rnyak

bug

P1

S3

Fix `GroupBy` column names when grouping and outputting same column

6

In this case, the piece of code that should be connecting the column name with the aggregation name incorrectly tries to join every character of the column name with the...
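The failure mode described above is a common Python pitfall: iterating over a bare string yields its characters, so code that expects a list of column names produces one output name per character. The sketch below reproduces the pattern and one possible guard. The helper name, the `_` separator, and the `sum` aggregation are illustrative assumptions, not NVTabular's implementation.

```python
def aggregated_names(columns, agg, sep="_"):
    """Build '<column><sep><agg>' names (hypothetical helper).

    A single column name passed as a string is wrapped in a list first,
    so it is not iterated character by character.
    """
    if isinstance(columns, str):
        columns = [columns]
    return [f"{col}{sep}{agg}" for col in columns]


# Bug pattern: a string is iterable, so each character becomes a "column".
buggy = [f"{ch}_sum" for ch in "id"]

# Guarded version: the whole name is treated as one column.
fixed = aggregated_names("id", "sum")
```

Normalizing a string to a one-element list at the function boundary keeps the joining logic identical for single and multiple columns.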

karlhigley

bug

[BUG] `Categorify` with multi-column group fails when NA values are present

3

**Describe the bug** **Steps/Code to reproduce bug** The following code ``` import cudf import nvtabular as nvt import numpy as np gdf = cudf.DataFrame(data=[['apple', np.nan], ['apple', 'red'], ['apple', 'green'], ['orange',...

radekosmulski

bug

P0

update NVTabular examples

56

This is just a very early WIP, only adding it here to share current status with @bschifferer, still a lot of work to be done

radekosmulski

documentation

examples

[BUG] `Groupby` with single column for grouping and calculating results throws an error

2

**Describe the bug** This functionality is important because often we might want to group by an identifier column (such as `customer_id` for instance), perform some calculations on the groupings and...

radekosmulski

bug

P1
