Remove padding from NVTabular getting started example by oliverholworthy · Pull Request #677 · NVIDIA-Merlin/Transformers4Rec

oliverholworthy · 2023-04-18T12:55:22Z

Demonstrates how we can serve Transformers4Rec and NVTabular together with ragged outputs from the workflow and ragged inputs into the model

review-notebook-app · 2023-04-18T12:55:27Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

github-actions · 2023-04-18T13:11:34Z

Documentation preview

https://nvidia-merlin.github.io/Transformers4Rec/review/pr-677

rnyak · 2023-04-21T05:24:52Z

@oliverholworthy this looks like ready to me, but it reads Draft.

rnyak · 2023-04-26T14:06:38Z

@@ -63,18 +63,7 @@
   "execution_count": 2,


@oliver this number 486 does not match with the cardinality in the workflow.output_schema. are you using the different saved workflow by any chance? each time you rerun ETL you will get a different num of unique items since we gen data.

Reply via ReviewNB

let me re-run both notebooks again. I only commited the part of the first notebook that had changed

run all the notebooks now and should match up now with the schema

rnyak · 2023-05-01T19:08:22Z

@oliverholworthy I pulled the latest main branches and then your PR and I am getting error from unit test from first ETL notebook.

E           File /usr/local/lib/python3.8/dist-packages/merlin/dag/executors.py:287, in DaskExecutor.transform(self, dataset, graph, output_dtypes, additional_columns, capture_dtypes, strict)
E               283 nodes = self._executor._output_nodes(graph)
E               285 self._clear_worker_cache()
E           --> 287 ddf = dataset.to_ddf()
E               289 # Check if we are only selecting columns (no transforms).
E               290 # If so, we should perform column selection at the ddf level.
E               291 # Otherwise, Dask will not push the column selection into the
E               292 # IO function.
E               293 if not nodes:
E           
E           File /usr/local/lib/python3.8/dist-packages/dask/dataframe/core.py:4686, in DataFrame.__getattr__(self, key)
E              4684     object.__getattribute__(self, key)
E              4685 else:
E           -> 4686     raise AttributeError("'DataFrame' object has no attribute %r" % key)
E           
E           AttributeError: 'DataFrame' object has no attribute 'to_ddf'

Remove padding from NVTabular getting started example

dc6b28c

oliverholworthy added the area/examples label Apr 18, 2023

oliverholworthy added this to the Merlin 23.04 milestone Apr 18, 2023

oliverholworthy self-assigned this Apr 18, 2023

rnyak added the chore Maintenance for the repository label Apr 18, 2023

rnyak self-requested a review April 18, 2023 15:53

oliverholworthy mentioned this pull request Apr 18, 2023

Have a padding operator and add Support for ragged inputs that we can use consistently with Transformers for rec NVIDIA-Merlin/systems#322

Closed

rnyak approved these changes Apr 21, 2023

View reviewed changes

oliverholworthy marked this pull request as ready for review April 26, 2023 10:57

Select sample of rows from dataframe used to trace model

f87547c

rnyak reviewed Apr 26, 2023

View reviewed changes

Run getting started notebooks

4917c65

oliverholworthy modified the milestones: Merlin 23.04, Merlin 23.05 Apr 26, 2023

rnyak added 4 commits April 26, 2023 11:30

Merge branch 'main' into serve-session-based-with-ragged-inputs-outputs

3d7f1b0

Merge branch 'main' into serve-session-based-with-ragged-inputs-outputs

0b0ce4d

Merge branch 'main' into serve-session-based-with-ragged-inputs-outputs

ba1ae65

Merge branch 'main' into serve-session-based-with-ragged-inputs-outputs

e79259b

Merge branch 'main' into serve-session-based-with-ragged-inputs-outputs

edf68a8

oliverholworthy merged commit e655580 into NVIDIA-Merlin:main May 4, 2023

oliverholworthy deleted the serve-session-based-with-ragged-inputs-outputs branch May 4, 2023 12:44

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Remove padding from NVTabular getting started example#677

Remove padding from NVTabular getting started example#677
oliverholworthy merged 8 commits intoNVIDIA-Merlin:mainfrom
oliverholworthy:serve-session-based-with-ragged-inputs-outputs

oliverholworthy commented Apr 18, 2023

Uh oh!

review-notebook-app bot commented Apr 18, 2023

Uh oh!

github-actions bot commented Apr 18, 2023

Uh oh!

rnyak commented Apr 21, 2023

Uh oh!

rnyak Apr 26, 2023 •

edited

Loading

Uh oh!

oliverholworthy Apr 26, 2023

Uh oh!

oliverholworthy Apr 26, 2023

Uh oh!

rnyak commented May 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

oliverholworthy commented Apr 18, 2023

Uh oh!

review-notebook-app bot commented Apr 18, 2023

Uh oh!

github-actions bot commented Apr 18, 2023

Documentation preview

Uh oh!

rnyak commented Apr 21, 2023

Uh oh!

rnyak Apr 26, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

oliverholworthy Apr 26, 2023

Choose a reason for hiding this comment

Uh oh!

oliverholworthy Apr 26, 2023

Choose a reason for hiding this comment

Uh oh!

rnyak commented May 1, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

rnyak Apr 26, 2023 •

edited

Loading