Cleanup shapes in model.input_schema and output_schema by rnyak · Pull Request #628 · NVIDIA-Merlin/Transformers4Rec

rnyak · 2023-03-01T19:25:37Z

This work addresses the Capturing shapes everywhere work. There are a bunch of places in Merlin that should fill information into the shape. for now we modified only model input_schema and output_schema code, but once we move to Schema from core, we can do more updates, accordingly.

github-actions · 2023-03-01T19:36:21Z

Documentation preview

https://nvidia-merlin.github.io/Transformers4Rec/review/pr-628

karlhigley · 2023-03-01T23:01:12Z

-            is_list = column.value_count.max > 0
+            dims = None
+            if column.value_count.max > 0:
+                dims = (None, column.value_count.max)


Do the value_counts always have the same size here? If so, it should still be safe to record the second dimension as (column.value_count.min, column.value_count.max) just in case they differ at some point (like after we add ragged input support to T4R.)

makes sense. so if we dont use padding in NVT workflow, value_counts.min() and value_counts.max() are different for the transformed dataset coming out of NVT workflow. but the padding is applied in the datalader (if I am not mistaken model takes dense inputs..)

Yeah, as I understand it, the current state of the Merlin-verse is that the T4R input layers expect dense fixed-size inputs and the dataloader bridges the gap by applying padding. I think that might change in the semi-near-future though, since it seems like we've settled on supporting both fixed size and ragged inputs everywhere in Merlin.

into cleanup_shapes

…nsformers4Rec into cleanup_shapes

into HEAD

cleanup shapes in base.py

ad0f695

rnyak requested a review from karlhigley March 1, 2023 19:25

rnyak added chore Maintenance for the repository enhancement New feature or request labels Mar 1, 2023

rnyak changed the title ~~cleanup shapes in model.input_schema and output_schema~~ [DRAFT] cleanup shapes in model.input_schema and output_schema Mar 1, 2023

rnyak added this to the Merlin 23.03 milestone Mar 1, 2023

karlhigley reviewed Mar 1, 2023

View reviewed changes

rnyak added 8 commits March 6, 2023 18:16

Merge branch 'main' of https://github.com/NVIDIA-Merlin/Transformers4Rec

11df72d

into cleanup_shapes

Merge branch 'main' into cleanup_shapes

9c1fe8c

set dims properly

9baaba3

Merge branch 'cleanup_shapes' of https://github.com/NVIDIA-Merlin/Tra…

0fb3011

…nsformers4Rec into cleanup_shapes

fix dims

b9e3ec3

Merge branch 'main' into cleanup_shapes

d2cdfe7

Merge branch 'main' of https://github.com/NVIDIA-Merlin/Transformers4Rec

cf36ffe

into HEAD

add dims in model output_schema

98bacb7

rnyak changed the title ~~[DRAFT] cleanup shapes in model.input_schema and output_schema~~ Cleanup shapes in model.input_schema and output_schema Mar 8, 2023

Merge branch 'main' into cleanup_shapes

903fdd8

karlhigley approved these changes Mar 9, 2023

View reviewed changes

oliverholworthy approved these changes Mar 10, 2023

View reviewed changes

Merge branch 'main' into cleanup_shapes

7824381

rnyak merged commit 5047d17 into main Mar 10, 2023

rnyak deleted the cleanup_shapes branch March 10, 2023 19:32

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Cleanup shapes in model.input_schema and output_schema#628

Cleanup shapes in model.input_schema and output_schema#628
rnyak merged 11 commits intomainfrom
cleanup_shapes

rnyak commented Mar 1, 2023 •

edited

Loading

Uh oh!

github-actions bot commented Mar 1, 2023

Uh oh!

karlhigley Mar 1, 2023

Uh oh!

rnyak Mar 2, 2023

Uh oh!

karlhigley Mar 6, 2023

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

rnyak commented Mar 1, 2023 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Mar 1, 2023

Documentation preview

Uh oh!

karlhigley Mar 1, 2023

Choose a reason for hiding this comment

Uh oh!

rnyak Mar 2, 2023

Choose a reason for hiding this comment

Uh oh!

karlhigley Mar 6, 2023

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

rnyak commented Mar 1, 2023 •

edited

Loading