Problem:
Users should be able to serve pytorch models that were produced with Transformers4Rec or any other process using a Systems ensemble. This will work towards supporting Session-based models as well as expanding System's support for a new modeling framework.
Goal:
Systems should be able to serve all pytorch models that are currently supported by Triton.
Definition of Done
Have an example that serves PyT session based model in conjunction with a NVT workflow where the session based models scores the whole catalog
Open questions
Constraints:
Not all pytorch models can be served via Triton's pytorch backend, so we will need to be able to use multiple backends in order to serve all Triton-compatible pytorch models.
Starting Point:
Transformers4Rec
Systems
Integration Issues
Nice to have: (P1)
Documentation
Examples
Blockers:
Problem:
Users should be able to serve pytorch models that were produced with Transformers4Rec or any other process using a Systems ensemble. This will work towards supporting Session-based models as well as expanding System's support for a new modeling framework.
Goal:
Systems should be able to serve all pytorch models that are currently supported by Triton.
Definition of Done
Have an example that serves PyT session based model in conjunction with a NVT workflow where the session based models scores the whole catalog
Open questions
Constraints:
Not all pytorch models can be served via Triton's
pytorchbackend, so we will need to be able to use multiple backends in order to serve all Triton-compatible pytorch models.Starting Point:
Transformers4Rec
Systems
pytorchbackend for "torchscriptable" models for optimized performancePredictPytorchoperator systems#153Integration Issues
is_raggedin LocalExecutor_transform_datacore#173 should fix that issue.Nice to have: (P1)
Documentation
Examples
Blockers: