Closed
Labels
Compilation (issues related to torchdynamo and torchinductor), Feature request (request for a new feature), Good Difficult Issue
Description
Feature request
As per #28981, LLaVA is planned to receive torch.compile support. LLaVA is composed of a vision tower and an LLM, each of which can be compiled separately with fullgraph=True (once support has been added, which is not yet the case for Mistral), so it seems much easier to compile the two parts separately rather than the model as a whole.
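A rough sketch of what compiling the two parts separately could look like. `ToyLlava`, its submodule layout, and `compile_parts` are hypothetical stand-ins for illustration, not the transformers classes or API; the only real calls are `torch.compile` and standard `torch.nn` modules. The glue step between the two parts is left in eager mode.

```python
import torch
from torch import nn


class ToyLlava(nn.Module):
    """Hypothetical stand-in for a LLaVA-style model (not the transformers class)."""

    def __init__(self, hidden=16, vocab=32, image_token_id=0):
        super().__init__()
        self.image_token_id = image_token_id
        self.embed = nn.Embedding(vocab, hidden)
        # Toy "vision tower": maps 8-dim patch vectors to hidden-size features.
        self.vision_tower = nn.Sequential(nn.Linear(8, hidden), nn.GELU())
        # Toy "LLM": maps hidden states to vocabulary logits.
        self.language_model = nn.Sequential(
            nn.Linear(hidden, hidden), nn.GELU(), nn.Linear(hidden, vocab)
        )

    def merge(self, input_ids, image_features):
        # Eager-mode glue: overwrite each image placeholder embedding
        # with one image feature vector.
        embeds = self.embed(input_ids).clone()
        mask = input_ids == self.image_token_id
        embeds[mask] = image_features.reshape(-1, embeds.shape[-1])
        return embeds

    def forward(self, input_ids, patches):
        image_features = self.vision_tower(patches)      # compiled part 1
        embeds = self.merge(input_ids, image_features)   # stays eager
        return self.language_model(embeds)               # compiled part 2


def compile_parts(model, backend="inductor"):
    """Compile the two submodules independently, each as a single full graph."""
    model.vision_tower = torch.compile(
        model.vision_tower, fullgraph=True, backend=backend
    )
    model.language_model = torch.compile(
        model.language_model, fullgraph=True, backend=backend
    )
    return model
```

With this split, only the two submodules have to satisfy fullgraph=True; the merge step never enters a compiled graph. Passing `backend="eager"` is a convenient way to check that tracing succeeds without invoking a code-generating backend.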
Motivation
The _merge_input_ids_with_image_features function that connects the two parts is difficult to compile: it needs dynamic input sizes because the number of input image tokens can change, and PyTorch has yet to add compile support for many of the functions it uses under those conditions.
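To illustrate why the sizes are dynamic, here is a simplified sketch (not the transformers implementation) of the kind of computation such a merge step needs. `merged_sequence_length`, `IMAGE_TOKEN_ID`, and `patches_per_image` are hypothetical names chosen for the example. The point is that two inputs with the same shape produce different output sizes, because the sizes depend on the values stored in `input_ids`.

```python
import torch


def merged_sequence_length(input_ids, image_token_id, patches_per_image):
    """Length of the sequence after each image token expands into its patches."""
    image_mask = input_ids == image_token_id
    images_per_row = image_mask.sum(dim=-1)
    # The result depends on tensor *values*, not just on tensor shapes.
    extra = images_per_row.max() * (patches_per_image - 1)
    return int(input_ids.shape[-1] + extra)


IMAGE_TOKEN_ID = 32  # hypothetical placeholder id, for illustration only

# Same shape (1, 4), different number of image tokens.
one_image = torch.tensor([[1, IMAGE_TOKEN_ID, 5, 6]])
two_images = torch.tensor([[IMAGE_TOKEN_ID, IMAGE_TOKEN_ID, 5, 6]])

len_one = merged_sequence_length(one_image, IMAGE_TOKEN_ID, patches_per_image=4)
len_two = merged_sequence_length(two_images, IMAGE_TOKEN_ID, patches_per_image=4)

# torch.where with a single argument returns index tensors whose length
# is only known once the data has been inspected.
text_positions_one = torch.where(one_image != IMAGE_TOKEN_ID)[1]
text_positions_two = torch.where(two_images != IMAGE_TOKEN_ID)[1]
```

Operations like the single-argument `torch.where` have data-dependent output shapes, which is the class of operation that is hard to capture in a single static graph.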
Your contribution
I'd love to try submitting a PR if possible, but I'm not sure what the best way to do so is given the current circumstances.