What is the problem the feature request solves?
Currently Comet cannot be triggered if Spark users read data from cached RDD. To support this use case, we'll need to add support for Spark's InMemoryRelation.
It looks like we may need to implement Arrow for CachedBatchSerializer.
Describe the potential solution
Add Comet support for InMemoryRelation, so that Spark query starts from cached RDD can also use Comet native execution.
Additional context
It is not a priority as of now, but will be something good to have in future.
What is the problem the feature request solves?
Currently Comet cannot be triggered if Spark users read data from cached RDD. To support this use case, we'll need to add support for Spark's
InMemoryRelation.It looks like we may need to implement Arrow for CachedBatchSerializer.
Describe the potential solution
Add Comet support for
InMemoryRelation, so that Spark query starts from cached RDD can also use Comet native execution.Additional context
It is not a priority as of now, but will be something good to have in future.