[data] Remove dead FastFileMetadataProvider code #59027
bveeramani merged 3 commits into ray-project:master
Code Review
This pull request does a great job of removing the dead code for FastFileMetadataProvider. The changes are clean and cover all necessary areas including implementation, documentation, and tests. I have one suggestion to improve a test that becomes a bit confusing after the code removal.
Thanks for your contribution!
Hey @rushikeshadhav, just checking in on this when you get a chance.
Force-pushed from d7013d6 to 079fdc7.
Bug: Test execution code removed instead of replaced with alternative (Bugbot Rules)
The test_read_binary_meta_provider test function has its main positive test case removed entirely. The code that reads binary files with FastFileMetadataProvider and asserts the result was deleted, leaving only the setup code and the negative pytest.raises test. Per reviewer comment, this functionality needs to be preserved using DefaultFileMetadataProvider() instead of simply deleting the test execution.
python/ray/data/tests/test_binary.py, lines 88 to 89 in 079fdc7
@owenowenisme, I have made the required changes. Please check.
Thanks for the heads up. I have resolved the comments.
This pull request has been automatically marked as stale because it has not had recent activity. You can always ask for help on our discussion forum or Ray's public Slack channel. If you'd like to keep this open, just leave any comment, and the stale label will be removed.
After removing the deprecated read_parquet_bulk API, FastFileMetadataProvider became dead code with no remaining usage in the codebase. This commit removes:

- FastFileMetadataProvider class implementation
- All imports and exports of FastFileMetadataProvider
- Tests that specifically tested FastFileMetadataProvider
- Documentation references to FastFileMetadataProvider
- Code comments mentioning FastFileMetadataProvider

The class was previously used as a faster alternative to DefaultFileMetadataProvider that skipped directory expansion and file size collection, but it's no longer needed after the bulk API removal.

Signed-off-by: rushikesh.adhav <adhavrushikesh6@gmail.com>
Signed-off-by: Rushikesh Adhav <adhavrushikesh6@gmail.com>
Signed-off-by: Rushikesh Adhav <adhavrushikesh6@gmail.com>
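For readers unfamiliar with the two providers, the commit message above says the removed class skipped directory expansion and file size collection. The sketch below illustrates that trade-off using only the standard library. It is not Ray's implementation: `expand_with_sizes`, `passthrough_without_sizes`, and `FileEntry` are hypothetical names invented for this note, and the real providers operate on filesystem abstractions rather than local `os` calls.

```python
import os
import tempfile
from typing import List, Optional, Tuple

# Hypothetical types and helpers for illustration only; not Ray's API.
FileEntry = Tuple[str, Optional[int]]


def expand_with_sizes(paths: List[str]) -> List[FileEntry]:
    """'Default'-style listing: expand directories and stat every file."""
    entries: List[FileEntry] = []
    for path in paths:
        if os.path.isdir(path):
            for root, dirs, files in os.walk(path):
                dirs.sort()  # keep traversal order deterministic
                for name in sorted(files):
                    file_path = os.path.join(root, name)
                    entries.append((file_path, os.path.getsize(file_path)))
        else:
            entries.append((path, os.path.getsize(path)))
    return entries


def passthrough_without_sizes(paths: List[str]) -> List[FileEntry]:
    """'Fast'-style listing: no expansion, no stat calls, size unknown."""
    return [(path, None) for path in paths]


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        for name, payload in [("a.bin", b"abc"), ("b.bin", b"hello")]:
            with open(os.path.join(tmp, name), "wb") as f:
                f.write(payload)
        print(expand_with_sizes([tmp]))          # one entry per file, with sizes
        print(passthrough_without_sizes([tmp]))  # the directory path, size None
```

The pass-through variant avoids one filesystem call per file, which is the kind of saving the commit message attributes to the removed class, at the cost of leaving sizes unknown and directories unexpanded.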
Force-pushed from 6f79618 to 63e2a8f.
## Description

After removing the deprecated `read_parquet_bulk` API, `FastFileMetadataProvider` became dead code with no remaining usage in the codebase. This commit removes:

- FastFileMetadataProvider class implementation
- All imports and exports of FastFileMetadataProvider
- Tests that specifically tested FastFileMetadataProvider
- Documentation references to FastFileMetadataProvider
- Code comments mentioning FastFileMetadataProvider

## Related issues

> Fixes ray-project#59010

---------

Signed-off-by: rushikesh.adhav <adhavrushikesh6@gmail.com>
Signed-off-by: Rushikesh Adhav <adhavrushikesh6@gmail.com>
Signed-off-by: jasonwrwang <jasonwrwang@tencent.com>
…es (#60084) (#60091)

Categorize APIs into Public APIs and Developer APIs, and sort them alphabetically by service name.

Changes:

- Reorganized loading_data.rst and saving_data.rst with Public APIs first, then Developer APIs
- Sorted all APIs alphabetically by service name within each section
- Split sections that originally mixed Public and Developer APIs into their respective sections
- Removed the entry for datasource.FastFileMetadataProvider, an API that has since been removed ([reference](#59027))

Fixes #60084

Signed-off-by: mgchoi239 <mg.choi.239@gmail.com>

---------

Signed-off-by: mgchoi239 <mg.choi.239@gmail.com>
Signed-off-by: Balaji Veeramani <bveeramani@berkeley.edu>
Co-authored-by: mgchoi239 <mg.choi.239@gmail.com>
Co-authored-by: Balaji Veeramani <bveeramani@berkeley.edu>