ARROW-8244: [Python] Fix parquet.write_to_dataset to set file path in metadata_collector#6797
ARROW-8244: [Python] Fix parquet.write_to_dataset to set file path in metadata_collector#6797jorisvandenbossche wants to merge 2 commits intoapache:masterfrom
Conversation
Thank you for working on this @jorisvandenbossche! Yes, these changes are exactly what I had in mind. I don't think there is any danger in setting the file_path here, because the returned metadata will never be included in the file where the data is stored (the only case where the value of |
2f446b9 to
1eb7ef9
Compare
|
@rjzamora Thanks for the feedback! I agree that just setting the file path is probably the only sensible behaviour, so we can simply change that. I added a test for the non-partitioned case as well. |
This explores a potential fix for ARROW-8244, it seems rather straightforward to set the file path in
write_to_dataset(write_tabledoes not do this, because there the user passes a full path, so no relative path is known).cc @rjzamora does this look the correct logic?