Skip to content

initialize the read with the OpenFile info and not just the path#17652

Merged
Mytherin merged 1 commit intoduckdb:v1.3-ossivalisfrom
Tmonster:fix_parallel_parquet_read_initialize_with_open_file_info
May 27, 2025
Merged

initialize the read with the OpenFile info and not just the path#17652
Mytherin merged 1 commit intoduckdb:v1.3-ossivalisfrom
Tmonster:fix_parallel_parquet_read_initialize_with_open_file_info

Conversation

@Tmonster
Copy link
Contributor

Fixes https://github.com/duckdblabs/duckdb-internal/issues/4961

Another part of the fix is to fix the HTTPFileCache, but that needs to be done in the httpfs extension.
This uses the OpenFileInfo file to create the new file handle. This way, we also copy all of the extended properties of the OpenFileInfo like validate_external_cache and file_size. This makes it easier to prevent unnecessary extra (1) head requests on large files. But if there are multiple files that are all a bit larger, this these HEAD requests add up

@Mytherin Mytherin merged commit 243eb89 into duckdb:v1.3-ossivalis May 27, 2025
77 of 85 checks passed
@Mytherin
Copy link
Collaborator

Thanks!

krlmlr added a commit to duckdb/duckdb-r that referenced this pull request Jun 2, 2025
initialize the read with the OpenFile info and not just the path (duckdb/duckdb#17652)
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants