Reading nested field does not work with use_legacy_dataset=False.
This works:
import pyarrow.parquet as pq
t = pq.read_table(
source=*filename*,
columns=['store_key', 'properties.country'],
use_legacy_dataset=True,
).to_pandas()
This does not work (for the same parquet file):
import pyarrow.parquet as pq
t = pq.read_table(
source=*filename*,
columns=['store_key', 'properties.country'],
use_legacy_dataset=False,
).to_pandas()
Reporter: Tom Scheffers
Assignee: Miles Granger / @milesgranger
Related issues:
PRs and other links:
Note: This issue was originally created as ARROW-14596. Please see the migration documentation for further details.