[Data] include_paths=True does not add "path" column key to the (lazy) schema and fails in groupby("path")

### What happened + What you expected to happen

* `includes_paths=True` does not add `paths` to the schema
* it would fail some checks in the `groupby`

For example, `ds = ray.data.read_parquet("data/", include_paths=True)`, gives
```python
In [24]: ds
Out[24]: Dataset(num_rows=?, schema={id: int64, data: double, uuid: string})
```
without the expected path column.

Then if we want to do
```python
ds.groupby("path").count().take_all()
```
It fails in `SortKey.validate_schema(self, schema)`:
```python
     81 for column in self._columns:
     82     if column not in schema_names_set:
---> 83         raise ValueError(
     84             f"You specified the column '{column}', but there's no such "
     85             "column in the dataset. The dataset has columns: "
     86             f"{schema.names}"
     87         )

ValueError: You specified the column 'path', but there's no such column in the dataset. The dataset has columns: ['id', 'data', 'uuid']
```

For debugging purpose, it would work if:
* disable that line of check
* or use materialize()


### Versions / Dependencies

master

### Reproduction script

Any dataset:
```python
ds = ray.data.read_parquet("data/", include_paths=True)
ds.groupby("path").count()
```

### Issue Severity

Medium: It is a significant difficulty but I can work around it.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Data] include_paths=True does not add "path" column key to the (lazy) schema and fails in groupby("path") #60027

What happened + What you expected to happen

Versions / Dependencies

Reproduction script

Issue Severity

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[Data] include_paths=True does not add "path" column key to the (lazy) schema and fails in groupby("path") #60027

Description

What happened + What you expected to happen

Versions / Dependencies

Reproduction script

Issue Severity

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions