Currently boolean columns are automatically encoded with RLE_DICTIONARY, there is nothing wrong with that as parquet specification is not limiting that in any way but...
- Apache spark is not supporting that
- It brings very little value as booleans can anyway be RLE encoded
We should put a rule in the code that would use the dictionary encoding on any other column type except Booleans.