CountVectorizer for text preprocessing

I understand why hash seems to be a better solution for distributed text preprocessing, but I also need a way to make my features human-readable. It seems like spark has a [CountVectorizer](https://spark.apache.org/docs/latest/ml-features#countvectorizer). Would it be possible to implement one for dask-ml?