Hi!
Is it possible to add option to Exclude encodings?
Or to specify a list of encodings to choose from?
It would provide a solution for some consistent codepage misdetections, for example see these issues:
chardet detect UTF-8 XML File as EUC_KR - Possibility to exclude encodings? #287
Missdetect win1251 as MacCyrillic #132
Clearly there's more similar issues that could be resolved by Excluding encoding/specifying a list of encodings.
This is especially important if a language has a number of similar encodings and/or text is small.
Thank you.