Description
Hi,
I create very long UTF-8 encoded text files with Python (anywhere from hundreds of thousands to millions of rows). When I open these files in Notepad++, it assigns the Windows EOL. When I then change the EOL to Unix, Notepad++ corrupts some of the UTF-8 characters: black-filled squares with white appear sporadically throughout the file, at the beginning and in the middle of my words. This has happened to me with Russian (Cyrillic), Croatian (Latin with special characters) and Slovene (Latin with special characters).
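For reference, here is a minimal sketch of how files like these could be written from Python with Unix line endings in the first place, which would avoid the EOL conversion step in Notepad++ altogether. The function name `write_rows_unix` and the sample rows are hypothetical, not my actual script. It relies on the documented behaviour of the built-in `open()`: with `newline="\n"`, a `"\n"` written in text mode is not translated to the platform line separator.

```python
def write_rows_unix(path, rows):
    """Write each row as one UTF-8 line terminated by a bare LF.

    Hypothetical helper for illustration. newline="\n" tells open()
    not to translate "\n" into "\r\n" on Windows.
    """
    with open(path, "w", encoding="utf-8", newline="\n") as f:
        for row in rows:
            f.write(row + "\n")


if __name__ == "__main__":
    # Sample rows in the three scripts mentioned in this report.
    write_rows_unix("sample.txt", ["Привет, мир", "čćžšđ", "čšž"])
```

This is only a workaround for producing the files; it does not explain why the conversion inside Notepad++ damages the text.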
The same error has also happened to me sometimes while editing a file in Notepad++ and running search and replace across the whole file. In that scenario, the timing and the actions that cause the characters to be inserted seem more random.
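In case it helps with reproducing the problem, below is a rough sketch of a check that could be run on a file after editing to locate damaged lines. It assumes (I have not confirmed this) that the black squares correspond either to bytes that are no longer valid UTF-8 or to the Unicode replacement character U+FFFD. The function name `find_damaged_lines` is hypothetical.

```python
def find_damaged_lines(path):
    """Return (line_number, kind, offset) tuples for suspicious lines.

    Hypothetical diagnostic. The file is read in binary mode so that
    invalid bytes are reported instead of raising on open; splitting
    on LF is safe because 0x0A never occurs inside a multi-byte
    UTF-8 sequence.
    """
    damaged = []
    with open(path, "rb") as f:
        for lineno, raw in enumerate(f, start=1):
            try:
                text = raw.decode("utf-8")
            except UnicodeDecodeError as exc:
                # exc.start is the byte offset of the first bad byte in this line
                damaged.append((lineno, "invalid-bytes", exc.start))
                continue
            pos = text.find("\ufffd")
            if pos != -1:
                # character offset of the first replacement character
                damaged.append((lineno, "replacement-char", pos))
    return damaged
```

Comparing the result before and after the EOL conversion should show whether the conversion itself introduces the damage, at least under the assumption above.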
Because of this, I consider Notepad++ unreliable for editing UTF-8 encoded files. I would love to be able to use it for this, though, as I like its functionality in other situations.
Please let me know if you have any questions. Thanks in advance.