@@ -7,10 +7,13 @@ complexSymbols:
77# identifier regexp
88# Sentence endings.
99. sentence ending (?<=[^\s.])\.(?=[\"'”’)\s]|$)
10+ 。 sentence ending (?<=[^\s.])\.(?=[\"'”’)\s]|$)
1011! sentence ending (?<=[^\s!])\!(?=[\"'”’)\s]|$)
1112? sentence ending (?<=[^\s?])\?(?=[\"'”’)\s]|$)
13+ ؟ sentence ending (?<=[^\s?])\?(?=[\"'”’)\s]|$)
1214# Phrase endings.
1315; phrase ending (?<=[^\s;]);(?=\s|$)
16+ ؛ phrase ending (?<=[^\s;]);(?=\s|$)
1417: phrase ending (?<=[^\s:]):(?=\s|$)
1518# Others
1619decimal point (?<![^\d -])\.(?=\d)
@@ -22,9 +25,12 @@ symbols:
2225
2326# Complex symbols
2427. sentence ending dot all always
28+ 。 sentence ending dot all always
2529! sentence ending bang all always
2630? sentence ending question all always
31+ ؟ sentence ending arabic question mark all always
2732; phrase ending semi most always
33+ ؛ phrase ending arabic semicolon most always
2834: phrase ending colon most always
2935decimal point none always
3036in-word ' tick all norep
@@ -59,6 +65,8 @@ $ dollar all norep
5965) right paren most always
6066* star some
6167, comma all always
68+ 、 ideographic comma all always
69+ ، arabic comma all always
6270- dash most
6371. dot some
6472/ slash some
0 commit comments