Skip to content

Update IndicNormalizer.java#14657

Closed
Trey314159 wants to merge 2 commits intoapache:mainfrom
Trey314159:patch-1
Closed

Update IndicNormalizer.java#14657
Trey314159 wants to merge 2 commits intoapache:mainfrom
Trey314159:patch-1

Conversation

@Trey314159
Copy link
Contributor

Remove Telugu normalization of vu వు to ma మ from IndicNormalizer.

Description

Telugu vu వు and ma మ are visually similar—akin to English "rn" and "m"—but they should not be conflated. Names like వెంకటరామ (Venkatarama) and వెంకటరావు (Venkatarao) and words like మండే and వుండే (links to Telugu Wiktionary) are distinct.

It's like conflating burn/bum and corn/com. It could happen when reading quickly or with poor handwriting, but it is not something that should happen for search indexing.

Telugu vu వు and ma మ are visually similar—akin to English "rn" and "m"—but should not be conflated. Names like వెంకటరామ (Venkatarama) and వెంకటరావు (Venkatarao) are distinct.
@github-actions
Copy link
Contributor

This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the label to it and you will stop receiving this reminder on future updates to the PR.

@github-actions
Copy link
Contributor

This PR does not have an entry in lucene/CHANGES.txt. Consider adding one. If the PR doesn't need a changelog entry, then add the skip-changelog-check label to it and you will stop receiving this reminder on future updates to the PR.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant