Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Option to avoid transliterating punctuation marks as regular letters #66

Open
eyaler opened this issue Jul 22, 2021 · 1 comment
Open

Comments

@eyaler
Copy link

eyaler commented Jul 22, 2021

Always transliterating punctuation marks as regular letters could be an issue for some applications. While the paragraph sign ¶ is transliterated to P, I would like to have an option to treat it as unknown.

(I started this issue following 81f938d and regarding the exotic inverted nun ׆‎ that was changed to be transliterated into n as the regular nun נ. but the inverted one is an editorial/punctuation mark)

@eyaler eyaler changed the title do not transliterate inverted nun as regular nun Option to avoid transliterating punctuation marks as regular letters Jul 25, 2021
@avian2
Copy link
Owner

avian2 commented Aug 2, 2021

Thank you for the suggestion, but I am not going to implement this in Unidecode. I want to keep Unidecode a simple function with no configuration. The reason is similar to why I don't want to have language configuration in this library. I don't have time or knowledge to maintain the additional complexity. There are other transliteration libraries (unihandecode, for example) that are more configurable and might accept of your proposal.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants