Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Arabic text #242

Open
fasihrana opened this issue Dec 19, 2018 · 3 comments
Open

Arabic text #242

fasihrana opened this issue Dec 19, 2018 · 3 comments
Assignees
Labels
A: lib-api Library API A: lib-impl Library Implementation C: joining enhancement Enhancements to existing features S: arabic Script: Arabic

Comments

@fasihrana
Copy link

fasihrana commented Dec 19, 2018

Is there something that allows converting a string of individual Arabic characters to the proper joined form of Arabic text?

For example my name written in individual letters is:
ف ص ی ح
while the joined form is (if you view this in any modern browser):
فصیح

@behnam
Copy link
Member

behnam commented Jan 2, 2019

Thanks for filing this, @fasihrana.

We have two crates in the plans regarding the Unicode Arabic Cursive Joining algorithm:

  • unic-ucd-joining, which provides character properties data used for UACJ,
  • unic-joining (or unic-string-joining), to implement the algorithm.

But, should mention that, the output of such algorithm won't be a new string, but a list of joining forms, per character.

I'm guessing what you need is actually to convert the string (plug the output of UACJ algorithm) to (deprecated) Presentation Form code-points. Is that so?

If yes, could you please elaborate on why you need it, so we can see what are the options (and API design) to provide such functionality in UNIC?

@behnam behnam added A: lib-api Library API A: lib-impl Library Implementation S: arabic Script: Arabic enhancement Enhancements to existing features C: joining labels Jan 2, 2019
@fasihrana
Copy link
Author

I don't know enough about the deprecation of Presentation Code Points. Though typing on a keyboard, I doubt you could get any of the joined forms whch is the issue I'm trying to resolve, to convert a keyboard typed word in a joining alphabet to be presented correctly.

What I'm guessing is that once I have the code points for joined alphabets, the presentation will become easier?

@xclud
Copy link

xclud commented Sep 16, 2023

@fasihrana This package solves the Problem: https://crates.io/crates/bidi

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
A: lib-api Library API A: lib-impl Library Implementation C: joining enhancement Enhancements to existing features S: arabic Script: Arabic
Projects
None yet
Development

No branches or pull requests

3 participants