Can this compression library be used when the input encoding is not UTF8 or the character set is not Unicode? #51

Open
qnxsgwy opened this issue Feb 3, 2023 · 1 comment

Comments

@qnxsgwy

qnxsgwy commented Feb 3, 2023

Can this compression library be used when the input encoding is not UTF-8 (e.g. short binary data) or the character set is not Unicode (e.g. GB2312, Big5, Shift_JIS)?

@siara-cc
Owner

siara-cc commented Feb 3, 2023

Presently the code is written only for UTF-8. It could be implemented for UTF-16 and other encodings. That would benefit some languages, but there may not be much space saving for languages such as Chinese, Japanese and Korean, because the number of distinct characters is large.
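
Until other encodings are supported, one possible workaround for text in a legacy character set is to transcode it to UTF-8 before compression and back afterwards. The sketch below uses only Python's standard codecs. The helper names `to_utf8` and `from_utf8` are hypothetical and not part of this library, and the compression call itself is left out. It also shows why the saving for CJK text may be small: each character here takes 2 bytes in GB2312 but 3 bytes in UTF-8. Arbitrary binary data is not covered by this approach, since it generally does not decode as text.

```python
def to_utf8(raw: bytes, source_encoding: str) -> bytes:
    """Transcode bytes from a legacy encoding (e.g. 'gb2312') to UTF-8."""
    return raw.decode(source_encoding).encode("utf-8")


def from_utf8(utf8: bytes, target_encoding: str) -> bytes:
    """Transcode UTF-8 bytes back to the legacy encoding."""
    return utf8.decode("utf-8").encode(target_encoding)


# Four Chinese characters stored as GB2312 bytes.
legacy = "你好世界".encode("gb2312")

# Convert to UTF-8; this is the form a UTF-8-only compressor would accept.
utf8 = to_utf8(legacy, "gb2312")

print(len(legacy), len(utf8))  # prints "8 12"

# After decompression, convert back to the original encoding.
restored = from_utf8(utf8, "gb2312")
```

The same helpers should work for `big5` and `shift_jis`, which are also standard Python codec names.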
