BTW, you can experiment with Unicode yourself, without even recompiling the emulator - go to
-
There is no automatic way to do this. The files contain comments and explanations. For the list of Unicode blocks and characters, I like the page https://www.compart.com/en/unicode/block - just note that we don't want to support everything; DOS cannot handle hieroglyphs :) In
-
Example: CYRILLIC CAPITAL LETTER I WITH GRAVE - https://www.compart.com/en/unicode/U+040D
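To see what "decomposable" means for this character, here is a minimal sketch using Python's standard `unicodedata` module. It only shows the canonical decomposition defined by Unicode; it is not the emulator's own code, and the helper names `decompose` and `base_letter` are made up for illustration.

```python
import unicodedata


def decompose(ch: str) -> list[str]:
    """Names of the code points in the canonical (NFD) decomposition of ch."""
    return [unicodedata.name(c) for c in unicodedata.normalize("NFD", ch)]


def base_letter(ch: str) -> str:
    """Decompose ch and drop the combining marks, keeping only the base letter."""
    return "".join(
        c for c in unicodedata.normalize("NFD", ch) if not unicodedata.combining(c)
    )


# U+040D CYRILLIC CAPITAL LETTER I WITH GRAVE
print(decompose("\u040D"))
print(base_letter("\u040D"))
```

Both U+040D (Ѝ) and U+0419 (Й) decompose to the same base letter, U+0418 (И), plus a combining accent, which is why a decomposition-based fallback can map them to a character the code page already has.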
-
In MAIN.TXT you can declare as many fallbacks (aliases) as needed.
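The idea of an ordered fallback chain can be sketched roughly as follows. This is an illustration only: the `FALLBACKS` table, `can_encode` and `to_dos` are hypothetical names, the table does not follow the MAIN.TXT syntax, and Python's built-in `cp437`/`cp866` codecs stand in for the emulator's code page tables.

```python
# Hypothetical alias table: fallbacks are tried in the listed order.
FALLBACKS = {
    "\u040D": ["\u0419", "\u0418", "\u00EC"],  # Ѝ -> Й -> И -> ì
    "\u045D": ["\u0439", "\u0438", "\u00EC"],  # ѝ -> й -> и -> ì
    "\u20AC": ["\u03B5", "E"],                 # € -> ε -> E
}


def can_encode(ch: str, codepage: str) -> bool:
    """True if the code page (a Python codec name here) contains ch."""
    try:
        ch.encode(codepage)
        return True
    except UnicodeEncodeError:
        return False


def to_dos(ch: str, codepage: str, unknown: str = "?") -> str:
    """Return ch itself, or its first fallback present in the code page."""
    for candidate in [ch, *FALLBACKS.get(ch, [])]:
        if can_encode(candidate, codepage):
            return candidate
    return unknown


print(to_dos("\u20AC", "cp437"))  # euro sign on a CP437 guest
print(to_dos("\u040D", "cp866"))  # Ѝ on a Cyrillic code page
```

With several aliases per character, the same table can serve different code pages: a Cyrillic code page picks the Cyrillic analogue, while CP437 falls through to a Latin or Greek look-alike.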
-
OK, should I make a PR with the following?
DECOMPOSITION.TXT
CAPITAL_SMALL.TXT
ASCII.TXT
MAIN.TXT
In addition to the previous discussion - CYRILLIC LETTER SHORT I is also decomposable, and I added that here. Below is the romanization for source codepage 3021 - the differences from the default (Russian?) ASCII.TXT - to be added after 0.81, or once language/country can be detected? (For now I tested it by changing the TXT files.)
-
While testing the above, I found that the DOSBox ECHO command doesn't seem to support the caret sign (^) for printing special characters such as the pipe and others:
-
Done in #2621
-
Following #2548 by @FeralChild64 and looking forward to #1864
A few characters can get better analogues
Host Unicode -> DOS
CYRILLIC CAPITAL LETTER I WITH GRAVE Ѝ -> Й CYRILLIC CAPITAL LETTER SHORT I (currently it gives "?")
CYRILLIC SMALL LETTER I WITH GRAVE ѝ -> й CYRILLIC SMALL LETTER SHORT I (currently it gives "?")
CYRILLIC CAPITAL LETTER BYELORUSSIAN-UKRAINIAN I (Windows keyboard layout output for the Roman numeral 1, Shift + - = І) -> LATIN CAPITAL LETTER I (instead of the current DOSBox output of И CYRILLIC CAPITAL LETTER I)
CYRILLIC CAPITAL LETTER I WITH GRAVE Ѝ -> ì LATIN SMALL LETTER I WITH GRAVE
CYRILLIC SMALL LETTER I WITH GRAVE ѝ -> ì LATIN SMALL LETTER I WITH GRAVE
EURO € -> ε/∈ per Wikipedia: "It was later unofficially used as the euro sign [U+20AC, €]" (except when using the most limited set of "Printable ASCII characters"). Currently with CP437 it gives "?".
Transliteration - as explained at Informal romanizations of Cyrillic, transliteration depends not only on the source codepage/symbols, but also on the language. The current mapping (e.g. Cyrillic letters represented via CP437) fits some languages, but is unusable for others. It would be nice to check the source "locale"/region/language and adjust the mapping accordingly.
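A rough sketch of what a language-aware mapping could look like, assuming the source language is known somehow. The tables are tiny illustrative samples (lowercase only), not the contents of ASCII.TXT, and the names `COMMON`, `OVERRIDES` and `romanize` are hypothetical.

```python
# Letters that romanize the same way in both sample languages.
COMMON = {"и": "i", "т": "t", "г": "g", "л": "l"}

# Per-language differences, illustrative only:
# Bulgarian usually romanizes щ as "sht" and ъ as "a",
# Russian usually romanizes щ as "shch" and often drops the hard sign ъ.
OVERRIDES = {
    "bg": {"щ": "sht", "ъ": "a"},
    "ru": {"щ": "shch", "ъ": ""},
}


def romanize(text: str, lang: str) -> str:
    """Romanize text with the common table plus the overrides for lang."""
    table = {**COMMON, **OVERRIDES[lang]}
    # Characters without an entry are passed through unchanged.
    return "".join(table.get(ch, ch) for ch in text)


print(romanize("щит", "bg"))
print(romanize("щит", "ru"))
```

The same Cyrillic word comes out differently depending on the language, so a single fixed table cannot be right for every user of the same code page.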