Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Language Display Names data needs to be recreated #201

Open
echeran opened this issue Mar 19, 2024 · 4 comments
Open

Language Display Names data needs to be recreated #201

echeran opened this issue Mar 19, 2024 · 4 comments

Comments

@echeran
Copy link
Collaborator

echeran commented Mar 19, 2024

We need to recreate language display name data so that we can have a source of test input data which:

  • is based on open source ICU
  • provides the ICU version it is based on
  • provides the values for the various options for display name formatting used
@saumyabharti294
Copy link

Hello @echeran
I picked up the ICU For python testing and i want to know that json file for the same needs to be converted to python? If yes then are the steps below is what i need to follow?

1.Load JSON Data: Parse the JSON data for each test case.

2.Access Test Scenario: Retrieve the test scenario from the parsed data.

3.Access Verifications: Iterate through the verifications for the test case.

4.Perform Actions: Based on the test scenario and verifications, perform any necessary actions such as executing tests, verifying results, and generating reports.

5.Repeat for Each Test Case: Repeat the process for each test case in your dataset.

Also after this we have to run the code on some specific platform or we can run on local python environment?

@sffc sffc added this to the Priority ⟨P2⟩ milestone Apr 29, 2024
@sven-oly
Copy link
Collaborator

sven-oly commented Aug 7, 2024

Based on discussion today, here are notes on what we plan to do:

  1. Update this to "locale display names" because the CLDR data includes more information than just language names, e.g., the names of digits in languages.
  2. The newly generated CLDR localeDisplayName.txt data includes a subset of test locales for each locale, not all locales names in all locales. This substantially reduces the size of the testing needed while still covering interesting test cases.
  3. The data now includes "regular" names for regions vs. "dialect" names. These should be added to the JSON data as a parameter and to each executor as a testing option.
  4. The format of localeDisplayName.txt requires new code to parse for creating the JSON test cases.
  5. Executors will need to be updated to handle the "dialect" option in test data.
  6. The code for generating localeDisplayName.txt is being added to CLDR 76. To use locale display names for previous ICU/CLDR versions, we need to run this test code with previous CLDR versions, e.g., 45, 44, etc.

@sven-oly
Copy link
Collaborator

sven-oly commented Aug 7, 2024

One more thing: to actually incorporate the localeDisplayName.txt data into DDT, we'll copy the file from the CLDR github repository into the testgen/icuXX directory. Perhaps this data could be automatically fetched from github rather than being manually added to the conformance repository.

@sffc
Copy link
Member

sffc commented Aug 7, 2024

All sounds good.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants