Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Feature request: aggregate of non-ASCII/system file/directory characters #100

Open
kieranjol opened this issue Sep 21, 2022 · 1 comment
Labels
enhancement a feature by any other name In-progress this is actively being worked on

Comments

@kieranjol
Copy link
Contributor

I'm enjoying digging into the lists of non-ASCII and troublesome characters. I think it could be useful to show an aggregate of the characters that appear in a report, and how often they appear. For example:

 characters outside of ASCII range: '0xc9, LATIN CAPITAL LETTER E WITH ACUTE: É' (248)
 characters outside of ASCII range: '0xf028, None: ' (4)
 non-recommended character: '0x5b, LEFT SQUARE BRACKET: [' (1474)

It could be useful so that during appraisal, if I know that latin letters with acutes are supported within the repository, but perhaps bullets or other characters are not, then it would speed up the process of identifying problematic characters.

@ross-spencer
Copy link
Member

Sorry it took a while to get back to this. It's a good suggestion. I need some sample data so finally got round to creating a proper repo for some of my other test work here: https://github.com/ross-spencer/big-list-of-naughty-files which generates a lot of output that will appear in these kinds of aggregates. Unfortunately it breaks a few more things so I'll try and fix those first then add some more sample data to this issue to create the aggregates.

@ross-spencer ross-spencer added enhancement a feature by any other name In-progress this is actively being worked on labels Sep 10, 2023
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement a feature by any other name In-progress this is actively being worked on
Projects
None yet
Development

No branches or pull requests

2 participants