KevinColemanInc/awesome-pii

awesome-pii

Awesome-PII is a collection of tools for detecting, extracting, and removing PII from data.

Regex

The regexes.yaml schema is not stable and may change in future versions.

Regexes are tested with Ruby 2.5.9 on Rubular.

Regexes assume lowercase input, so downcase the text before applying them.

Regexes are incomplete. Pull Requests welcome.
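
As a minimal sketch of the downcasing note above: the pattern below is a hypothetical email regex written only for illustration and is not taken from regexes.yaml. The names `EMAIL_PATTERN` and `find_emails` are assumptions as well. The sketch shows why the input has to be downcased first when a pattern contains only lowercase character classes.

```ruby
# Hypothetical pattern for illustration only; NOT copied from regexes.yaml.
# It contains only lowercase character classes, so uppercase input would not match.
EMAIL_PATTERN = /[a-z0-9._%+-]+@[a-z0-9.-]+\.[a-z]{2,}/

# Downcase the text first, then collect every match of the pattern.
def find_emails(text)
  text.downcase.scan(EMAIL_PATTERN)
end

puts find_emails("Contact: Jane.Doe@Example.COM").inspect
```

Without the `downcase` call, this particular pattern would find nothing in the sample string, because every letter in the address domain and part of the local name is uppercase.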

Categories

  • Phone
    • US
  • Email
  • Socials
    • Instagram
    • TikTok
    • Snapchat
    • LinkedIn
    • WhatsApp
    • Telegram
    • Twitter
    • Facebook
  • ID card
    • Social Security Number
  • Internet
    • IP Address
    • Domain Name
  • Personal
    • Date of Birth (US)
    • Race
    • Religion

Roadmap

  • Regexes

    • Implement regex groups to extract parts of regex
    • (ongoing) add support for obfuscation (e.g. "s.c." means Snapchat)
    • regexes with PII
    • remove PII from text (with ChatGPT)
  • Images

    • Image PII (with OCR)
    • Image PII removal (with OCR and Stable Diffusion)

  • Multi-language support
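
The first two regex roadmap items (groups for extracting parts of a match, and obfuscated platform names such as "s.c.") could look roughly like the sketch below. This is only one possible shape under stated assumptions: `SNAPCHAT_PATTERN`, `extract_snapchat`, the accepted platform spellings, and the handle character set are all hypothetical and do not come from regexes.yaml.

```ruby
# Hypothetical pattern for illustration only; NOT copied from regexes.yaml.
# Named groups capture the platform spelling (including the obfuscated "s.c.")
# and the handle that follows a separator (colon, dash, or whitespace).
SNAPCHAT_PATTERN = /\b(?<platform>snapchat|snap|s\.c\.)(?:\s*[:\-]\s*|\s+)@?(?<handle>[a-z0-9_.\-]{3,15})/

# Returns a hash with the matched platform spelling and handle, or nil if nothing matches.
def extract_snapchat(text)
  match = SNAPCHAT_PATTERN.match(text.downcase)
  return nil unless match

  { platform: match[:platform], handle: match[:handle] }
end

puts extract_snapchat("Add me on S.C.: Cool_User99").inspect
```

Named groups keep the extraction readable when the pattern grows, since callers ask for `match[:handle]` instead of a positional index that shifts whenever a group is added.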

Special Thanks
