Palavras_PT-BR.db: A sqlite database
Palavras_PT-BR.txt: Text file, one word per line
Current number of words: 1.916.706
To access a preview of the words, visit https://alfredofilho.github.io/Palavras_PT-BR/
db_create.py
: Recreates the database.db_delete.py
: Deletes words from the database.db_insert.py
: Inserts words into the database.db_select.py
: Selects data from the database.AVLTree.py
: Implements an AVL Binary Tree.example_usage_tree.py
: Contains a example of AVL Tree usage.
- As a first base I used the IntelliJ dictionary
- Web scraping to get some popular Brazilian Portuguese verbs
- Spacy to separate all verbs in the infinitive tense
- Web scraping again to conjugate in all verb tenses with the website https://www.conjugacao.com.br/
- Deleted roman numerals
- Some suspicious words that seemed wrong were deleted
- I added a few manually that I saw were missing