Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Parser handling #33

Open
EPICmynamesBG opened this issue Jan 9, 2018 · 1 comment
Open

Parser handling #33

EPICmynamesBG opened this issue Jan 9, 2018 · 1 comment
Assignees

Comments

@EPICmynamesBG
Copy link
Owner

Rows like

day soup
2018-01-09 Tuesday, 1/30
2018-01-31 Title
2018-02-02 Title

Are getting into the database. Update the parser and/or add another column where entries that will be repeatedly imported can be marked as incorrect, and the view can ignore those.

@EPICmynamesBG EPICmynamesBG self-assigned this Jan 9, 2018
@EPICmynamesBG
Copy link
Owner Author

A good time to learn some machine learning for classifying "is soup", "is not soup". Here are some helpful links.

http://scikit-learn.org/stable/datasets/twenty_newsgroups.html
https://towardsdatascience.com/machine-learning-nlp-text-classification-using-scikit-learn-python-and-nltk-c52b92a7c73a

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant