-
Notifications
You must be signed in to change notification settings - Fork 2
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Stylometry Workshop Prep #1
Comments
Alternative: (possibly a longer collaborative research project post 25 May)Question: Does Mitford writing prose sound "more like" Jane Austen, or to herself when she writes plays? And/or to Byron when he writes plays? Think about structural characteristics from the markup (markup data) that might be helpful for stylometry. (This is something Patrick's curious to know...) (the ontological categories are more important than the hierarchy) |
methods / parameters:string-length() These quantitative metrics aren't really great distinguishers. Use of function words (= words whose meaning is defined by context) stop words = words that are so common that processing them doesn't help Sometimes Stylometrists filter out everything except stop words, because these show the most distinctiveness |
Plays
Possible questions for stylometry:
This needs just performed versions
Maybe: List of Macready-only variants (based on difference from variants)
What evidence do we see of distinct voices in the play documents?
Training set of files:
Can we get 3 plays by two directors
And an "Unknown" testing set (but really an outsider that we Know, so we know what the right answer)
(Maybe the director on a different author?)
Processing:
Question: Can any of these perform as well as just the plain text for Stylometric analysis?
@juola
The text was updated successfully, but these errors were encountered: