Stylometry Workshop Prep #1

ebeshero · 2018-05-09T16:08:40Z

Plays

Possible questions for stylometry:

Role of Actor-Managers in altering plays:
This needs just performed versions
Maybe: List of Macready-only variants (based on difference from variants)
What evidence do we see of distinct voices in the play documents?

Training set of files:

Can we get 3 plays by two directors
And an "Unknown" testing set (but really an outsider that we Know, so we know what the right answer)
(Maybe the director on a different author?)

Processing:

entirely plain text of the just the play (no metadata or cast list)
structural markup only (stage directions, acts, scenes, actors, and speeches)
Data pulled from structural markup:

numbers of actors in scenes
director-specific variants we've identified
stage directions only
stage directions NOT in manuscript
Question: Can any of these perform as well as just the plain text for Stylometric analysis?

@juola

ebeshero · 2018-05-09T16:47:48Z

Alternative: (possibly a longer collaborative research project post 25 May)

Question: Does Mitford writing prose sound "more like" Jane Austen, or to herself when she writes plays? And/or to Byron when he writes plays?

Think about structural characteristics from the markup (markup data) that might be helpful for stylometry. (This is something Patrick's curious to know...) (the ontological categories are more important than the hierarchy)

ebeshero · 2018-05-09T16:51:36Z

methods / parameters:

string-length()
number of words per sentence (sentences determined by end-stop punctuation followed by white space)

These quantitative metrics aren't really great distinguishers.

Use of function words (= words whose meaning is defined by context)

stop words = words that are so common that processing them doesn't help

Sometimes Stylometrists filter out everything except stop words, because these show the most distinctiveness

ebeshero added the enhancement New feature or request label May 9, 2018

ebeshero self-assigned this May 9, 2018

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stylometry Workshop Prep #1

Stylometry Workshop Prep #1

ebeshero commented May 9, 2018

ebeshero commented May 9, 2018 •

edited

Loading

ebeshero commented May 9, 2018

Stylometry Workshop Prep #1

Stylometry Workshop Prep #1

Comments

ebeshero commented May 9, 2018

Plays

Training set of files:

Processing:

ebeshero commented May 9, 2018 • edited Loading

Alternative: (possibly a longer collaborative research project post 25 May)

ebeshero commented May 9, 2018

methods / parameters:

ebeshero commented May 9, 2018 •

edited

Loading