Text Mining: The Long Walk

Standard

Now that we’ve established that there is a link between key scenes in the plot progress of a Stephen King novel and mapped sentiment peaks coded from the text, we can spend significantly less time on analyzing each peak to show this. This will allow us to go through books with a little less ponderous text.

The Long Walk (1979)

The Long Walk is another short Bachman novel about sexually frustrated young men. This time it’s about the contestants of a gruelling, cruel national sport instituted after America’s loss in the Second World War and the institution of military rule by “The Squads.” The backdrop is briefly described but evocative for that when it is mentioned. At any rate, the protagonist is one of 100 contestants who start the Long Walk. They have to keep walking at a certain speed or they are shot by soldiers who are driving around beside them. They get three warnings to get their speed back up, otherwise the guns ring out and down goes another contestant. It’s a pretty horrifying idea when it comes right down to it, if only for how weirdly plausible it is given the modern love of both spectacle and fascism. It’s also pretty psychologically taxing, especially once the weakest contestants die off and it becomes a game to walk your opponents into the ground.

Continue reading

Advertisements

Text Mining: Salem’s Lot

Standard

 

SALEM’S LOT (1975)

Alright, now that we’ve established there’s some preliminary evidence of a link between emotional sentiment peaks and the plot progress of a Stephen King novel let’s keep going so we can start to see if there are patterns and also to generate a corpus of King material that we can use for topic modeling and other fun supervised/unsupervised machine learning stuff.

So, let’s go to the Lot as it slowly turns into a vampire colony.

Continue reading

Literary Fun With Text Mining

Standard

My wife is doing her PhD in political science on the topic of political interest groups and how they use social media to disseminate information and reach new audiences, and how they utilize this new(ish wow we’re old) medium to effect voting behaviour. Part of this has meant learning how to mine Twitter data and analyze it through the R programming language; in order to provide technical support and to have someone to troubleshoot coding issues, I’ve also been learning to use R to mine and analyze texts. What I’ve been concentrating on, in order to learn the language and the processes, is using it to mine and visualize data gathered from fictional texts, specifically the bibliography of Stephen King. What I want to do is to analyze plot trajectories drawn from sentiment data – quantitative measures of emotional sentiment words based on established dictionaries used for that sort of thing. Research questions on this would include things like: is there a pattern that King has for his plots, based on emotional language cues? Is this pattern, if any, different from other well-known horror writers? Furthermore, are there established “archetypal” emotional plot patterns for horror books, and do these patterns differ when you switch genres – say, to fantasy, military science fiction, paranormal romance, etc. etc. down the fracture lines of human experience.

Continue reading