{"id":40,"date":"2009-07-07T15:34:14","date_gmt":"2009-07-07T20:34:14","guid":{"rendered":"http:\/\/winedarksea.com\/?p=40"},"modified":"2025-02-10T17:58:55","modified_gmt":"2025-02-10T22:58:55","slug":"a-genre-map-of-shakespeares-plays-from-the-first-folio-1623","status":"publish","type":"post","link":"https:\/\/winedarksea.org\/?p=40","title":{"rendered":"A Genre Map of Shakespeare&#8217;s Plays from the First Folio (1623)"},"content":{"rendered":"<p><a href=\"http:\/\/winedarksea.com\/wp-content\/uploads\/2009\/07\/PCquadrantsCH.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-51\" title=\"Click for Full Size\" src=\"http:\/\/winedarksea.com\/wp-content\/uploads\/2009\/07\/PCquadrantsCH.jpg\" alt=\"PCquadrantsCH\" width=\"626\" height=\"564\" srcset=\"https:\/\/winedarksea.org\/wp-content\/uploads\/2009\/07\/PCquadrantsCH.jpg 993w, https:\/\/winedarksea.org\/wp-content\/uploads\/2009\/07\/PCquadrantsCH-300x270.jpg 300w\" sizes=\"auto, (max-width: 626px) 100vw, 626px\" \/><\/a>The following image, produced using the \u00a0statistical package R, represents the position of the plays from Shakespeare&#8217;s First Folio according patterns discerned using inferential statistics. The plays themselves were &#8220;tagged&#8221; by Docuscope and then the frequency scores of each play were analyzed with standard unsupervised statistical procedures (prcomp, center=True, scale=False), producing two principal components that define underlying patterns of correlation and opposition in Shakespeare&#8217;s use of certain kinds of words in different genres. \u00a0In this post, the first of several, I want to discuss what can be learned from such a map and how it either confirms or diverges from current critical understandings of Shakespeare&#8217;s genres as understood by Shakespeare&#8217;s contemporaries or later critics (who became interested in the so called &#8220;Late Plays&#8221;).<\/p>\n<p>The first thing to notice is that these two components are doing a reasonable job of separating out at least two types of plays &#8212; Comedies and Histories &#8212; by placing them in opposite corners of the scatterplot. \u00a0The two scatterplots you see here are really the same scatter viewed from two positions, so we will concentrate on the one in the upper right, which rates the plays on Principal Component 1 (PC1) on the horizontal axis and Principal Component 2 (PC2) on the vertical axis. \u00a0In Principal Component Analysis (PCA), the earlier components tend to &#8220;suck up&#8221; more variation than later ones, which means they define progressively less powerful avenues for simplifying relationships among the variable &#8212; here, the types of words Shakespeare uses (or doesn&#8217;t use) in different types of plays. \u00a0So PC1 does an excellent job of pushing the pink circles, the Histories, to the right of the scatter, which means they all use a proportionally higher degree of the types of words that are either favored or shunted by this factor. \u00a0(More anon.) \u00a0Note that the pink dots are plays classed as Histories in the First Folio, except for one green dot which is <em>Henry VIII<\/em> (we&#8217;ve pulled out four &#8220;late plays&#8221; in green: <em>Henry VIII<\/em>, <em>Cymbeline<\/em>, <em>Winter&#8217;s Tale<\/em> and <em>Tempest<\/em>). \u00a0PC2 finished the job, pushing these histories down on the vertical axis and so partitioning them (roughly) in the lower right-hand corner of the scatter. \u00a0Here we would say that the Histories or pink dots have comparatively fewer of the types of words favored by PC2 (and more of the words that it discriminates against). \u00a0Think of components, then, as representing correlations of things that Shakespeare does and doesn&#8217;t do over the course of all his writing published in F: \u00a0later we will see how his choice of one type of word &#8212; for example, first person singular pronouns &#8212; almost always &#8220;goes with&#8221; a lack of other types of words. \u00a0Here the point is to show that a basic pattern emerges with so called unsupervised statistical techniques, the most powerful because they do not use any groupings provided by us, but rather look for latent patterns among the words classed by Docuscope.<\/p>\n<p>Let&#8217;s &#8220;believe&#8221; what we&#8217;re seeing here and assume that these linguistic patterns correspond to something like the genre distinctions that Shakespeare&#8217;s editors, Heminges and Condell, saw when they classed the plays into three genres on the contents page of the First Folio. \u00a0(Hope and I have argued for this assumption\u00a0<a title=\"The Very Large Textual Object\" href=\"http:\/\/extra.shu.ac.uk\/emls\/09-3\/hopewhit.htm\">elsewhere<\/a>.) \u00a0Now there are two ways to proceed at this point. \u00a0We could provide a <a title=\"PC Loadings for Full Plays\" href=\"http:\/\/winedarksea.com\/?attachment_id=60\" target=\"_self\">spreadsheet<\/a>\u00a0detailing the loadings of the variables on each of the components, which would be interesting to you if you already knew Docuscope well and had a decent sense of how PCA works. \u00a0But a more immediately useful thing to do would be to show passages that contain lots of the tagged words that are pulling the plays in these different directions &#8212; the one&#8217;s that &#8220;magnetize&#8221; the plays into groups, so to speak &#8212; so that we can have a sense of what is an &#8220;exemplary&#8221; History passage according to PCA, or an exemplary Comedy passage. \u00a0(Note that the comedies are also being situated in the opposite quadrant &#8212; the upper left &#8212; from the histories: this is a pattern we see again and again at different levels of analysis.) \u00a0So what does a historical passage look like from <em>Richard II<\/em>? \u00a0What, more interestingly, does a &#8220;historical&#8221; passage look like from <em>Love&#8217;s Labour&#8217;s Lost<\/em>, which is &#8220;out of its quadrant&#8221; here? What does a typically comic passage from <em>Twelfth Night<\/em> look like? \u00a0And what kind of passages in Othello are pushing it up into the comic quadrant? \u00a0Do these &#8220;correct&#8221; and &#8220;incorrect&#8221; classings make sense to us from a literary critical standpoint? \u00a0It has been claimed before that <em>Othello<\/em> contain many elements of Shakespeare&#8217;s comedy writing, so this would be a good place to start thinking about the value of statistically assisted linguistic analysis of genre, which I will do in the next post. \u00a0(Jonathan may join in with comments, which will make this more of a back and forth.)<\/p>\n","protected":false},"excerpt":{"rendered":"<p>The following image, produced using the \u00a0statistical package R, represents the position of the plays from Shakespeare&#8217;s First Folio according patterns discerned using inferential statistics. The plays themselves were &#8220;tagged&#8221; by Docuscope and then the frequency scores of each play were analyzed with standard unsupervised statistical procedures (prcomp, center=True, scale=False), producing two principal components that [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[8],"tags":[],"class_list":["post-40","post","type-post","status-publish","format-standard","hentry","category-shakespeare"],"_links":{"self":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts\/40","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=40"}],"version-history":[{"count":24,"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts\/40\/revisions"}],"predecessor-version":[{"id":2821,"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts\/40\/revisions\/2821"}],"wp:attachment":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=40"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=40"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=40"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}