{"id":1596,"date":"2012-08-17T06:28:46","date_gmt":"2012-08-17T11:28:46","guid":{"rendered":"http:\/\/winedarksea.org\/?p=1596"},"modified":"2025-02-10T17:41:43","modified_gmt":"2025-02-10T22:41:43","slug":"what-happens-in-hamlet","status":"publish","type":"post","link":"https:\/\/winedarksea.org\/?p=1596","title":{"rendered":"What happens in Hamlet?"},"content":{"rendered":"<p>We perform digital analysis on literary texts not to answer questions, but to generate questions. The questions digital analysis <em>can<\/em> answer are generally not &#8216;interesting&#8217; in a humanist sense: but the questions digital analysis <em>provokes<\/em> often are. And these questions have to be answered by &#8216;traditional&#8217; literary methods. Here&#8217;s an example.<\/p>\n<p><a href=\"http:\/\/www.kcl.ac.uk\/artshums\/depts\/english\/people\/staff\/associates\/visit\/fells\/karimcooper.aspx\">Dr Farah Karim-Cooper<\/a>, head of <a href=\"http:\/\/www.shakespearesglobe.com\/education\/library-research\">research<\/a> at Shakespeare&#8217;s Globe\u00a0just asked on Twitter if I had any suggestions for a lecture on <em>Hamlet<\/em> she was due to give. Ten minutes later I had some &#8216;interesting&#8217; questions for her.<\/p>\n<p>I began with <a href=\"http:\/\/wordhoard.northwestern.edu\/userman\/index.html\">Wordhoard<\/a>&#8216;s\u00a0<a href=\"http:\/\/wordhoard.northwestern.edu\/userman\/analysis-comparewords.html#loglike\">log-likelihood<\/a> function, comparing <em>Hamlet<\/em> to the rest of Shakespeare&#8217;s plays. You can view the results of this as a tag cloud:<\/p>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_1598\" aria-describedby=\"caption-attachment-1598\" style=\"width: 422px\" class=\"wp-caption alignleft\"><a href=\"http:\/\/winedarksea.org\/?attachment_id=1598\" rel=\"attachment wp-att-1598\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-1598 \" title=\"Hamlet tag cloud\" src=\"http:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-tag-cloud.jpg\" alt=\"a tag cloud: looks good, immediate, doesn't tell you much\" width=\"422\" height=\"246\" srcset=\"https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-tag-cloud.jpg 1173w, https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-tag-cloud-300x174.jpg 300w, https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-tag-cloud-1024x597.jpg 1024w\" sizes=\"auto, (max-width: 422px) 100vw, 422px\" \/><\/a><figcaption id=\"caption-attachment-1598\" class=\"wp-caption-text\">Tag cloud for Hamlet vs the rest of Shakespeare: black words are raised in frequency; grey words lowered; size indicates strength of effect<\/figcaption><\/figure>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>which is nice, but for real text analytics you need to read the spreadsheet of figures. Word-frequency analysis is limited in many ways, but it can surprise you if you look in the right places and at the right things.<\/p>\n<p><a href=\"http:\/\/winedarksea.org\/?attachment_id=1605\" rel=\"attachment wp-att-1605\"><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-full wp-image-1605\" title=\"Hamlet LLresults\" src=\"http:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-LLresults.jpg\" alt=\"not nice to look at, but much more information\" width=\"1173\" height=\"684\" srcset=\"https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-LLresults.jpg 1173w, https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-LLresults-300x174.jpg 300w, https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-LLresults-1024x597.jpg 1024w\" sizes=\"auto, (max-width: 1173px) 100vw, 1173px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>When I run log-likelihood, I always look first for the items that are <em>lower<\/em> than expected, rather than those that are raised (which tend to be content words associated with the topic of the text, and thus fairly obvious). I also tend to look at <em>function words<\/em> (pronouns, articles, auxiliary verbs) rather than nouns or adjectives.<\/p>\n<p>If you look for absences of high-frequency items, you are using digital text analysis to do the things it does best compared to human reading: picking up absence, and analysing high-frequency items. Humans are good at spotting the presence of low frequency items, items that disrupt a pattern (outliers, in statistical terms) &#8211; but we are not good at noticing things that are not there (dogs that don&#8217;t bark in the night) and we are not good at seeing woods (we see trees, especially unusual trees).<\/p>\n<p>The <em>Hamlet<\/em> results were pretty outstanding in this respect: very high up the list, with 3 stars, indicating very strong statistical significance, is a minus result for the pronoun &#8216;I&#8217;. A check across the figures shows that &#8216;I&#8217; occurs in <em>Hamlet<\/em> about 184 times every 10,000 words (see the column headed &#8216;Analysis parts per 10,000&#8217; &#8211; <em>Hamlet<\/em> is the &#8216;analysis text&#8217; here), whereas in the rest of Shakespeare it occurs about 228 times every 10,000 words (see the column headed &#8216;Reference parts per 10,000) &#8211; the reference corpus is the rest of Shakespeare) &#8211; so every 10,000 words in <em>Hamlet<\/em> have about 40 fewer &#8216;I&#8217; pronouns than we&#8217;d expect.<\/p>\n<p><a href=\"http:\/\/winedarksea.org\/?attachment_id=1610\" rel=\"attachment wp-att-1610\"><img loading=\"lazy\" decoding=\"async\" class=\"alignleft size-full wp-image-1610\" title=\"Hamlet I result\" src=\"http:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-I-result.jpg\" alt=\"\" width=\"918\" height=\"100\" srcset=\"https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-I-result.jpg 918w, https:\/\/winedarksea.org\/wp-content\/uploads\/2012\/08\/Hamlet-I-result-300x32.jpg 300w\" sizes=\"auto, (max-width: 918px) 100vw, 918px\" \/><\/a><\/p>\n<p>&nbsp;<\/p>\n<p>Or, to put it another way, Shakespeare normally uses &#8216;I&#8217; 228 times every 10,000 words. <em>Hamlet<\/em> is about 30,000 words long, so we&#8217;d expect, all other things being equal, that Shakespeare would use &#8216;I&#8217; 684 times. In fact, he uses it just 546 times &#8211; and Wordhoard checks the figures to see if we could expect this drop due to chance or normal variation. The three stars next to the log likelihood score for &#8216;I&#8217; tell us that this figure is very unlikely to be due to chance &#8211; something is causing the drop.<\/p>\n<p>Digital analysis can&#8217;t explain the cause of the drop: the only question it is answering here is, &#8216;How frequently does Shakespeare use &#8220;I&#8221; in <em>Hamlet<\/em> compared to his other plays?&#8217;. On its own, this is not a very interesting question. But the analysis provokes the much more interesting question, &#8216;<em>Why<\/em> does Shakespeare use &#8220;I&#8221; far less frequently in <em>Hamlet<\/em> than normal?&#8217;.<\/p>\n<p>Given literary-critical claims that <em>Hamlet<\/em> marks <a href=\"http:\/\/bostonreview.net\/BR24.1\/atwan.html\">the birth of the modern consciousness<\/a>, it is surprising to find a drop in the frequency of first-person forms. But for an <em>explanation<\/em> of why this might happen, you&#8217;ll have to attend Dr Karim-Cooper&#8217;s lecture, ask on Twitter: <a href=\"https:\/\/twitter.com\/drfarahkc\">@DrFarahKC<\/a> &#8211; or <a href=\"http:\/\/internetshakespeare.uvic.ca\/Library\/plays\/Ham.html\">go back to the play yourself<\/a>.<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n<p>&nbsp;<\/p>\n","protected":false},"excerpt":{"rendered":"<p>We perform digital analysis on literary texts not to answer questions, but to generate questions. The questions digital analysis can answer are generally not &#8216;interesting&#8217; in a humanist sense: but the questions digital analysis provokes often are. And these questions have to be answered by &#8216;traditional&#8217; literary methods. Here&#8217;s an example. Dr Farah Karim-Cooper, head [&hellip;]<\/p>\n","protected":false},"author":4,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[51,8],"tags":[134,135,109,136,110],"class_list":["post-1596","post","type-post","status-publish","format-standard","hentry","category-early-modern-drama","category-shakespeare","tag-farah-karim-cooper","tag-hamlet","tag-log-likelihood","tag-pronoun-i","tag-wordhoard"],"_links":{"self":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts\/1596","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/users\/4"}],"replies":[{"embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=1596"}],"version-history":[{"count":29,"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts\/1596\/revisions"}],"predecessor-version":[{"id":1628,"href":"https:\/\/winedarksea.org\/index.php?rest_route=\/wp\/v2\/posts\/1596\/revisions\/1628"}],"wp:attachment":[{"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=1596"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=1596"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/winedarksea.org\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=1596"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}