{"id":7428,"date":"2023-07-11T10:44:07","date_gmt":"2023-07-11T15:44:07","guid":{"rendered":"https:\/\/blog.lib.uiowa.edu\/studio\/?p=7428"},"modified":"2023-07-11T10:44:07","modified_gmt":"2023-07-11T15:44:07","slug":"data-mining-for-medieval-messengers","status":"publish","type":"post","link":"https:\/\/blog.lib.uiowa.edu\/studio\/2023\/07\/11\/data-mining-for-medieval-messengers\/","title":{"rendered":"[Data] Mining for Medieval Messengers"},"content":{"rendered":"\r\n<p>Prior to Samuel Morse\u2019s invention of the telegraph in the first half of the nineteenth century, communication technology was chiefly limited to oral or textual messages delivered by a messenger. British sci-fi savant Arthur C. Clarke expands upon this fact, stating that \u201cWhen Queen Victoria came to the throne in 1837, she had no swifter means of sending messages to the far parts of her empire than had Julius Caesar\u2014or, for that matter, Moses.\u201d My dissertation, \u201cMessengers and Messages in Middle English Literature,\u201d examines the under-explored role of messengers in fourteenth-century English romances, where they often prove to be crucial elements of the plot or interesting stand-ins for an authorial or narrative function. <br \/><br \/>While each chapter of my dissertation focuses upon an analytical close reading of a specific medieval text or texts, such as The Canterbury Tales, The Death of Arthur, and Richard the Lionheart, I realized early in the project that I would also need a broader perspective on how medieval authors use messengers and messages throughout the Middle English literary canon. 
To that end, my work this summer will be to perform what scholar Franco Moretti has dubbed \u201cdistant reading,\u201d the process of \u201cunderstanding literature not by studying particular texts, but by aggregating and analyzing massive amounts of data.\u201d Because I am, unfortunately, not able to read a corpus of 300 medieval texts over the summer, I will be using Python scripts to \u201cread\u201d the texts for me and to extract data on keywords pertaining to messengers, which I will then be able to interpret and incorporate into my more traditional dissertation work. <br \/><br \/>This is a particularly challenging undertaking given the lack of any spelling standard in Middle English, which makes the number of possible search terms for any keyword positively daunting. For instance, according to the Middle English Dictionary, in Middle English the word \u201cmessenger\u201d is most often written as \u201cmess\u0101\u0306\u01e7\u0113\u0306r\u201d, but also appears as messagere, messagier, missanger, mansonger, and at least 30 other variant spellings. This is further complicated by the myriad synonyms for the word messenger in Middle English\u2014each with its own spelling eccentricities. Navigating through this linguistic labyrinth will, I hope, eventually result in a chapter of the dissertation displaying the value of a Digital Humanities approach to Middle English literature, complete with data visualizations and discussion of process, while also allowing me to support my own literary analysis with the data I\u2019ve collected from the textual analysis project. <br \/><br \/>This may all seem a bit \u201chigh-tech\u201d and futuristic, but much of the work is decidedly unexciting. With the help of my Studio contact, Nikki White, I\u2019ve acquired a corpus of 300 Middle English text files. 
These files are in XML, a markup language designed to be both human- and machine-readable, so I\u2019ve been able to write scripts to extract valuable metadata from the files, such as the title of each text, the author, and when the text was written (if these things are known\u2014the most common medieval author is \u201cAnonymous\u201d). This metadata has allowed me to build an index which will support the queries that guide my textual analysis. Before the fun part can begin, however, I have to \u201cclean\u201d the data from the raw XML files. Cleaning in this case doesn\u2019t involve a bucket and mop but is instead the process of fixing or removing incorrect, corrupted, incorrectly formatted, duplicate, or incomplete data within a dataset. When combining multiple data sources, there are many opportunities for data to be duplicated or mislabeled and, since the raw XML files from the Middle English Dictionary have been aggregated from numerous sources, there is an abundance of duplications and mislabeling to be found. I haven\u2019t done this much cleaning during the summer since the time I talked back to my mother between 4th and 5th grade, but I am hopeful the results will be worth it. <\/p>\r\n","protected":false},"excerpt":{"rendered":"<p>Prior to Samuel Morse\u2019s invention of the telegraph in the first half of the nineteenth century, communication technology was chiefly limited to oral or textual messages delivered by a messenger. British sci-fi savant Arthur C. 
Clarke expands upon this fact, stating that \u201cWhen Queen Victoria came to the throne in 1837, she had no swifter<a class=\"more-link\" href=\"https:\/\/blog.lib.uiowa.edu\/studio\/2023\/07\/11\/data-mining-for-medieval-messengers\/\">Continue reading <span class=\"screen-reader-text\">&#8220;[Data] Mining for Medieval Messengers&#8221;<\/span><\/a><\/p>\n","protected":false},"author":338,"featured_media":7380,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[32],"tags":[],"syndication":[21],"_links":{"self":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts\/7428"}],"collection":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/users\/338"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/comments?post=7428"}],"version-history":[{"count":2,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts\/7428\/revisions"}],"predecessor-version":[{"id":7449,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts\/7428\/revisions\/7449"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/media\/7380"}],"wp:attachment":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/media?parent=7428"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/categories?post=7428"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/tags?post=7428"},{"taxonomy":"syndication","embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/syndication?post=7428"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","tem
plated":true}]}}