{"id":5038,"date":"2017-04-25T15:16:57","date_gmt":"2017-04-25T20:16:57","guid":{"rendered":"https:\/\/blog.lib.uiowa.edu\/studio\/?p=5038"},"modified":"2017-04-25T15:16:57","modified_gmt":"2017-04-25T20:16:57","slug":"saving-endangered-data-what-can-digital-humanists-and-libraries-do","status":"publish","type":"post","link":"https:\/\/blog.lib.uiowa.edu\/studio\/2017\/04\/25\/saving-endangered-data-what-can-digital-humanists-and-libraries-do\/","title":{"rendered":"Saving Endangered Data: What Can Digital Humanists and Libraries Do?"},"content":{"rendered":"<p>In a blog post last week, I addressed Endangered Data Week and\u00a0<a href=\"https:\/\/www.forbes.com\/sites\/drsarahbond\/2017\/04\/19\/saving-endangered-data-from-ancient-rome-to-trumps-america\/#5861a6055d67\" target=\"_blank\" rel=\"noopener noreferrer\">the history of political parties hiding, removing, or altogether abolishing\u00a0public access to government documents.<\/a>\u00a0However,\u00a0my post wasn&#8217;t alone in trying to shed\u00a0light on this serious issue. In schools, universities, libraries, and classrooms across the\u00a0world, hundreds of concerned people came together\u00a0to bring awareness to the issue of endangered and disappearing data.\u00a0And while\u00a0<a href=\"http:\/\/endangereddataweek.org\/\">Endangered Data Week<\/a>\u00a0is now over, the threat is not. So this week, I teamed up with the Digital Scholarship &amp; Publishing Studio to highlight some of the excellent work currently being done by digital humanists and to provide some advice on how to get involved.<\/p>\n<figure id=\"attachment_5040\" aria-describedby=\"caption-attachment-5040\" style=\"width: 586px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/F4.large-1.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-5040\" src=\"https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/F4.large-1-300x121.jpg\" alt=\"\" width=\"586\" height=\"236\" srcset=\"https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/F4.large-1-300x121.jpg 300w, https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/F4.large-1-768x310.jpg 768w, https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/F4.large-1-1024x414.jpg 1024w, https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/F4.large-1-620x250.jpg 620w, https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/F4.large-1.jpg 1200w\" sizes=\"(max-width: 586px) 100vw, 586px\" \/><\/a><figcaption id=\"caption-attachment-5040\" class=\"wp-caption-text\"><em>The <\/em>Adlocutio<em> relief of the Plueti Traiani (Late 2nd c. CE). Now in the Curia Senatus in the Forum Romanum, Rome. Photo from Wikimedia but originally taken by Diane Favro (UCLA) for her &#8220;Death in Motion&#8221; article. It depicts debt records being burned.<\/em><\/figcaption><\/figure>\n<p>First, I visited with\u00a0Tom Keegan, Head of the <a href=\"https:\/\/www.lib.uiowa.edu\/studio\/\">Digital Scholarship &amp; Publishing Studio,<\/a> and\u00a0Matt Butler, the Studio&#8217;s Senior Developer, to discuss the services offered by university libraries to keep scholarly data safe. They stressed the\u00a0import of digital institutional repositories in helping scholars to maintain their own data and make it accessible to others free of charge. The University of Iowa&#8217;s institutional repository,\u00a0<a href=\"http:\/\/ir.uiowa.edu\/\">Iowa Research Online<\/a>, houses an array of faculty, graduate, and undergraduate work. Librarians work closely with faculty, staff, and students to ensure these materials are properly archived and made available according to agreed upon standards. As I have pointed out before, non-university repositories\u00a0<a href=\"https:\/\/www.forbes.com\/sites\/drsarahbond\/2017\/01\/23\/dear-scholars-delete-your-account-at-academia-edu\/#13b655042d62\">like Academia.edu<\/a> are for-profit and will indeed use your data in order to make them money.<\/p>\n<p>Profit\u00a0is a big factor to consider when thinking about where to put data. As Eric Kansa, founder of <a href=\"https:\/\/opencontext.org\/\">Open Context<\/a>\u00a0emphasized to me: &#8220;We need to maintain nonprofit (civil society) infrastructure to help maintain data (and backup internationally) during political crises. Organizations like the <a href=\"https:\/\/archive.org\/\">Internet Archive, <\/a>and other libraries (including university libraries) are critical, because they have the expertise and infrastructure needed to maintain public records.&#8221; Kansa rightfully points out that libraries are integral to this fight, but notes that individuals need to know more about\u00a0the vulnerability of data as well.<\/p>\n<p>So,\u00a0what do we do about all the government data (e.g. climate data) that is currently being pulled from government websites? This was just one question addressed by the group behind the formation of Endangered Data Week. Like most DH projects, EDW was\u00a0forged by proactive academics who wanted to make a difference by using the biggest megaphone in the world: The Web. Michigan State University professor and digital humanist\u00a0<a href=\"https:\/\/twitter.com\/brandontlocke\">Brandon Locke<\/a>, in collaboration with Jason A. Heppler, Bethany Nowviskie, and Wayne Graham, designed EDW on the model provided by\u00a0<a href=\"http:\/\/www.bannedbooksweek.org\/\">Banned Books Week<\/a> and <a href=\"http:\/\/www.openaccessweek.org\/\">Open Access Week<\/a>. From there they brought the project to the\u00a0<a href=\"https:\/\/diglib.org\/\">Digital Library Federation<\/a>&#8216;s new interest group on <a href=\"https:\/\/www.diglib.org\/transparency-accountability\/\">Government Records Transparency\/Accountability<\/a>, directed\u00a0by Rachel Mattson.<\/p>\n<p>In order to find out more about this initiative and the problems they are addressing, I spoke to\u00a0<a href=\"http:\/\/nowviskie.org\/\">Bethany Nowviskie,<\/a>\u00a0<span class=\"s1\">Director of the <a href=\"https:\/\/www.diglib.org\/\">Digital Library Federation <\/a>(DLF) at CLIR and a\u00a0<\/span><span class=\"s1\">Research Associate Professor of Digital Humanities, UVa. Prof. Nowviskie was kind enough to answer a number of questions I had about endangered data and how to get more involved in the fight to save it:\u00a0<\/span><\/p>\n<p class=\"p1\"><strong><em><span class=\"s1\">SB: Who owns federal data? In other words, should data be available to us because we pay taxes and fund data-producing institutions like HUD? The EPA? Why is the Executive in control of so much of this open data?\u00a0<\/span><\/em><\/strong><\/p>\n<p class=\"p3\"><span class=\"s1\">BN: Except where issues of personal privacy and cultural sensitivity are involved, data collected or produced by taxpayer-funded agencies of the federal government should be openly available to everyone. It\u2019s a matter of transparency for the health of the republic \u2014 sunlight being, as they say, the best disinfectant \u2014 and of accountability of the government to its people. These are our datasets, and we should have the ability to analyze and build on them \u2014 using them to understand our world better, as it is, and to be able to *make it better.*<\/span><\/p>\n<p class=\"p1\"><strong><em><span class=\"s1\">SB: How do we create a more centralized, non-profit infrastructure that can maintain data during political crises?<\/span><\/em><\/strong><\/p>\n<p class=\"p3\"><span class=\"s1\">BN: Most pieces of our needed infrastructure are already in place. We call them libraries. The DLF will join a large number of allied groups in early May, convened by DataRefuge (our Endangered Data Week partner) and the Association of Research Libraries, to discuss a new \u201cLibraries+ Network,\u201d to take on exactly this issue: <a href=\"https:\/\/libraries.network\/about\/\"><span class=\"s2\">https:\/\/libraries.network\/about\/<\/span><\/a> Some questions that will motivate us:\u00a0how can we create greater coherence among the many governmental, non-profit, and even commercial groups with longstanding commitments and expertise in particular areas of the data preservation enterprise? Might we re-energize and re-imagine something like the Federal Depository Library program for the digital age? What would it take for governmental agencies to implement data management plans for the full lifecycle of their information, just as researchers who receive federal funds are now typically required to do?\u00a0<\/span><\/p>\n<p class=\"p1\"><strong><em><span class=\"s1\">SB: What can regular non-specialists do to contribute?<\/span><\/em><\/strong><\/p>\n<p class=\"p3\"><span class=\"s1\">BN: This is one reason DLF jumped at the chance to support grassroots efforts to organize the first annual Endangered Data Week. The goals expressed and audiences implied in our capsule summary (&#8220;raising awareness of threats to publicly available data; exploring the power dynamics of data creation, sharing, and retention; and teaching ways to make endangered data more accessible and secure\u201d) go far beyond the professional research data management and data stewardship community. Probably the most useful thing a non-specialist can do is to educate herself on the issues and represent the value of open data legislation and the advances in open government we saw under the Obama administration to her representatives. We also need to urge follow-through on past bipartisan commitments in this sphere, such as the OPEN Government Data Act: <a href=\"https:\/\/www.datacoalition.org\/open-government-data-act\/\"><span class=\"s2\">https:\/\/www.datacoalition.org\/open-government-data-act\/<\/span><\/a> \u00a0<\/span><\/p>\n<p class=\"p1\"><strong><em><span class=\"s1\">SB: Can you give some examples of digital projects or initiatives that depend on federal data to reveal racial inequity (e.g. redlining projects), bias, or certain dangers (e.g. lead poisoning)?<\/span><\/em><\/strong><\/p>\n<p class=\"p3\"><span class=\"s1\">BN: Well, FOIA requests played an important role [<a href=\"https:\/\/www.scientificamerican.com\/article\/students-reveal-how-they-broke-the-lead-contamination-case-in-flint-mich\/\">in the Flint water crisis<\/a>]\u2014 as they have done in Title IX enforcement on college campuses. In this sphere, I also think it\u2019s worth mentioning that identical bills were recently introduced in both the House and Senate that would prohibit federal funds from being \u201cused to design, build, maintain, utilize, or provide access to a Federal database of geospatial information on community racial disparities or disparities in access to affordable housing.\u201d [<a href=\"\/\/www.congress.gov\/bill\/115th-congress\/house-bill\/482\/text\">House<\/a> Bill, <a href=\"https:\/\/www.congress.gov\/bill\/115th-congress\/senate-bill\/103\/text\">Senate<\/a> Bill].\u00a0<\/span>They went nowhere, and were ostensibly meant to \u201cprotect local zoning decisions,\u201d but *what is up with that?* This is the kind of thing that should energize non-specialist readers.<\/p>\n<p class=\"p1\"><strong><em><span class=\"s1\">SB: How can we have trust in the integrity of datasets that have been given over to private institutions or saved by non-federal entities? In other words, who will hold the \u201ccontrol\u201d copy (e.g. like a seed bank) that can assure us that datasets that have been saved were not then tampered with?<\/span><\/em><\/strong><\/p>\n<p class=\"p3\"><span class=\"s1\">BN: So, there\u2019s a huge professional community \u2014 many of them are DLF members or members of the National Digital Stewardship Alliance which we host \u2014 whose whole focus is on questions like this, and there are excellent protocols and procedures for ensuring data integrity. I\u2019m not familiar enough with the ins and outs to give you a good quote, but it\u2019s not a new problem, for sure, and methods for auditing and certifying digital repositories and verifying the integrity and security of the data within them are well established. As always, matters of policy, funding, and the professional development and nurturing of the communities who do the work are a bigger challenge than the technology!<\/span><\/p>\n<figure id=\"attachment_5043\" aria-describedby=\"caption-attachment-5043\" style=\"width: 412px\" class=\"wp-caption aligncenter\"><a href=\"https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/data-rescue-developers-1024x760.jpg\"><img loading=\"lazy\" decoding=\"async\" class=\" wp-image-5043\" src=\"https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/data-rescue-developers-1024x760-300x223.jpg\" alt=\"\" width=\"412\" height=\"306\" srcset=\"https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/data-rescue-developers-1024x760-300x223.jpg 300w, https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/data-rescue-developers-1024x760-768x570.jpg 768w, https:\/\/blog.lib.uiowa.edu\/studio\/files\/2017\/04\/data-rescue-developers-1024x760.jpg 1024w\" sizes=\"(max-width: 412px) 100vw, 412px\" \/><\/a><figcaption id=\"caption-attachment-5043\" class=\"wp-caption-text\"><em>Software developers at UC Berkeley (School of Information, South Hall) coding to crawl and archive federal databases (Photo via Eric Kansa).<\/em><\/figcaption><\/figure>\n<p class=\"p3\">Bethany&#8217;s comments above echo what others on campuses across the US are saying: data is a resource. Like water or electricity, access to it ought not be taken for granted. We must continue to be vigilant in the face of lazy and aggressive attitudes, alike. Libraries and library associations remain a big part of the fight to preserve this data, but all of us can play a part by being more aware, spreading the word, and getting involved in the movement.<\/p>\n<p class=\"p1\">\n","protected":false},"excerpt":{"rendered":"<p>In a blog post last week, I addressed Endangered Data Week and\u00a0the history of political parties hiding, removing, or altogether abolishing\u00a0public access to government documents.\u00a0However,\u00a0my post wasn&#8217;t alone in trying to shed\u00a0light on this serious issue. In schools, universities, libraries, and classrooms across the\u00a0world, hundreds of concerned people came together\u00a0to bring awareness to the issue<a class=\"more-link\" href=\"https:\/\/blog.lib.uiowa.edu\/studio\/2017\/04\/25\/saving-endangered-data-what-can-digital-humanists-and-libraries-do\/\">Continue reading <span class=\"screen-reader-text\">&#8220;Saving Endangered Data: What Can Digital Humanists and Libraries Do?&#8221;<\/span><\/a><\/p>\n","protected":false},"author":111,"featured_media":0,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"footnotes":""},"categories":[9,5,6,11,1],"tags":[],"syndication":[30,21],"_links":{"self":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts\/5038"}],"collection":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/users\/111"}],"replies":[{"embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/comments?post=5038"}],"version-history":[{"count":18,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts\/5038\/revisions"}],"predecessor-version":[{"id":5058,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/posts\/5038\/revisions\/5058"}],"wp:attachment":[{"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/media?parent=5038"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/categories?post=5038"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/tags?post=5038"},{"taxonomy":"syndication","embeddable":true,"href":"https:\/\/blog.lib.uiowa.edu\/studio\/wp-json\/wp\/v2\/syndication?post=5038"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}