The University of Iowa Libraries

Library News

Tag: research data

Mar 07 2023

The Power of Persistent Identifiers in Data Sharing

Posted on March 7, 2023 by Nancy Henke

Every researcher has likely seen Digital Object Identifiers (DOIs) attached to the articles they read, cite, and publish. As unique codes that distinguish a specific research article, dataset, or other digital object, DOIs make it much easier to find and cite the work of other scholars. What you may not know, however, is that DOIs are just one type of persistent identifier (PID). PIDs make your research findable now and in the future, keep it reusable for the long term, and offer new insights into your research.

PIDs, sometimes referred to as persistent digital identifiers (PDIs) or globally unique identifiers (GUIDs), take many different forms. You may have seen an Open Researcher and Contributor ID (ORCID) attached to someone’s email signature, or a Research Resource ID (RRID) in a journal article identifying a specific antibody used in the study, or even an International Standard Book Number (ISBN) in a book you’ve read. All of these are PIDs, but don’t let the alphabet soup of initials intimidate you. In essence, all persistent identifiers have two traits that make them powerful.

The first is the part you see and use: the string of letters and/or numbers that uniquely identifies an entity (like an article, researcher, dataset, or institution). These codes will not change, will not be reused, and are a major reason finding information is so easy with PIDs. The second part of a PID is behind the scenes: the service that locates the resource (or “resolves” it) if its URL changes, ensuring that people who use the PID can always access what they’re looking for. This means, for instance, that even if the URL for an article changes, the DOI never will.

Here’s an example: When a researcher deposits their dataset in the University of Iowa’s institutional repository, Iowa Research Online (IRO), a DOI for the dataset is reserved. A data librarian at UI Libraries will then review and curate the data. When the creator of the dataset gives the green light, IRO will register the dataset, its corresponding metadata, and the DOI with an organization called DataCite, activating the DOI and telling DataCite the landing page for the dataset (i.e., the URL that describes the data and provides access to the files). If IRO reorganized its servers in the future and the URL of the landing page changed, IRO would update this “address change” with DataCite. This ensures that the DOI will still point people to the new, correct URL.
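
The lifecycle described above (reserve, activate, then update the address) can be sketched in a few lines of Python. This is only a toy model: `ToyDoiRegistry`, the DOI, and the URLs are made up for illustration and are not DataCite's or IRO's actual systems or API. The point it shows is that the DOI string stays fixed while the landing page it resolves to can change.

```python
from dataclasses import dataclass, field


@dataclass
class ToyDoiRegistry:
    """Hypothetical in-memory stand-in for a DOI registration agency."""

    _records: dict = field(default_factory=dict)

    def reserve(self, doi: str) -> None:
        # A reserved DOI exists but does not resolve yet.
        self._records[doi] = {"url": None, "active": False}

    def register(self, doi: str, landing_url: str) -> None:
        # Activating the DOI records where the landing page lives.
        self._records[doi] = {"url": landing_url, "active": True}

    def update_url(self, doi: str, new_url: str) -> None:
        # The "address change": the DOI is untouched, only its target moves.
        record = self._records[doi]
        if not record["active"]:
            raise ValueError(f"{doi} has not been activated")
        record["url"] = new_url

    def resolve(self, doi: str) -> str:
        record = self._records.get(doi)
        if record is None or not record["active"]:
            raise LookupError(f"{doi} does not resolve")
        return record["url"]


registry = ToyDoiRegistry()
doi = "10.1234/example-dataset"  # made-up DOI for illustration
registry.reserve(doi)
registry.register(doi, "https://repository.example.edu/old/path/123")
before = registry.resolve(doi)

# The repository reorganizes its servers and reports the new address.
registry.update_url(doi, "https://repository.example.edu/new/path/123")
after = registry.resolve(doi)
print(doi, "->", after)
```

Anyone who cited the DOI before the move still reaches the dataset afterward, because only the registry's record changed.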

The image below demonstrates how the DOI for a dataset resolves to the specific URL in IRO.

Diagram with DOI and corresponding citation on the left, and URL and corresponding landing page on right.
Image adapted from: Nosé, M. (2019). “Practice of research data management in solar-terrestrial physics” [PowerPoint slides]. Institute for Space-Earth Environment Research, Nagoya University. https://slideplayer.com/slide/17406586/

Another important benefit of using persistent identifiers is connection: linking the digital object to everything else that’s associated with it through PIDs, like the people who contributed to the work (through ORCIDs), the places they work (through Research Organization Registry IDs, or RORs), the cell lines they used in the research process (through RRIDs), and articles that use the data (through DOIs). 

PIDs thus create a stable network of linked data that makes research outputs more FAIR (Findable, Accessible, Interoperable, and Reusable), helping people find new connections between data, articles, researchers, institutions, and granting bodies. A researcher who finds your dataset might then be curious if you’re the same person who wrote an article on a similar topic they read in a journal. This might lead them to click your ORCID to discover other research you’ve done, pointing them to the DOI of another dataset you’ve published, which they discover they could use for their own project. This discovery was facilitated by PIDs.
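
The discovery path in that scenario (dataset, then ORCID, then a second dataset) is essentially a walk through a graph of linked identifiers. Here is a minimal sketch, assuming a hand-built dictionary of links; the identifiers are placeholders rather than real PIDs, and `hops_from` is a hypothetical helper, not part of any PID service.

```python
from collections import deque

# Placeholder identifiers, not real PIDs. Each key lists the PIDs that its
# metadata record points to.
LINKS = {
    "doi:dataset-a": ["orcid:researcher-1", "rrid:cell-line-1"],
    "orcid:researcher-1": [
        "doi:dataset-a",
        "doi:article-1",
        "doi:dataset-b",
        "ror:university-1",
    ],
    "doi:article-1": ["orcid:researcher-1", "doi:dataset-a"],
    "doi:dataset-b": ["orcid:researcher-1"],
    "rrid:cell-line-1": [],
    "ror:university-1": [],
}


def hops_from(start, links):
    """Breadth-first walk: map each reachable PID to its distance from start."""
    distances = {start: 0}
    queue = deque([start])
    while queue:
        current = queue.popleft()
        for neighbor in links.get(current, []):
            if neighbor not in distances:
                distances[neighbor] = distances[current] + 1
                queue.append(neighbor)
    return distances


hops = hops_from("doi:dataset-a", LINKS)
print(hops["doi:dataset-b"])  # 2: dataset -> ORCID -> second dataset
```

Without the ORCID link in the middle, the second dataset would not be reachable from the first at all.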

The image below offers an example of how PIDs connect these different entities.

Diagram showing how 3 researchers with ORCIDs create datasets with DOIs, using cell lines with RRIDs, writing articles with DOIs.

You can do a few things to make the most of PIDs and leverage these connections:

  • Most importantly, deposit your data and code into repositories that support PIDs. This enables these vital connections between your dataset and the people, articles, institutions, and granting bodies associated with it. These valuable links are the reason the National Institutes of Health strongly encourages using PID-friendly repositories in its guidelines for Data Management and Sharing Plans.
  • Register for an ORCID if you don’t already have one, and use it in your repository deposits, code documentation, CV, personal website, grant applications, and anywhere else you can. It connects others to your research and keeps people from confusing you with another researcher with a similar name. The ORCID system will harvest and connect your research outputs – meaning you don’t have to.
  • Use RRIDs in your articles if the journal you’re submitting to supports them. The RRID website includes a list of journals that specifically ask for RRIDs (including Cell, Nature, and PLOS One) as well as a list of journals that will include them in articles if the authors add them.
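
If you paste your ORCID into code documentation or deposit forms, a quick check can catch typos. The final character of an ORCID iD is a check digit computed with the ISO 7064 MOD 11-2 algorithm, so a mistyped iD usually fails validation. The sketch below is illustrative only; the function names are our own, and it confirms that an iD is well formed, not that it belongs to a particular person.

```python
def orcid_check_digit(base_digits: str) -> str:
    """Compute the ISO 7064 MOD 11-2 check character for the first 15 digits."""
    total = 0
    for ch in base_digits:
        total = (total + int(ch)) * 2
    result = (12 - total % 11) % 11
    return "X" if result == 10 else str(result)


def is_valid_orcid(orcid: str) -> bool:
    """Return True if the iD has 16 characters and a matching check digit."""
    compact = orcid.replace("-", "").upper()
    if len(compact) != 16:
        return False
    base = compact[:15]
    if not (base.isascii() and base.isdigit()):
        return False
    return orcid_check_digit(base) == compact[15]


# 0000-0002-1825-0097 is the sample iD used in ORCID's own documentation.
print(is_valid_orcid("0000-0002-1825-0097"))
print(is_valid_orcid("0000-0002-1825-0098"))  # last digit mistyped
```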

Each of these steps is small and may not seem impactful on its own, but when all researchers do a few small things to enable FAIR data through PIDs, the entire research ecosystem benefits. If you’d like to learn more about repositories like IRO that support PIDs, reach out to Research Data Services in the Scholarly Impact Department of UI Libraries by email or through our website.

Posted in Research Data, Scholarly Impact, Uncategorized. Tagged persistent identifiers, PIDs, research data
Feb 27 2023

Data Curation: Adding Value to Your Dataset

Posted on February 27, 2023 by Nancy Henke

You may already know that the University of Iowa’s institutional repository, Iowa Research Online (IRO), provides both preservation and access to your dataset for the long term. You may not know, however, that Research Data Services in the Scholarly Impact Department at the UI Libraries also offers another key service to researchers depositing their data in IRO: data curation.

Although curation (from the Latin for “care” or “attention”) might be a practice you associate with historical artifacts or priceless paintings, data curation is a collaborative, value-added process that provides care and attention to a dataset. It helps make data more FAIR (findable, accessible, interoperable, and reusable).

When you deposit your dataset in IRO, a data librarian at the Libraries will work with you to ensure that it is as complete, understandable, and accessible as possible. Data curation is different from peer review; its purpose is to ensure that the data can be found and used, not to judge the scientific methods that went into its creation.

Think of data curation as an investment: working with a data librarian up front can get you a great return. It gives you a dataset that’s more valuable to a potential user because it’s easier to find, use, and interpret. This is a service that sets IRO apart from many other repositories, most of which simply don’t have the staff to offer this value-added process.

Depending on the specific dataset, data curation may entail:

  • checking to ensure all files open properly
  • reviewing file naming and organization strategies to ensure they’re transparent for future users
  • identifying proprietary file formats and recommending open alternatives
  • analyzing documentation, like data dictionaries or README files, to ensure others with knowledge of the discipline can understand them
  • ensuring tabular data in spreadsheets is clean, organized, and optimized for reuse
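
Some of these checks are easy to run on your own before you deposit. The sketch below is a hypothetical pre-check, not a tool the Libraries provide: it looks for a README, flags file extensions tied to specific software, and notes file names containing spaces. The extension-to-alternative mapping is an illustrative assumption, not an official list.

```python
from pathlib import PurePath

# Illustrative mapping only: formats tied to specific software and one
# possible open alternative for each.
OPEN_ALTERNATIVES = {".sav": ".csv", ".xls": ".csv", ".doc": ".txt"}


def precheck(filenames):
    """Return a small report of things a curator is likely to ask about."""
    report = {"missing_readme": True, "proprietary": {}, "names_with_spaces": []}
    for name in filenames:
        path = PurePath(name)
        if path.stem.lower() == "readme":
            report["missing_readme"] = False
        suffix = path.suffix.lower()
        if suffix in OPEN_ALTERNATIVES:
            report["proprietary"][name] = OPEN_ALTERNATIVES[suffix]
        if " " in path.name:
            report["names_with_spaces"].append(name)
    return report


report = precheck(["README.txt", "survey results final.xls", "responses.csv"])
print(report)
```

A script like this only covers the mechanical part; judging whether documentation is understandable to others in the discipline still takes a human reader.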

Confirming that the dataset has rich and complete metadata attached to it is an essential aspect of data curation. Metadata is something librarians talk about often, and with good reason. It’s the information needed for others to find, understand, and use the data. Different types of metadata – descriptive, administrative, and technical – all add value to your data in different ways, and a data librarian can help ensure the thoroughness of it all.

Descriptive metadata, for instance, is vital for discoverability (i.e., ensuring your data will appear in the results when someone does a relevant search on Google or InfoHawk+, the University of Iowa Libraries’ discovery tool). It includes a clear and distinct title for the dataset, the names of all collaborators with their contact information and ORCIDs (Open Researcher and Contributor IDs), and a full abstract.

A data librarian can also help with technical metadata, like specifying the type of software someone needs to open the file, and administrative metadata, such as helping choose the license for the dataset. This information helps a potential user know how to get access to your data and what they can do with it once they do.

If you’d like help with data curation or depositing your data into IRO, Research Data Services is here to help. Contact us by email or visit our website to set up a consultation. If you’re ready to deposit data in IRO, we’ve created a metadata guide and a data deposit guide to walk you through the steps.

Posted in Research Data, Scholarly Impact, Uncategorized. Tagged Iowa Research Online, research data
Feb 20 2023

Why Iowa Research Online is an Ideal Place for Your Data

Posted on February 20, 2023 (updated February 28, 2023) by Nancy Henke

University of Iowa researchers are increasingly taking advantage of the university’s institutional repository, Iowa Research Online (IRO), to house their research and creative works. IRO currently holds nearly 115,000 research outputs from Iowa faculty, staff, and students, and has seen more than 12 million downloads of content since 2009. On top of preserving articles, books, conference proceedings, theses, and dissertations, IRO is also an ideal place for researchers to deposit their datasets and code.

Image: “Research Data Management” by Janneke Staaks, CC BY-NC 2.0

IRO provides preservation, access, and curation of your data. Here’s what that means in practice:

Preservation

Your dataset or code will be housed on a secure server for the long term, maintaining it for future use. Despite the perception that digital files never wear out, they can deteriorate. Sometimes this happens due to bit loss – when the binary code that makes up the file degrades as the data is transferred from one place to another – or sometimes because of corrupted files. In other cases, file formats evolve and need to be converted to a different format to enable access and use.

The items in IRO are proactively managed to guard against these situations. Regular fixity checks act as “check-ups” for files to ensure they’re healthy and haven’t changed, multiple copies of the data are archived in different geographic locations in the case of a natural disaster, and corrupted files have self-healing capabilities thanks to the cloud infrastructure that houses them. All of this ensures that the products of your hard work are available now and in the future.
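
A fixity check generally works by recording a checksum for each file and later recomputing it to see whether anything has changed. The following is a minimal sketch of that idea using SHA-256 from Python's standard library. It is not IRO's actual implementation; the function names and the sample file are assumptions made for illustration.

```python
import hashlib
import tempfile
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Hash a file in chunks so large files do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def build_manifest(folder: Path) -> dict:
    """Record a checksum for every file directly inside the folder."""
    return {p.name: sha256_of(p) for p in sorted(folder.iterdir()) if p.is_file()}


def fixity_check(folder: Path, manifest: dict) -> list:
    """Return the names of files that are missing or no longer match."""
    failed = []
    for name, expected in manifest.items():
        path = folder / name
        if not path.is_file() or sha256_of(path) != expected:
            failed.append(name)
    return sorted(failed)


with tempfile.TemporaryDirectory() as tmp:
    folder = Path(tmp)
    (folder / "data.csv").write_text("id,value\n1,42\n")
    manifest = build_manifest(folder)
    clean = fixity_check(folder, manifest)    # nothing has changed yet

    (folder / "data.csv").write_text("id,value\n1,43\n")  # simulate corruption
    changed = fixity_check(folder, manifest)

print(clean, changed)
```

Even a one-character change produces a different checksum, which is what lets a repository notice silent corruption and restore the file from another copy.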

Using open formats for your work also helps with preservation, since these formats – like .csv, .mp3, and .mp4, to name a few – are more likely to remain functional in the future. In fact, using open formats also facilitates another benefit of archiving in IRO, access, since the files don’t require specialized proprietary software to open and use them.

Access

Your dataset will be accessible to researchers all over the world, increasing the reach and impact of your work. This access is made possible by a few key features.

First, all IRO deposits have a metadata record with pertinent information about the dataset – like the title, collaborators, abstract, grant information, dates of data collection, etc. Since all items in the repository are discoverable on Google and are indexed and searchable in InfoHawk+, the University of Iowa Libraries’ discovery tool, robust metadata makes it more likely that your dataset will appear in relevant searches. This is vital for helping others find your work.

Equally important is the stable, persistent URL your dataset will receive. Since the URL won’t change, it eliminates the tedium of identifying and updating broken links on your CV or personal website and makes it easier for you to share your work with others. And if you’re ever curious about the number of views and downloads a dataset receives, the metrics are readily available.

All IRO deposits also receive a digital object identifier (DOI), which makes it easy for others to cite your work when they use it and ensures you get credit when they do. When you deposit your data, you can also link to the DOI of the article or articles where the data is used. Research Data Services in the Scholarly Impact Department at the UI Libraries can even reserve a DOI for your dataset and keep it inactive so you can put the citation in a manuscript during the peer review process. After the article is published, just ask us and we’ll activate it.

Curation

A data librarian at the University of Iowa Libraries will also help curate your data when you deposit it in IRO. In addition to ensuring your file names and organization strategies are understandable to potential users, the librarian can also help you find open formats for your files, look at your documentation, and assist with the all-important metadata record that helps others find your work.

Get Started

Ultimately, depositing your data and code in IRO is a win-win. It helps you preserve your research outputs, disseminate them to increase the influence of your work, and enable scholars the world over to find and use your data and code for their own projects. And now that the National Institutes of Health requires that researchers identify appropriate repositories for their data in their Data Management and Sharing Plans, IRO could be the answer – especially if no discipline-specific repositories exist in your field. IRO’s preservation, access, and curation features put it a step above other generalist repositories.

If you want help depositing your data or code in IRO or have questions about choosing a repository, Research Data Services is here to assist you. We have a guide on our website that walks you through the steps to upload your content to IRO, we offer one-on-one consultations, and we are available by email, too.

Posted in Research Data, Scholarly Impact. Tagged Iowa Research Online, research data
