Appraising
(data assessment, evaluation, review)
In this stage of the lifecycle, a data deposit is reviewed for completeness and compliance with a repository’s terms of use, and the overall FAIRness of the deposit may be assessed. A curator may work with the research team to enhance the usability and accessibility of the dataset. The Data Curation Network’s CURATED checklist is one framework that will guide a curator through the curation process. Steps include checking and understanding the data files and associated documentation, requesting missing information, augmenting metadata for findability, transforming files if necessary, evaluating the FAIRness of a deposit, and documenting all curation activities.
Some of the tasks a curator may perform:
- Check datasets for compliance with the repository terms of service
- Perform quality assurance through file audit and code review
- Assess files for disclosure risk or intellectual property concerns
- Review, enhance, or create documentation
- Help select an appropriate license for the data
- Make recommendations on file formats for accessibility in the short and long term
- Verify metadata quality and augment or restructure to facilitate discoverability and interoperability
- Request missing or ambiguous information or changes from data creators
Relevant tools and resources:
- DataOne Data Management Skillbuilding Hub
- GBIF Generalizing Sensitive Species Occurrence Data
- Sensitive Data Toolkit for Researchers Part 2 - Human Participant Research Data Risk Matrix
- Sensitive Data Toolkit for Researchers Part 1 - Glossary of Terms for Sensitive Data used for Research Purposes
- Data Curation Network Data Curation Primers
- DCC Curation Briefing Papers
- SATIFYD
- CSIRO 5-star Data Rating Tool
- Format Identification for Digital Objects (FIDO)
- UK Data Archive - Research Data Management
- Queen's University Library Research Data Management (RD) Workflow
- Primer on Data Management - What You Always Wanted to Know
- Portage Dataverse North Metadata Best Practices Guide - Version 2.0
- OpenRefine
- JHOVE
- DDC Glossary of frequently used terminology
- DCN Data Curation Tools List
- DCN curation resources
- DCN CURATE(D) model
- DCC Curation Reference Manual
- DCC Curation Lifecycle Model (with checklists)
- DCC Curation How-to Guides and Checklists
- BitCurator
- ARDC FAIR self assessment tool
- Apache Tika
- ANDS File Formats
- ANDS Curation Continuum