- Find Information
- Research Guides
- Research Data Management
Research Data Management
Repositories
Data repositories are great for storing and promoting data. There are hundreds of repositories with options available for all datasets.
Lists of Repositories (search by discipline):
- Registry of Research Data Repositories (re3data)Comprehensive registry of data repositories, browsable by discipline
- OpenDOARAuthoritative directory of academic open access repositories
- Nature.com Data RepositoriesList of repositories for natural sciences as well as general purpose
- Open Access Directory (OAD)Extensive list of open data repositories maintained by Simmons University
Data Repositories:
- Harvard DataverseOpen source research data repository.
- DRYADFree general purpose repository for a wide diversity of data types
- FigshareOpen access data repository. Store, share, discover.
Preserving your data
Long-term preservation is the final stage in the data life cycle but requires some planning and maintenance from the project onset. Different from storage, preservation may require data to be migrated from format to format as newer storage models emerge while retaining its integrity. Some things to consider when developing a preservation plan include:
- Funder/journal requirements
- Some funders and journals have preservation requisites
- Longevity of preservation
- How long should it be kept?
- Preservation responsibility
- Who will maintain and/or fund preservation?
- Documentation
- Is there sufficient documentation and context so that others can use the data without you?
- File format
- As previously covered, are the files in sustainable formats?
- Repository
- Will you be using a repository? If not, what is the shelf-life of the hardware being used?
Image by Elco van Staveren, from Flickr
Sharing your data
It is increasingly common for major funding agencies and many journals to stipulate Open Data/Data Sharing as part of their requirements for funding or publication.
In addition to meeting publisher and funder requirements, open data also:
- Increases transparency
- Improves reproducibility & reliability
- Facilitates research expansion
- Creates opportunities for collaboration
- Increases impact by increasing citation rates
- Allows data to be used for educational purposes
Repositories exist to collect and preserve scholarly data. Data deposited in a repository helps facilitate not only preservation, but also discovery and citation. Just keep in mind that not every subject repository will accept all data nor are all repositories suited for long-term preservation.
Image by Ansonlobo, from Wikimedia Commons
Unique Identifiers
Dataset identifiers will allow your data to be referenced and shared. Data identifiers must be globally unique and persistent: they must not be repeated elsewhere and they must not change over time.
Identifier schemes:
- URIA universal set of names in registered name spaces and addresses referring to registered protocols or name space
- PURLPersistent URLs provide permanent addresses for resources on the web
- DOIProvides a technical and social infrastructure for the registration and use of persistent interoperable identifiers, called DOIs, for use on digital network
- IUPACA non-proprietary identifier for chemical substances that can be used in printed and electronic data sources thus enabling easier linking of diverse data compilations