A data repository allows researchers to upload and publish their data, thereby making the data available for other researchers to re-use. Similarly, a data archive allows users to deposit and publish data but will generally offer greater levels of curation to community standards, have specific guidelines on what data can be deposited and is more likely to offer long-term preservation as a service. Sometimes the terms data repositories and data archives are used interchangeably.
A data repository or archive will provide services such as:
Choose early so that you can familiarise yourself with the repository’s requirements. Requirements may include depositing in certain file formats, or using a specific metadata standard; and the inclusion of documentation to help describe your data. Understanding such requirements will enable you to design your data collection materials for easier metadata and documentation creation.
Initial Questions
Ask:
Other questions may pertain depending on your requirements. For more information see the UK’s Digital Curation Centre’s checklist.
Re3data.org
This is the primary place to locate a data repository. Search by specific research discipline and then filter by access categories, data usage licenses, whether the repository gives the data a persistent identifier etc. Re3data uses a series of symbols to indicate these key services. E.g.:
To be registered in re3data.org a research data repository must:
If there is no disciplinary-specific repository in your area select a general repository. These can handle a variety of different data types. Charges may apply but can be included in a funding application. Key general repositories are listed in the table below. This list is for information purposes only and is not exhaustive:
GitHub is the main platform for hosting and reviewing code:
GitHub offers a number of advantages such as assigning DOIs (which facilitates discoverability and citeability) and allowing integration from Zenodo and FigShare repositories to enable the citing of your GitHub repository in academic literature.