Open access to digital information
A robust infrastructure is needed
With approximately 1.5 billion objects to be digitised, bringing natural science collections to the information age is expected to result in 90 petabytes of new data over the next decades, used on average by 5,000 – 15,000 unique users every day. A robust technical infrastructure is required to support working with large digital datasets over their entire research data life cycle and to provide unified open access to the digital information.
The three building blocks of the technical infrastructure:
Repositories with data provided by the DiSSCo Facilities
The Infrastructure will link data that is provided by the DiSSCo facilities in trusted repositories. These may include local institutional repositories as well as global thematic repositories such as GBIF. All data that can be linked to collection objects (specimens) are in scope.
Digital Object Infrastructure
The data will be linked through a Digital Object (DO) infrastructure. For this it is planned to use CORDRA as a Natural Specimen Identifier Repository (NSIDR) and nsidr.org is being used as a sandbox to demonstrate and develop it. The DO infrastructure will include tools for federation and linkage as well as services to support annotation and enrichment of the data by the scientific community. It will draw upon common services provided, e.g. global and European Open Science Cloud services for authentication and authorization.
The infrastructure will provide community services to discover, consume and interact with the federated Digital Specimen data. Part of these services will be provided in collaboration with other research infrastructures to enable innovative services for multi-disciplinary science.