Before data are submitted an archive agreement (86 KB) must be signed by archive and the depositor. This agreement covers arrangements regarding usage rights, authenticity, data protection responsibilities, and disposal. The depositor also selects an access category (Usage Regulations) defining conditions under which data and documentation are released.
Data can be submitted when outstanding legal questions have been answered and it’s established the depositor has authority to transfer non-exclusive rights for archiving and dissemination to the archive.
The depositor compiles a Submission Information Package (SIP) which should include the following:
PDF files must be free of protection otherwise they cannot be processed, e.g. for the migration into other data formats.
To support the long-term preservation, interpretability, and accessibility of data, choosing suitable file formats is of particular importance. Just like hardware, software constantly evolves. For example, new functions are added to software programs, or software is adapted to new operating systems. Both can lead to changes in the file format. In consequence, digital data is constantly at risk from changes in the hard- and software environment. This risk can be mitigated if suitable file formats are used.
The GESIS Data Archive recommends using the following formats for the most important object classes:
Submitted datasets should be usable in one of the widely used statistical packages (SPSS, Stata or SAS). More specifically, data can be submitted in the following forms:
1. As so-called system files in the proprietary formats of common statistical packages (e.g. SPSS System File).
2. In software-specific portable file formats (e.g. SAS Transport File).
3. As text files (comma-, tab-delimited formats) with the required setup or syntax files to enable importing into statistical packages.
|Type of data||Preferred formats||Acceptable formats|
The Data Archive will accept additional formats, especially for data, which (upon consultation) can be converted into preferred formats for preservation. Regardless of the specific file format datasets should always be structured in a manner allowing third parties to read and understand them.
Data may therefore not be encrypted. In addition, functions such as printing or copying should not be disabled.
The Submission Information Package (SIP) can be submitted to the archive as follows: