This repository holds the prepared [data set](https://gitlab.hzdr.de/hifis/overall/surveys/survey-2021/data/-/blob/master/data-hifis-survey-2021.csv) of responses to the HIFIS Survey 2021.
Data preparation included the following steps after manual screening of free text responses:
* Responses of 8 participants were excluded from subset of software development related question: {100, 324, 266, 75, 442, 566, 506, 572}
* Responses of 4 participants were excluded from data set due to inconsistent responses: {241, 449, 349, 501}
* New catgories were added: {'Q010_SQ009': 'it problems/network problems/remote access problems', 'Q010_SQ010': 'operation/maintenance of software services'}
* New catgories were added: {'Q011_SQ010': 'characteristic-based', 'Q011_SQ011': 'collaboration tools/sync & share', 'Q011_SQ012': 'communication tools/conferencing'}
* New catgories were added: {'Q013_SQ010': 'public authorities', 'Q013_SQ011': 'research institutes/networks'}
* New catgories were added: {'Q014_SQ010': 'ability to be self hosted & interoperability', 'Q014_SQ011': 'information security', 'Q014_SQ012': 'usability/user friendliness'}
* New catgories were added: {'Q021_SQ038': 'fortran', 'Q021_SQ039': 'labview', 'Q021_SQ040': 'mathematica', 'Q021_SQ041': 'vhdl'}
* New catgories were added: {'Q024_SQ007': 'hands-on experience'}
* New catgories were added: {'Q043_SQ012': 'automation', 'Q043_SQ013': 'data handling software', 'Q043_SQ014': 'data management software'}
* Raw free text responses were removed after categorization
Details can be found in the [data preparation script](https://gitlab.hzdr.de/hifis/overall/surveys/survey-2021/data/-/blob/master/data-preparation.py) or the [corresponding issue](https://gitlab.hzdr.de/hifis/overall/surveys/survey-2021/data/-/issues/2)